Yannic Kilcher Videos (Audio Only) Podcast Republic

Yannic Kilcher Videos (Audio Only)

By Yannic Kilcher

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.

Image by Yannic Kilcher

Open Website

Rate for this podcast

Subscribers: 10
Reviews: 0
Episodes: 177

Description

I make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society. Twitter: https://twitter.com/ykilcher Discord: https://discord.gg/4H8xxDF If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this): SubscribeStar (preferred to Patreon): https://www.subscribestar.com/yannickilcher Patreon: https://www.patreon.com/yannickilcher Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq

Episode	Date
Efficient Streaming Language Models with Attention Sinks (Paper Explained) Read the full episode description	Oct 17, 2023
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained) Read the full episode description	Oct 17, 2023
Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained) Read the full episode description	Oct 05, 2023
Reinforced Self-Training (ReST) for Language Modeling (Paper Explained) Read the full episode description	Oct 05, 2023
[ML News] LLaMA2 Released \| LLMs for Robots \| Multimodality on the Rise Read the full episode description	Aug 28, 2023
How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich) Read the full episode description	Aug 28, 2023
Recipe AI suggests FATAL CHLORINE GAS Recipe Read the full episode description	Aug 28, 2023
DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors) Read the full episode description	Aug 28, 2023
[ML News] GPT-4 solves MIT Exam with 100% ACCURACY \| OpenLLaMA 13B released Read the full episode description	Aug 28, 2023
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained) Read the full episode description	Aug 28, 2023
RWKV: Reinventing RNNs for the Transformer Era (Paper Explained) Read the full episode description	Aug 28, 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review) Read the full episode description	Aug 28, 2023
OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman) Read the full episode description	Aug 28, 2023
[ML News] Geoff Hinton leaves Google \| Google has NO MOAT \| OpenAI down half a billion Read the full episode description	Aug 28, 2023
Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained) Read the full episode description	Aug 28, 2023
OpenAssistant RELEASED! The world's best open-source Chat AI! Read the full episode description	Aug 28, 2023
OpenAssistant First Models are here! (Open-Source ChatGPT) Read the full episode description	Aug 28, 2023
The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more) Read the full episode description	Aug 28, 2023
GPT-4 is here! What we know so far (Full Analysis) Read the full episode description	Aug 28, 2023
This ChatGPT Skill will earn you $10B (also, AI reads your mind!) Read the full episode description	Aug 28, 2023
LLaMA: Open and Efficient Foundation Language Models (Paper Explained) Read the full episode description	Aug 28, 2023
Open Assistant Inference Backend Development (Hands-On Coding) Read the full episode description	Aug 28, 2023
OpenAssistant - ChatGPT's Open Alternative (We need your help!) Read the full episode description	Aug 28, 2023
ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress) Read the full episode description	Jan 02, 2023
[ML News] GPT-4 Rumors \| AI Mind Reading \| Neuron Interaction Solved \| AI Theorem Proving Read the full episode description	Nov 30, 2022
CICERO: An AI agent that negotiates, persuades, and cooperates with people Read the full episode description	Nov 30, 2022
[ML News] Multiplayer Stable Diffusion \| OpenAI needs more funding \| Text-to-Video models incoming Read the full episode description	Nov 23, 2022
The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.) Read the full episode description	Nov 23, 2022
ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview) Read the full episode description	Nov 23, 2022
Neural Networks are Decision Trees (w/ Alexander Mattick) Read the full episode description	Oct 23, 2022
This is a game changer! (AlphaTensor by DeepMind explained) Read the full episode description	Oct 23, 2022
[ML News] Stable Diffusion Takes Over! (Open Source AI Art) Read the full episode description	Oct 23, 2022
How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit Read the full episode description	Oct 23, 2022
More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt) Read the full episode description	Sep 15, 2022
The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!) Read the full episode description	Sep 07, 2022
The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi) Read the full episode description	Aug 29, 2022
The Man behind Stable Diffusion Read the full episode description	Aug 29, 2022
[ML News] BLOOM: 176B Open-Source \| Chinese Brain-Scale Computer \| Meta AI: No Language Left Behind Read the full episode description	Aug 03, 2022
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained) Read the full episode description	Jul 10, 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained) Read the full episode description	Jun 28, 2022
Parti - Scaling Autoregressive Models for Content-Rich Text-to-Image Generation (Paper Explained) Read the full episode description	Jun 28, 2022
Did Google's LaMDA chatbot just become sentient? Read the full episode description	Jun 20, 2022
[ML News] DeepMind's Flamingo Image-Text model \| Locked-Image Tuning \| Jurassic X & MRKL Read the full episode description	May 16, 2022
[ML News] Meta's OPT 175B language model \| DALL-E Mega is training \| TorToiSe TTS fakes my voice Read the full episode description	May 12, 2022
This A.I. creates infinite NFTs Read the full episode description	May 12, 2022
Author Interview: SayCan - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances Read the full episode description	May 12, 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (SayCan - Paper Explained) Read the full episode description	May 02, 2022
Author Interview - ACCEL: Evolving Curricula with Regret-Based Environment Design Read the full episode description	May 02, 2022
ACCEL: Evolving Curricula with Regret-Based Environment Design (Paper Review) Read the full episode description	May 02, 2022
LAION-5B: 5 billion image-text-pairs dataset (with the authors) Read the full episode description	Apr 25, 2022
Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors) Read the full episode description	Apr 25, 2022
Author Interview - Transformer Memory as a Differentiable Search Index Read the full episode description	Apr 21, 2022
Transformer Memory as a Differentiable Search Index (Machine Learning Research Paper Explained) Read the full episode description	Apr 21, 2022
[ML News] Google's 540B PaLM Language Model & OpenAI's DALL-E 2 Text-to-Image Revolution Read the full episode description	Apr 12, 2022
The Weird and Wonderful World of AI Art (w/ Author Jack Morris) Read the full episode description	Apr 06, 2022
Author Interview - Improving Intrinsic Exploration with Language Abstractions Read the full episode description	Apr 06, 2022
Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained) Read the full episode description	Apr 06, 2022
[ML News] GPT-3 learns to edit \| Google Pathways \| Make-A-Scene \| CLIP meets GamePhysics \| DouBlind Read the full episode description	Apr 06, 2022
Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment Read the full episode description	Mar 30, 2022
Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained) Read the full episode description	Mar 30, 2022
Author Interview - Typical Decoding for Natural Language Generation Read the full episode description	Mar 28, 2022
Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!) Read the full episode description	Mar 28, 2022
One Model For All The Tasks - BLIP (Author Interview) Read the full episode description	Mar 25, 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation Read the full episode description	Mar 25, 2022
[ML News] AI Threatens Biological Arms Race Read the full episode description	Mar 22, 2022
Active Dendrites avoid catastrophic forgetting - Interview with the Authors Read the full episode description	Mar 21, 2022
Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review) Read the full episode description	Mar 21, 2022
Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis Read the full episode description	Mar 17, 2022
VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained) Read the full episode description	Mar 14, 2022
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents Read the full episode description	Mar 10, 2022
First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning) Read the full episode description	Mar 08, 2022
OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained) Read the full episode description	Mar 08, 2022
[ML News] DeepMind controls fusion \| Yann LeCun's JEPA architecture \| US: AI can't copyright its art Read the full episode description	Mar 08, 2022
AlphaCode - with the authors! Read the full episode description	Mar 08, 2022
Competition-Level Code Generation with AlphaCode (Paper Review) Read the full episode description	Mar 02, 2022
Can Wikipedia Help Offline Reinforcement Learning? (Author Interview) Read the full episode description	Mar 02, 2022
Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained) Read the full episode description	Mar 02, 2022
[ML Olds] Meta Research Supercluster \| OpenAI GPT-Instruct \| Google LaMDA \| Drones fight Pigeons Read the full episode description	Mar 02, 2022
[ML News] Uber: Deep Learning for ETA \| MuZero Video Compression \| Block-NeRF \| EfficientNet-X Read the full episode description	Feb 24, 2022
Listening to You! - Channel Update (Author Interviews) Read the full episode description	Feb 22, 2022
All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author) Read the full episode description	Feb 21, 2022
CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview) Read the full episode description	Feb 21, 2022
AI against Censorship: Genetic Algorithms, The Geneva Project, ML in Security, and more! Read the full episode description	Feb 17, 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author) Read the full episode description	Feb 16, 2022
[ML News] DeepMind AlphaCode \| OpenAI math prover \| Meta battles harmful content with AI Read the full episode description	Feb 16, 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author) Read the full episode description	Feb 16, 2022
OpenAI Embeddings (and Controversy?!) Read the full episode description	Feb 16, 2022
Unsupervised Brain Models - How does Deep Learning inform Neuroscience? (w/ Patrick Mineault) Read the full episode description	Feb 16, 2022
GPT-NeoX-20B - Open-Source huge language model by EleutherAI (Interview w/ co-founder Connor Leahy) Read the full episode description	Feb 16, 2022
Predicting the rules behind - Deep Symbolic Regression for Recurrent Sequences (w/ author interview) Read the full episode description	Feb 02, 2022
IT ARRIVED! YouTube sent me a package. (also: Limited Time Merch Deal) Read the full episode description	Jan 28, 2022
[ML News] ConvNeXt: Convolutions return \| China regulates algorithms \| Saliency cropping examined Read the full episode description	Jan 28, 2022
Dynamic Inference with Neural Interpreters (w/ author interview) Read the full episode description	Jan 24, 2022
Noether Networks: Meta-Learning Useful Conserved Quantities (w/ the authors) Read the full episode description	Jan 21, 2022
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors) Read the full episode description	Jan 20, 2022
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors) Read the full episode description	Jan 16, 2022
Full Self-Driving is HARD! Analyzing Elon Musk re: Tesla Autopilot on Lex Fridman's Podcast Read the full episode description	Jan 07, 2022
Player of Games: All the games, one algorithm! (w/ author Martin Schmid) Read the full episode description	Jan 05, 2022
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models Read the full episode description	Jan 05, 2022
[ML News] DeepMind builds Gopher \| Google builds GLaM \| Suicide capsule uses AI to check access Read the full episode description	Jan 05, 2022
Resolution-robust Large Mask Inpainting with Fourier Convolutions (w/ Author Interview) Read the full episode description	Jan 05, 2022
[ML News] DeepMind tackles Math \| Microsoft does more with less \| Timnit Gebru launches DAIR Read the full episode description	Dec 14, 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained) Read the full episode description	Dec 10, 2021
[ML News] OpenAI removes GPT-3 waitlist \| GauGAN2 is amazing \| NYC regulates AI hiring tools Read the full episode description	Dec 03, 2021
Sparse is Enough in Scaling Transformers (aka Terraformer) \| ML Research Paper Explained Read the full episode description	Dec 02, 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained) Read the full episode description	Dec 01, 2021
Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions (Paper Explained) Read the full episode description	Dec 01, 2021
Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in) Read the full episode description	Nov 26, 2021
Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev) Read the full episode description	Nov 25, 2021
Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Reivew) Read the full episode description	Nov 22, 2021
[ML News] Cedille French Language Model \| YOU Search Engine \| AI Finds Profitable MEME TOKENS Read the full episode description	Nov 22, 2021
Gradients are Not All You Need (Machine Learning Research Paper Explained) Read the full episode description	Nov 22, 2021
[ML News] Microsoft combines Images & Text \| Meta makes artificial skin \| Russians replicate DALL-E Read the full episode description	Nov 22, 2021
Autoregressive Diffusion Models (Machine Learning Research Paper Explained) Read the full episode description	Nov 11, 2021
[ML News] Google introduces Pathways \| OpenAI solves Math Problems \| Meta goes First Person Read the full episode description	Nov 11, 2021
EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained) Read the full episode description	Nov 05, 2021
[YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview) Read the full episode description	Nov 01, 2021
[ML News GERMAN] NVIDIA GTC'21 \| DeepMind kauft MuJoCo \| Google Lernt Spreadsheet Formeln Read the full episode description	Nov 01, 2021
[ML News] NVIDIA GTC'21 \| DeepMind buys MuJoCo \| Google predicts spreadsheet formulas Read the full episode description	Nov 01, 2021
I went to an AI Art Festival in Geneva (AiiA Festival Trip Report) Read the full episode description	Oct 29, 2021
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained) Read the full episode description	Oct 25, 2021
I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen Read the full episode description	Oct 25, 2021
[ML News] Microsoft trains 530B model \| ConvMixer model fits into single tweet \| DeepMind profitable Read the full episode description	Oct 21, 2021
[ML News] DeepMind does Nowcasting \| The Guardian's shady reporting \| AI finishes Beethoven's 10th Read the full episode description	Oct 11, 2021
Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained) Read the full episode description	Oct 11, 2021
How far can we scale up? Deep Learning's Diminishing Returns (Article Review) Read the full episode description	Oct 04, 2021
[ML News] Plagiarism Case w/ Plot Twist \| CLIP for video surveillance \| OpenAI summarizes books Read the full episode description	Sep 30, 2021
Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained) Read the full episode description	Sep 30, 2021
[ML News] New ImageNet SOTA \| Uber's H3 hexagonal coordinate system \| New text-image-pair dataset Read the full episode description	Sep 28, 2021
Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset Read the full episode description	Sep 24, 2021
Topographic VAEs learn Equivariant Capsules (Machine Learning Research Paper Explained) Read the full episode description	Sep 21, 2021
[ML News] Roomba Avoids Poop \| Textless NLP \| TikTok Algorithm Secrets \| New Schmidhuber Blog Read the full episode description	Sep 16, 2021
Celebrating 100k Subscribers! (w/ Channel Statistics) Read the full episode description	Sep 16, 2021
[ML News] AI predicts race from X-Ray \| Google kills HealthStreams \| Boosting Search with MuZero Read the full episode description	Sep 13, 2021
∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained) Read the full episode description	Sep 06, 2021
[ML News] Blind Chess AI Competition \| Graph NNs for traffic \| AI gift suggestions Read the full episode description	Sep 05, 2021
ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation Read the full episode description	Sep 05, 2021
[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered Read the full episode description	Aug 30, 2021
Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained) Read the full episode description	Aug 27, 2021
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained) Read the full episode description	Aug 23, 2021
NeuralHash is BROKEN \| How to evade Apple's detection and forge hash collisions (w/ Code) Read the full episode description	Aug 19, 2021
[ML News] Nvidia renders CEO \| Jurassic-1 larger than GPT-3 \| Tortured Phrases reveal Plagiarism Read the full episode description	Aug 19, 2021
How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained Read the full episode description	Aug 16, 2021
[ML NEWS] Apple scans your phone \| Master Faces beat face recognition \| WALL-E is real Read the full episode description	Aug 16, 2021
[ML News] AI-generated patent approved \| Germany gets an analog to OpenAI \| ML cheats video games Read the full episode description	Aug 09, 2021
[ML News] MMO Game destroys GPUs \| OpenAI quits Robotics \| Today w/ guest host Sanyam Bhutani Read the full episode description	Aug 09, 2021
[ML News] Facebook AI adapting robots \| Baidu autonomous excavators \| Happy Birthday EleutherAI Read the full episode description	Jul 18, 2021
[ML News] GitHub Copilot - Copyright, GPL, Patents & more \| Brickit LEGO app \| Distill goes on break Read the full episode description	Jul 13, 2021
Self-driving from VISION ONLY - Tesla's self-driving progress by Andrej Karpathy (Talk Analysis) Read the full episode description	Jul 05, 2021
[ML News] CVPR bans social media paper promotion \| AI restores Rembrandt \| GPU prices down Read the full episode description	Jul 05, 2021
The Dimpled Manifold Model of Adversarial Examples in Machine Learning (Research Paper Explained) Read the full episode description	Jun 28, 2021
[ML News] Hugging Face course \| GAN Theft Auto \| AI Programming Puzzles \| PyTorch 1.9 Released Read the full episode description	Jun 25, 2021
XCiT: Cross-Covariance Image Transformers (Facebook AI Machine Learning Research Paper Explained) Read the full episode description	Jun 25, 2021
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Paper Explained) Read the full episode description	Jun 22, 2021
[ML News] De-Biasing GPT-3 \| RL cracks chip design \| NetHack challenge \| Open-Source GPT-J Read the full episode description	Jun 22, 2021
Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained) Read the full episode description	Jun 15, 2021
[ML News] EU regulates AI, China trains 1.75T model, Google's oopsie, Everybody cheers for fraud. Read the full episode description	Jun 10, 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained) Read the full episode description	Jun 07, 2021
[ML News] Anthropic raises $124M, ML execs clueless, collusion rings, ELIZA source discovered & more Read the full episode description	Jun 07, 2021
Reward Is Enough (Machine Learning Research Paper Explained) Read the full episode description	Jun 02, 2021
Expire-Span: Not All Memories are Created Equal: Learning to Forget by Expiring (Paper Explained) Read the full episode description	May 26, 2021
FNet: Mixing Tokens with Fourier Transforms (Machine Learning Research Paper Explained) Read the full episode description	May 24, 2021
AI made this music video \| What happens when OpenAI's CLIP meets BigGAN? Read the full episode description	May 21, 2021
DDPM - Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained) Read the full episode description	May 15, 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained) Read the full episode description	May 10, 2021
MLP-Mixer: An all-MLP Architecture for Vision (Machine Learning Research Paper Explained) Read the full episode description	May 10, 2021
Is Google Translate Sexist? Gender Stereotypes in Statistical Machine Translation Read the full episode description	May 03, 2021
Perceiver: General Perception with Iterative Attention (Google DeepMind Research Paper Explained) Read the full episode description	May 03, 2021
Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained) Read the full episode description	May 03, 2021
Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained) Read the full episode description	May 03, 2021
Multimodal Neurons in Artificial Neural Networks (w/ OpenAI Microscope, Research Paper Explained) Read the full episode description	May 03, 2021
Machine Learning PhD Survival Guide 2021 \| Advice on Topic Selection, Papers, Conferences & more! Read the full episode description	May 03, 2021
PAIR AI Explorables \| Is the problem in the data? Examples on Fairness, Diversity, and Bias. Read the full episode description	May 03, 2021
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning Read the full episode description	May 03, 2021
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (ML Research Paper Explained) Read the full episode description	May 02, 2021
DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained) Read the full episode description	May 02, 2021
Why AI is Harder Than We Think (Machine Learning Research Paper Explained) Read the full episode description	May 02, 2021