Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.
Episode | Date |
---|---|
Efficient Streaming Language Models with Attention Sinks (Paper Explained)
|
Oct 17, 2023 |
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
|
Oct 17, 2023 |
Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
|
Oct 05, 2023 |
Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
|
Oct 05, 2023 |
[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
|
Aug 28, 2023 |
How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)
|
Aug 28, 2023 |
Recipe AI suggests FATAL CHLORINE GAS Recipe
|
Aug 28, 2023 |
DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
|
Aug 28, 2023 |
[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
|
Aug 28, 2023 |
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)
|
Aug 28, 2023 |
RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)
|
Aug 28, 2023 |
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)
|
Aug 28, 2023 |
OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)
|
Aug 28, 2023 |
[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion
|
Aug 28, 2023 |
Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)
|
Aug 28, 2023 |
OpenAssistant RELEASED! The world's best open-source Chat AI!
|
Aug 28, 2023 |
OpenAssistant First Models are here! (Open-Source ChatGPT)
|
Aug 28, 2023 |
The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more)
|
Aug 28, 2023 |
GPT-4 is here! What we know so far (Full Analysis)
|
Aug 28, 2023 |
This ChatGPT Skill will earn you $10B (also, AI reads your mind!)
|
Aug 28, 2023 |
LLaMA: Open and Efficient Foundation Language Models (Paper Explained)
|
Aug 28, 2023 |
Open Assistant Inference Backend Development (Hands-On Coding)
|
Aug 28, 2023 |
OpenAssistant - ChatGPT's Open Alternative (We need your help!)
|
Aug 28, 2023 |
ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)
|
Jan 02, 2023 |
[ML News] GPT-4 Rumors | AI Mind Reading | Neuron Interaction Solved | AI Theorem Proving
|
Nov 30, 2022 |
CICERO: An AI agent that negotiates, persuades, and cooperates with people
|
Nov 30, 2022 |
[ML News] Multiplayer Stable Diffusion | OpenAI needs more funding | Text-to-Video models incoming
|
Nov 23, 2022 |
The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)
|
Nov 23, 2022 |
ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
|
Nov 23, 2022 |
Neural Networks are Decision Trees (w/ Alexander Mattick)
|
Oct 23, 2022 |
This is a game changer! (AlphaTensor by DeepMind explained)
|
Oct 23, 2022 |
[ML News] Stable Diffusion Takes Over! (Open Source AI Art)
|
Oct 23, 2022 |
How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit
|
Oct 23, 2022 |
More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)
|
Sep 15, 2022 |
The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)
|
Sep 07, 2022 |
The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)
|
Aug 29, 2022 |
The Man behind Stable Diffusion
|
Aug 29, 2022 |
[ML News] BLOOM: 176B Open-Source | Chinese Brain-Scale Computer | Meta AI: No Language Left Behind
|
Aug 03, 2022 |
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)
|
Jul 10, 2022 |
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)
|
Jun 28, 2022 |
Parti - Scaling Autoregressive Models for Content-Rich Text-to-Image Generation (Paper Explained)
|
Jun 28, 2022 |
Did Google's LaMDA chatbot just become sentient?
|
Jun 20, 2022 |
[ML News] DeepMind's Flamingo Image-Text model | Locked-Image Tuning | Jurassic X & MRKL
|
May 16, 2022 |
[ML News] Meta's OPT 175B language model | DALL-E Mega is training | TorToiSe TTS fakes my voice
|
May 12, 2022 |
This A.I. creates infinite NFTs
|
May 12, 2022 |
Author Interview: SayCan - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
|
May 12, 2022 |
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (SayCan - Paper Explained)
|
May 02, 2022 |
Author Interview - ACCEL: Evolving Curricula with Regret-Based Environment Design
|
May 02, 2022 |
ACCEL: Evolving Curricula with Regret-Based Environment Design (Paper Review)
|
May 02, 2022 |
LAION-5B: 5 billion image-text-pairs dataset (with the authors)
|
Apr 25, 2022 |
Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)
|
Apr 25, 2022 |
Author Interview - Transformer Memory as a Differentiable Search Index
|
Apr 21, 2022 |
Transformer Memory as a Differentiable Search Index (Machine Learning Research Paper Explained)
|
Apr 21, 2022 |
[ML News] Google's 540B PaLM Language Model & OpenAI's DALL-E 2 Text-to-Image Revolution
|
Apr 12, 2022 |
The Weird and Wonderful World of AI Art (w/ Author Jack Morris)
|
Apr 06, 2022 |
Author Interview - Improving Intrinsic Exploration with Language Abstractions
|
Apr 06, 2022 |
Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained)
|
Apr 06, 2022 |
[ML News] GPT-3 learns to edit | Google Pathways | Make-A-Scene | CLIP meets GamePhysics | DouBlind
|
Apr 06, 2022 |
Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment
|
Mar 30, 2022 |
Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained)
|
Mar 30, 2022 |
Author Interview - Typical Decoding for Natural Language Generation
|
Mar 28, 2022 |
Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!)
|
Mar 28, 2022 |
One Model For All The Tasks - BLIP (Author Interview)
|
Mar 25, 2022 |
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation
|
Mar 25, 2022 |
[ML News] AI Threatens Biological Arms Race
|
Mar 22, 2022 |
Active Dendrites avoid catastrophic forgetting - Interview with the Authors
|
Mar 21, 2022 |
Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review)
|
Mar 21, 2022 |
Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis
|
Mar 17, 2022 |
VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)
|
Mar 14, 2022 |
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents
|
Mar 10, 2022 |
First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)
|
Mar 08, 2022 |
OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained)
|
Mar 08, 2022 |
[ML News] DeepMind controls fusion | Yann LeCun's JEPA architecture | US: AI can't copyright its art
|
Mar 08, 2022 |
AlphaCode - with the authors!
|
Mar 08, 2022 |
Competition-Level Code Generation with AlphaCode (Paper Review)
|
Mar 02, 2022 |
Can Wikipedia Help Offline Reinforcement Learning? (Author Interview)
|
Mar 02, 2022 |
Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained)
|
Mar 02, 2022 |
[ML Olds] Meta Research Supercluster | OpenAI GPT-Instruct | Google LaMDA | Drones fight Pigeons
|
Mar 02, 2022 |
[ML News] Uber: Deep Learning for ETA | MuZero Video Compression | Block-NeRF | EfficientNet-X
|
Feb 24, 2022 |
Listening to You! - Channel Update (Author Interviews)
|
Feb 22, 2022 |
All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)
|
Feb 21, 2022 |
CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)
|
Feb 21, 2022 |
AI against Censorship: Genetic Algorithms, The Geneva Project, ML in Security, and more!
|
Feb 17, 2022 |
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author)
|
Feb 16, 2022 |
[ML News] DeepMind AlphaCode | OpenAI math prover | Meta battles harmful content with AI
|
Feb 16, 2022 |
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author)
|
Feb 16, 2022 |
OpenAI Embeddings (and Controversy?!)
|
Feb 16, 2022 |
Unsupervised Brain Models - How does Deep Learning inform Neuroscience? (w/ Patrick Mineault)
|
Feb 16, 2022 |
GPT-NeoX-20B - Open-Source huge language model by EleutherAI (Interview w/ co-founder Connor Leahy)
|
Feb 16, 2022 |
Predicting the rules behind - Deep Symbolic Regression for Recurrent Sequences (w/ author interview)
|
Feb 02, 2022 |
IT ARRIVED! YouTube sent me a package. (also: Limited Time Merch Deal)
|
Jan 28, 2022 |
[ML News] ConvNeXt: Convolutions return | China regulates algorithms | Saliency cropping examined
|
Jan 28, 2022 |
Dynamic Inference with Neural Interpreters (w/ author interview)
|
Jan 24, 2022 |
Noether Networks: Meta-Learning Useful Conserved Quantities (w/ the authors)
|
Jan 21, 2022 |
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)
|
Jan 20, 2022 |
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)
|
Jan 16, 2022 |
Full Self-Driving is HARD! Analyzing Elon Musk re: Tesla Autopilot on Lex Fridman's Podcast
|
Jan 07, 2022 |
Player of Games: All the games, one algorithm! (w/ author Martin Schmid)
|
Jan 05, 2022 |
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
|
Jan 05, 2022 |
[ML News] DeepMind builds Gopher | Google builds GLaM | Suicide capsule uses AI to check access
|
Jan 05, 2022 |
Resolution-robust Large Mask Inpainting with Fourier Convolutions (w/ Author Interview)
|
Jan 05, 2022 |
[ML News] DeepMind tackles Math | Microsoft does more with less | Timnit Gebru launches DAIR
|
Dec 14, 2021 |
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained)
|
Dec 10, 2021 |
[ML News] OpenAI removes GPT-3 waitlist | GauGAN2 is amazing | NYC regulates AI hiring tools
|
Dec 03, 2021 |
Sparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explained
|
Dec 02, 2021 |
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained)
|
Dec 01, 2021 |
Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions (Paper Explained)
|
Dec 01, 2021 |
Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in)
|
Nov 26, 2021 |
Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)
|
Nov 25, 2021 |
Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Reivew)
|
Nov 22, 2021 |
[ML News] Cedille French Language Model | YOU Search Engine | AI Finds Profitable MEME TOKENS
|
Nov 22, 2021 |
Gradients are Not All You Need (Machine Learning Research Paper Explained)
|
Nov 22, 2021 |
[ML News] Microsoft combines Images & Text | Meta makes artificial skin | Russians replicate DALL-E
|
Nov 22, 2021 |
Autoregressive Diffusion Models (Machine Learning Research Paper Explained)
|
Nov 11, 2021 |
[ML News] Google introduces Pathways | OpenAI solves Math Problems | Meta goes First Person
|
Nov 11, 2021 |
EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained)
|
Nov 05, 2021 |
[YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview)
|
Nov 01, 2021 |
[ML News GERMAN] NVIDIA GTC'21 | DeepMind kauft MuJoCo | Google Lernt Spreadsheet Formeln
|
Nov 01, 2021 |
[ML News] NVIDIA GTC'21 | DeepMind buys MuJoCo | Google predicts spreadsheet formulas
|
Nov 01, 2021 |
I went to an AI Art Festival in Geneva (AiiA Festival Trip Report)
|
Oct 29, 2021 |
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)
|
Oct 25, 2021 |
I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen
|
Oct 25, 2021 |
[ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable
|
Oct 21, 2021 |
[ML News] DeepMind does Nowcasting | The Guardian's shady reporting | AI finishes Beethoven's 10th
|
Oct 11, 2021 |
Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)
|
Oct 11, 2021 |
How far can we scale up? Deep Learning's Diminishing Returns (Article Review)
|
Oct 04, 2021 |
[ML News] Plagiarism Case w/ Plot Twist | CLIP for video surveillance | OpenAI summarizes books
|
Sep 30, 2021 |
Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained)
|
Sep 30, 2021 |
[ML News] New ImageNet SOTA | Uber's H3 hexagonal coordinate system | New text-image-pair dataset
|
Sep 28, 2021 |
Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset
|
Sep 24, 2021 |
Topographic VAEs learn Equivariant Capsules (Machine Learning Research Paper Explained)
|
Sep 21, 2021 |
[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog
|
Sep 16, 2021 |
Celebrating 100k Subscribers! (w/ Channel Statistics)
|
Sep 16, 2021 |
[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero
|
Sep 13, 2021 |
∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)
|
Sep 06, 2021 |
[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions
|
Sep 05, 2021 |
ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation
|
Sep 05, 2021 |
[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered
|
Aug 30, 2021 |
Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)
|
Aug 27, 2021 |
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)
|
Aug 23, 2021 |
NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)
|
Aug 19, 2021 |
[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism
|
Aug 19, 2021 |
How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained
|
Aug 16, 2021 |
[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real
|
Aug 16, 2021 |
[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games
|
Aug 09, 2021 |
[ML News] MMO Game destroys GPUs | OpenAI quits Robotics | Today w/ guest host Sanyam Bhutani
|
Aug 09, 2021 |
[ML News] Facebook AI adapting robots | Baidu autonomous excavators | Happy Birthday EleutherAI
|
Jul 18, 2021 |
[ML News] GitHub Copilot - Copyright, GPL, Patents & more | Brickit LEGO app | Distill goes on break
|
Jul 13, 2021 |
Self-driving from VISION ONLY - Tesla's self-driving progress by Andrej Karpathy (Talk Analysis)
|
Jul 05, 2021 |
[ML News] CVPR bans social media paper promotion | AI restores Rembrandt | GPU prices down
|
Jul 05, 2021 |
The Dimpled Manifold Model of Adversarial Examples in Machine Learning (Research Paper Explained)
|
Jun 28, 2021 |
[ML News] Hugging Face course | GAN Theft Auto | AI Programming Puzzles | PyTorch 1.9 Released
|
Jun 25, 2021 |
XCiT: Cross-Covariance Image Transformers (Facebook AI Machine Learning Research Paper Explained)
|
Jun 25, 2021 |
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Paper Explained)
|
Jun 22, 2021 |
[ML News] De-Biasing GPT-3 | RL cracks chip design | NetHack challenge | Open-Source GPT-J
|
Jun 22, 2021 |
Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained)
|
Jun 15, 2021 |
[ML News] EU regulates AI, China trains 1.75T model, Google's oopsie, Everybody cheers for fraud.
|
Jun 10, 2021 |
Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)
|
Jun 07, 2021 |
[ML News] Anthropic raises $124M, ML execs clueless, collusion rings, ELIZA source discovered & more
|
Jun 07, 2021 |
Reward Is Enough (Machine Learning Research Paper Explained)
|
Jun 02, 2021 |
Expire-Span: Not All Memories are Created Equal: Learning to Forget by Expiring (Paper Explained)
|
May 26, 2021 |
FNet: Mixing Tokens with Fourier Transforms (Machine Learning Research Paper Explained)
|
May 24, 2021 |
AI made this music video | What happens when OpenAI's CLIP meets BigGAN?
|
May 21, 2021 |
DDPM - Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)
|
May 15, 2021 |
Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained)
|
May 10, 2021 |
MLP-Mixer: An all-MLP Architecture for Vision (Machine Learning Research Paper Explained)
|
May 10, 2021 |
Is Google Translate Sexist? Gender Stereotypes in Statistical Machine Translation
|
May 03, 2021 |
Perceiver: General Perception with Iterative Attention (Google DeepMind Research Paper Explained)
|
May 03, 2021 |
Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)
|
May 03, 2021 |
Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)
|
May 03, 2021 |
Multimodal Neurons in Artificial Neural Networks (w/ OpenAI Microscope, Research Paper Explained)
|
May 03, 2021 |
Machine Learning PhD Survival Guide 2021 | Advice on Topic Selection, Papers, Conferences & more!
|
May 03, 2021 |
PAIR AI Explorables | Is the problem in the data? Examples on Fairness, Diversity, and Bias.
|
May 03, 2021 |
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
|
May 03, 2021 |
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (ML Research Paper Explained)
|
May 02, 2021 |
DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)
|
May 02, 2021 |
Why AI is Harder Than We Think (Machine Learning Research Paper Explained)
|
May 02, 2021 |