AI-powered deep analysis of AI developments. We generated and curated AI Audio Overviews of all the essential AI papers (so you don't have to!)
| Episode | Date |
|---|---|
| The Scaling Hypothesis - Gwern | Nov 17, 2024 |
| The Bitter Lesson - Rich Sutton | Nov 17, 2024 |
| Larger and more instructable language models become less reliable | Nov 17, 2024 |
| AlphaChip + A Preliminary Evaluation of OpenAI’s o1 on PlanBench | Nov 17, 2024 |
| Llama 3.2 + Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models | Nov 17, 2024 |
| Sparse Attention with Linear Units - Rectified Linear Attention (ReLA) | Nov 16, 2024 |
| Sparse and Continuous Attention Mechanisms | Nov 16, 2024 |
| FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Nov 16, 2024 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | Nov 16, 2024 |
| The Intelligence Age - Sam Altman | Nov 11, 2024 |
| A Path Towards Autonomous Machine Intelligence - Yann LeCun | Nov 10, 2024 |
| Machines Of Loving Grace - Dario Amodei | Nov 10, 2024 |
| Situational Awareness, The Decade Ahead - Leopold Aschenbrenner | Nov 10, 2024 |
| Round Up: Top 30 Essential AI Papers | Nov 04, 2024 |
| Lost in the Middle: How Language Models Use Long Contexts | Nov 04, 2024 |
| Zephyr: Direct Distillation of LM Alignment | Nov 04, 2024 |
| Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | Nov 04, 2024 |
| Dense Passage Retrieval for Open-Domain Question Answering | Nov 04, 2024 |
| Better & Faster Large Language Models via Multi-token Prediction | Nov 04, 2024 |
| Kolmogorov Complexity and Algorithmic Randomness | Nov 04, 2024 |
| Machine Super Intelligence | Nov 04, 2024 |
| A Tutorial Introduction to the Minimum Description Length Principle | Nov 04, 2024 |
| Scaling Laws for Neural Language Models | Nov 04, 2024 |
| Deep Speech 2: End-to-End Speech Recognition in English and Mandarin | Nov 04, 2024 |
| Neural Turing Machines | Nov 03, 2024 |
| Quantifying the Rise and Fall of Complexity in Closed Systems: the Coffee Automaton | Nov 03, 2024 |
| Relational Recurrent Neural Networks | Nov 03, 2024 |
| Variational Lossy Autoencoder | Nov 03, 2024 |
| A Simple Neural Network Module for Relational Reasoning | Nov 03, 2024 |
| Identity Mappings in Deep Residual Networks | Nov 03, 2024 |
| Neural Machine Translation | Nov 03, 2024 |
| Attention Is All You Need | Nov 03, 2024 |
| Neural Message Passing for Quantum Chemistry | Nov 03, 2024 |
| Multi-Scale Context Aggregation by Dilated Convolutions | Nov 03, 2024 |
| Deep Residual Learning for Image Recognition | Nov 02, 2024 |
| GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism | Nov 02, 2024 |
| Order Matters: Sequence to Sequence for Sets | Nov 02, 2024 |
| ImageNet Classification with Deep Convolutional Neural Networks | Nov 02, 2024 |
| Pointer Networks | Nov 02, 2024 |
| Keeping Neural Networks Simple | Nov 02, 2024 |
| Recurrent Neural Network Regularization | Nov 02, 2024 |
| Understanding LSTM Networks | Nov 02, 2024 |
| The Unreasonable Effectiveness of Recurrent Neural Networks | Nov 02, 2024 |
| The First Law of Complexodynamics | Nov 02, 2024 |