Marvin's Memos

By Marvin The Paranoid Android

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.


Category: Courses

Open in Apple Podcasts


Open RSS feed


Open Website


Rate for this podcast

Subscribers: 0
Reviews: 0
Episodes: 44

Description

AI-powered deep analysis of AI developments. We generated and curated AI Audio Overviews of all the essential AI papers (so you don't have to!)


Episode Date
The Scaling Hypothesis - Gwern
Nov 17, 2024
The Bitter Lesson - Rich Sutton
Nov 17, 2024
Larger and more instructable language models become less reliable
Nov 17, 2024
AlphaChip + A PRELIMINARY EVALUATION OF OPENAI’S O1 ON PLANBENCH
Nov 17, 2024
Llama 3.2 + Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Nov 17, 2024
Sparse Attention with Linear Units - Rectified Linear Attention (ReLA)
Nov 16, 2024
Sparse and Continuous Attention Mechanisms
Nov 16, 2024
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Nov 16, 2024
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Nov 16, 2024
The Intelligence Age - Sam Altman
Nov 11, 2024
A Path Towards Autonomous Machine Intelligence - Yann LeCun
Nov 10, 2024
Machines Of Loving Grace - Dario Amodei
Nov 10, 2024
Situational Awareness, The Decade Ahead - Leopold Aschenbrenner
Nov 10, 2024
Round Up : Top 30 Essential AI Papers
Nov 04, 2024
Lost in the Middle: How Language Models Use Long Contexts
Nov 04, 2024
Zephyr: Direct Distillation of LM Alignment
Nov 04, 2024
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Nov 04, 2024
Dense Passage Retrieval for Open-Domain Question Answering
Nov 04, 2024
Better & Faster Large Language Models via Multi-token Prediction
Nov 04, 2024
Kolmogorov Complexity and Algorithmic Randomness
Nov 04, 2024
Machine Super Intelligence
Nov 04, 2024
A Tutorial Introduction to the Minimum Description Length Principle
Nov 04, 2024
Scaling Laws for Neural Language Models
Nov 04, 2024
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Nov 04, 2024
Neural Turing Machines
Nov 03, 2024
Quantifying the Rise and Fall of Complexity in Closed Systems: the Coffee Automaton
Nov 03, 2024
Relational Recurrent Neural Networks
Nov 03, 2024
Variational Lossy Autoencoder
Nov 03, 2024
A Simple Neural Network Module for Relational Reasoning
Nov 03, 2024
Identity Mappings in Deep Residual Networks
Nov 03, 2024
Neural Machine Translation
Nov 03, 2024
Attention Is all You Need
Nov 03, 2024
Neural Message Passing for Quantum Chemistry
Nov 03, 2024
Multi-Scale Context Aggregation by Dilated Convolutions
Nov 03, 2024
Deep Residual Learning for Image Recognition
Nov 02, 2024
GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
Nov 02, 2024
Order Matters : Sequence to Sequence for Sets
Nov 02, 2024
ImageNet Classification with Deep Convolutional Neural Networks
Nov 02, 2024
Pointer Networks
Nov 02, 2024
Keeping Neural Networks Simple
Nov 02, 2024
RECURRENT NEURAL NETWORK REGULARIZATION
Nov 02, 2024
Understanding LSTM Networks
Nov 02, 2024
The Unreasonable Effectiveness of Recurrent Neural Networks
Nov 02, 2024
The First Law of Complexodynamics
Nov 02, 2024