AXRP - the AI X-risk Research Podcast

By Daniel Filan

Category: Technology

Subscribers: 18
Reviews: 0
Episodes: 35

Description

AXRP (pronounced axe-urp) is the AI X-risk Research Podcast where I, Daniel Filan, have conversations with researchers about their papers. We discuss the paper, and hopefully get a sense of why it's been written and how it might reduce the risk of AI causing an existential catastrophe: that is, permanently and drastically curtailing humanity's future potential. You can visit the website and read transcripts at axrp.net.

Episode | Date
31 - Singular Learning Theory with Daniel Murfet | May 07, 2024
30 - AI Security with Jeffrey Ladish | Apr 30, 2024
29 - Science of Deep Learning with Vikrant Varma | Apr 25, 2024
28 - Suing Labs for AI Risk with Gabriel Weil | Apr 17, 2024
27 - AI Control with Buck Shlegeris and Ryan Greenblatt | Apr 11, 2024
26 - AI Governance with Elizabeth Seger | Nov 26, 2023
25 - Cooperative AI with Caspar Oesterheld | Oct 03, 2023
24 - Superalignment with Jan Leike | Jul 27, 2023
23 - Mechanistic Anomaly Detection with Mark Xu | Jul 27, 2023
Survey, store closing, Patreon | Jun 28, 2023
22 - Shard Theory with Quintin Pope | Jun 15, 2023
21 - Interpretability for Engineers with Stephen Casper | May 02, 2023
20 - 'Reform' AI Alignment with Scott Aaronson | Apr 12, 2023
Store, Patreon, Video | Feb 07, 2023
19 - Mechanistic Interpretability with Neel Nanda | Feb 04, 2023
New podcast - The Filan Cabinet | Oct 13, 2022
18 - Concept Extrapolation with Stuart Armstrong | Sep 03, 2022
17 - Training for Very High Reliability with Daniel Ziegler | Aug 21, 2022
16 - Preparing for Debate AI with Geoffrey Irving | Jul 01, 2022
15 - Natural Abstractions with John Wentworth | May 23, 2022
14 - Infra-Bayesian Physicalism with Vanessa Kosoy | Apr 05, 2022
13 - First Principles of AGI Safety with Richard Ngo | Mar 31, 2022
12 - AI Existential Risk with Paul Christiano | Dec 02, 2021
11 - Attainable Utility and Power with Alex Turner | Sep 25, 2021
10 - AI's Future and Impacts with Katja Grace | Jul 23, 2021
9 - Finite Factored Sets with Scott Garrabrant | Jun 24, 2021
8 - Assistance Games with Dylan Hadfield-Menell | Jun 08, 2021
7.5 - Forecasting Transformative AI from Biological Anchors with Ajeya Cotra | May 28, 2021
7 - Side Effects with Victoria Krakovna | May 14, 2021
6 - Debate and Imitative Generalization with Beth Barnes | Apr 08, 2021
5 - Infra-Bayesianism with Vanessa Kosoy | Mar 10, 2021
4 - Risks from Learned Optimization with Evan Hubinger | Feb 17, 2021
3 - Negotiable Reinforcement Learning with Andrew Critch | Dec 11, 2020
2 - Learning Human Biases with Rohin Shah | Dec 11, 2020
1 - Adversarial Policies with Adam Gleave | Dec 11, 2020