
AXRP - the AI X-risk Research Podcast

Nov 26, 2023

The events of this year have highlighted important questions about the governance of artificial intelligence. For instance, what does it mean to democratize AI? And how should we balance benefits and dangers of open-sourcing powerful AI systems such as large language models? In this episode, I speak with Elizabeth...


Oct 3, 2023

Imagine a world where there are many powerful AI systems, working at cross purposes. You could suppose that different governments use AIs to manage their militaries, or simply that many powerful AIs have their own wills. At any rate, it seems valuable for them to be able to work together and...


Jul 27, 2023

Recently, OpenAI made a splash by announcing a new "Superalignment" team. Led by Jan Leike and Ilya Sutskever, the team would consist of top researchers, attempting to solve alignment for superintelligent AIs in four years by figuring out how to build a trustworthy human-level AI alignment researcher, and then using it...


Jul 27, 2023

Is there some way we can detect bad behaviour in our AI system without having to know exactly what it looks like? In this episode, I speak with Mark Xu about mechanistic anomaly detection: a research direction based on the idea of detecting strange things happening in neural networks, in the hope that that will alert...


Jun 28, 2023

Very brief survey: bit.ly/axrpsurvey2023

Store is closing in a week! Link: store.axrp.net/

Patreon: patreon.com/axrpodcast

Ko-fi: ko-fi.com/axrpodcast