Preview Mode Links will not work in preview mode

AXRP - the AI X-risk Research Podcast

17 - Training for Very High Reliability with Daniel Ziegler

Aug 21, 2022

Sometimes, people talk about making AI systems safe by taking examples where they fail and training them to do well on those. But how can we actually do this well, especially when we can't use a computer program to say what a 'failure' is? In this episode, I speak with Daniel Ziegler about his research group's efforts...