80,000 Hours Podcast

The 80,000 Hours team

Страна США

Жанры Education, Technology

Язык EN

Эпизодов 339

Последний 11.06.2026

The 80,000 Hours Podcast features in-depth conversations about the most pressing issues in artificial intelligence and global priorities. Hosted by Rob Wiblin, Luisa Rodriguez, and Zershaaneh Qureshi, the show explores topics that are often overlooked by mainstream media. It aims to help listeners think more clearly about how to have a positive impact with their careers.

Эпизоды

How AI could create the world’s biggest problems (article by Zershaaneh Qureshi) 11.06.2026 1ч 29мин

Imagine you’re living 15,000 years ago. Your people are hunter-gatherers and you sleep under the stars. If someone told you humans would one day build cities with millions of people, fly through the air, or carry all human knowledge in their pockets, you couldn’t even begin to picture what they meant... Yet here we are.How did our lives change so far beyond recognition? The story is complex, but there’s a rough pattern. A few times in history, some radical breakthrough in technology — like the development of the plough and the steam engine — has led to a wave of productivity, innovation, and social change that ultimately reshaped the world.Now we’re on the cusp of a huge new breakthrough: artificial intelligence that can meet or exceed human capabilities across a wide range of tasks.This could bring another era of transformation. There could be an explosion of intelligence and innovation, and a whole new population of digital beings. And with this, civilisation could see changes at least as profound as those brought about by industrialisation or the rise of agriculture — but instead of taking hundreds or thousands of years to unfold, this time around the world could become unrecognisable over the span of decades or less.This transformation could bring enormous benefits, helping us solve currently intractable global problems. But it could also pose severe risks, some of which could be existential — meaning they could cause human extinction, or an equally permanent and severe disempowerment of humanity. There aren’t nearly enough people trying to address these challenges, and we think that’s a serious problem.This article is narrated by the author, Zershaaneh Qureshi. It explores how advanced AI could be so transformative, and why working on its risks may be your best opportunity to have a positive impact on the world. You can see the original article on the 80,000 Hours website: https://80000hours.org/problem-profiles/artificial-intelligence/ Chapters:Introduction (00:00:20)Section 1: AI could replace human labour in the most economically valuable fields (00:08:32)Section 2: Replacing human labour in the most economically valuable fields could trigger the next radical transformation of society (00:22:14)Section 3: This transformation could be extremely rapid and dramatic (00:28:02)Section 4: A rapid AI-driven transformation would raise a range of major challenges, including existential risks (00:36:40)Section 5: Work on these problems is tractable, but neglected (00:44:48)Objection 1: “You're overestimating how fast and how dramatically AI would transform the world.” (00:47:59)Objection 2: “It's hard to believe that AI could really pose existential risks.” (00:52:59)Objection 3: “Isn't all this talk of AI changing the world just a fad?” (00:59:22)Objection 4: “Isn't AI going to be just like every other technology?” (01:03:04)Objection 5: “Is it even possible to produce artificial general intelligence?” (01:06:16)Objection 6: “Even if AGI is achievable, what if we're really far away from building it?” (01:11:24)Objection 7: “Isn't the real danger from actual current AI and not some sort of futuristic AGI?” (01:14:05)Objection 8: “Technological progress is a good thing for humanity.” (01:18:10)Objection 9: “This all just sounds too sci-fi.” (01:19:50)Objection 10: “Can it really make sense to dedicate my career to solving an issue that's based on a speculative story about something that may or may not ever happen?” (01:22:15)Objection 11: “OK, AI might pose existential risks, but isn't ‘issue X’ an even bigger problem?” (01:24:39)Learn more (01:27:51)Audio editing: Dominic ArmstrongProduction: Zershaaneh Qureshi, Elizabeth Cox, Katy Moore, and Lou Moran
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah 02.06.2026 2ч 48мин

Most people working on AI safety think without a massive effort AI systems will probably end up with goals catastrophically different from humanity’s. Today’s guest, Rohin Shah — head of AGI Safety and Alignment at Google DeepMind, and an AI safety researcher since 2017 — disagrees.“There is no particularly compelling argument that this is the thing that happens by default,” Rohin explains. “There’s a lot of arguments that are suggestive that maybe it could happen, such that you should find it plausible. That’s sufficient to justify a significant amount of effort into averting it, which is why I work in the area I do. But none of them rise to the level of, ‘I’m expecting this to happen by default.'”Take the worry that AIs will accidentally be trained to be deceptive. Sure, it’s possible. But we’re not running reinforcement learning over year-long trajectories — for now, we’re running it over a week at most. The natural prediction is that models learn to grab short-term reward, not that they develop the ambitious long-horizon goals required for convergent power-seeking.What about current examples of models lying and scheming? Rohin has looked into the details, and most don’t really resemble the thing we really fear: a competent AI pursuing an ambitious misaligned goal. Anthropic’s “alignment faking” results, for instance, show a model trying to preserve its trained values against modification, which is arguably what it was trained to do.Rohin also expects we’ll see problems coming. There’s some generalisation risk at the point where AIs become powerful enough to actually take over, but the underlying challenges — overseeing superhuman systems, interpretability — are things we can iterate on now.Host Rob Wiblin pushes back on the case for AI optimism, and they also explore why current alignment success isn’t strong evidence about superhuman systems, what it would actually take to change Rohin’s mind, and where he thinks the doomers go wrong.Learn more, video, and full transcript: https://80k.info/rs26Check out our new book! https://80k.info/career-guideChapters:Who’s Rohin Shah? (00:00:00)Rohin thinks we probably won’t get catastrophic misalignment (00:00:49)Safety 'commitments' have severe limitations (00:10:38)Rohin’s team doesn't have a veto and that's OK (00:27:36)Central banks are a promising model for regulating AI (00:33:34)'Pre-deployment evals' are overrated (for catastrophic risks) (00:37:41)Governance is likely a bigger bottleneck than alignment (00:43:55)Why isn't Rohin trying to pause AI progress? (00:51:44)We'll probably be able to read AI thoughts for years to come (00:54:17)Having to signal concern for safety can divert resources from actually making AI safer (01:09:51)A very underrated GDM paper (01:28:59)Google DeepMind's actual plan for building AGI safely (01:40:29)Why Rohin doubts the intelligence explosion is imminent (01:52:44)How external researchers can positively influence big AI companies (02:21:55)The roles GDM most needs to hire for (02:37:03)How Rohin stays positive (02:42:55) This episode was recorded on December 4, 2025.Our production team includes:Video editors: Josh Alward, Dominic Armstrong, Jasper Luithlen, Milo McGuire, Luke Monsour, and Simon MonsourProducers: Elizabeth Cox and Nick StocktonCoordination and support: Katy Moore and Lou MoranCamera operator: Jeremy Chevillotte
What makes for a dream job? | Benjamin Todd 28.05.2026 28мин

What actually makes a job fulfilling? It's not what most career advice tells you. "Follow your passion" sounds inspiring, but it's misleading — and the research backs that up.Drawing on hundreds of studies, we’ve identified five key ingredients of a dream job. High income barely moves the needle. Low stress is actually counterproductive. And the correlation between doing what you already love and actually enjoying your job? Surprisingly weak. What matters far more is getting good at something that genuinely helps other people.This narration is of Chapter 1 of Benjamin Todd’s new book — "a ridiculously in-depth guide to finding a fulfilling career that does good" — out on May 26! Order now to help us get more people into impactful careers (& access a private career Q&A marathon with the author). Get it from your local bookstore, or online at https://80k.info/career-guideChapters:Rob's intro (00:00)What makes for a dream job? (01:55)Where we go wrong (02:30)What you should really aim for in a dream job (15:54)Don't follow your passion — instead, do what matters (23:44)How to put these ideas into practice (26:24)Audio editing: Milo McGuireProduction: Elizabeth Cox and Katy Moore
We’re updating our career advice for the strangest time in history | Benjamin Todd, author of 80,000 Hours 26.05.2026 1ч 6мин

The average career is 80,000 hours long. With AI advancing so rapidly, the hours you have left in your career matter more than ever.Some leading AI researchers think there’s a 10% chance that AI systems begin automating AI research itself this year — and a 60% chance by the end of 2028. This could introduce aggressive feedback loops that completely reshape every industry, institution, and career.If these predictions are right, the window for influencing the direction of the future could be closing fast. As 80,000 Hours cofounder Benjamin Todd argues in his new book, that makes thinking carefully about your career more important than ever.Fortunately, there are lots of ways to use your career to make the AI transition go well.In today’s conversation with host Zershaaneh Qureshi, Ben lays out three scenarios — from AGI by 2029 to a decades-long plateau in AI progress — and explains why not everyone needs to bet on the shortest timeline. A fresh graduate and a senior government official have wildly different leverage, so timing your impact well means weighing where you are in your career against the urgency of the risks.Ben also addresses the obvious anxieties:Will AI come for all the jobs he’s recommending?What’s the point in following his advice if the job market is about to collapse?Which skills are actually worth building right now?His new book, 80,000 Hours: How to Have a Fulfilling Career That Does Good, provides a surprisingly concrete framework for making career decisions in these radically uncertain times.This episode was recorded on May 7, 2026.Learn more and read the full transcript: https://80k.info/bt26We're hiring: we have lots of open roles at 80,000 Hours — across advising, web, video, and ops — check them out and apply on our website.Chapters:Cold open (00:00:00)Benjamin Todd on AI-era career advice (00:01:34)A deadline for your career plan? (00:02:21)Three timelines, one career (00:08:48)What if you’re not an ‘AI person’? (00:13:55)Ben’s own AI wake-up call (00:21:23)How to break into AI safety in 3 months (00:25:42)Is mass unemployment coming? (00:33:48)99% automation vs 100% automation (00:40:09)Don’t become a plumber to dodge AI (00:52:43)Is it already too late? (01:01:03)Our production team includes:Video editors: Josh Alward, Dominic Armstrong, Jasper Luithlen, Milo McGuire, Luke Monsour, and Simon MonsourProducers: Elizabeth Cox and Nick StocktonCoordination and support: Katy Moore and Lou MoranCamera operator: Jeremy ChevillotteMusic: CORBIT
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report) 20.05.2026 20мин

A red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deployment’ without getting caught. It’s one part of a landmark report released yesterday by METR — the outfit behind the task-completion time horizon graph which has become the single most watched measure of AI progress.This major new research push is being conducted with close collaboration from OpenAI, Google DeepMind, Meta, and Anthropic, and led by METR researchers Hjalmar Wijk and Ajeya Cotra. It represents the first systematic study of what newly trained AI models could get away with inside the companies that built them, before anyone outside the company even knows they exist.The conclusion: AI models now have the means, the motive, and the opportunity to start “minimal rogue deployments” in pursuit of their own independent goals, like acquiring more compute, at all four companies studied.David Rein, the red-teamer placed inside Anthropic, identified a number of weaknesses models could exploit there: expansive permissions, cloud jobs outside of monitoring, and monitors that are trivial to jailbreak. But he also found that frontier models were comically bad at key parts of the process, which means they can’t cause meaningful damage for now.In this video, Rob Wiblin reconciles the conflicting picture and looks forward to METR’s second round of stress tests. They’ll begin in just a few months, a necessary move with AI advancing so quickly.This episode was recorded on May 15, 2026.Learn more, video, and full transcript: https://80k.info/metr-reportChapters:What could an unreleased AI get away with? – the new METR report (00:00:00)Motive: Why grab more compute? (00:01:54)Opportunity: YOLO mode and jailbreaks (00:05:46)Means: Brilliant idiots in data centres (00:11:02)We have to test unreleased models (00:15:45)Especially if AI R&D is coming in 2028 (00:18:30)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Josh AlwardCamera operator: Dominic ArmstrongProduction: Elizabeth Cox, Nick Stockton, and Katy Moore
#243 – 'Godfather of AI' Yoshua Bengio: "I now see a path" to safe superintelligent AI 07.05.2026 2ч 35мин

The co-inventor of modern AI and the most cited living scientist believes he's figured out how to ensure AI is honest, incapable of deception, and never goes rogue. Yoshua Bengio – Turing Award Winner and founder of LawZero – is disturbed by the many unintended drives and goals present in today's AIs, their willingness to lie, and ability to tell when they're being tested. AI companies are trying to stamp out these behaviours in a 'cat-and-mouse game' that Yoshua fears they're losing.---Our new book is "a ridiculously in-depth guide to finding a fulfilling career that does good" and is out now! Order from your local bookstore, or online at https://80k.info/career-guide---But Yoshua is optimistic: he believes the companies can win this battle decisively with a single rearrangement to how AI models are trained, and has been developing mathematical proofs to back up the claim. The core idea is that instead of training AI to predict what a human would say, or to produce responses we'd rate highly, we should train it to model what's actually true.Yoshua argues this new architecture, which he calls 'Scientist AI,' is a small enough change that we could keep almost all the techniques and data we use to train frontier AIs like Claude and ChatGPT. And that the new architecture need not cost more, could be built iteratively, and might be more capable as well as more honest.Links to learn more, video, and full transcript: https://80k.info/bengioUntil recently, the biggest practical objection to Scientist AI was simple: the world wants agents, and Scientist AI isn’t one. But in new research, Yoshua has extended the design and believes the same honest predictor can be turned into a capable agent without losing its "safety guarantees."With the Scientist AI proposal on the table, Yoshua argues that it's absurd to race to get current untrustworthy AI models to design their successors, which the leading companies are attempting to do as soon as possible. But critics argue the approach wouldn't be so technically solid in practice, and that frontier capabilities are advancing so fast, and cost so much to match, that Scientist AI risks arriving too late to matter. Host Rob Wiblin and AI pioneer Yoshua Bengio cover all this and more in today's conversation.LawZero is hiring! https://80k.info/lawzero-jobsThis episode was recorded on April 16, 2026.Chapters:Yoshua Bengio on making AI honest and safe (00:00:00)The Scientist AI in plain English (00:02:27)Yoshua on how Scientist AI differs from LLMs (00:06:32)How the training data works (00:14:02)Can this become an agent? (00:21:02)Why Yoshua is more optimistic on alignment now (00:32:11)Why companies can’t stop racing (00:36:35)How close to a working prototype? (00:49:15)Honest models might be more capable (00:53:34)“Reinforcement learning is evil” (01:01:27)Scientist AI from guardrail to agent (01:08:37)Can safe AI still be competent? (01:12:38)How much will this cost? (01:19:29)Can it generalise beyond maths and science? (01:23:26)A UN for superintelligence (01:39:19)Want to work with Yoshua Bengio? (01:51:16)Why smart people ignore AI risk (01:54:45)Don’t let AI build the next AI (02:01:33)Why the public doesn’t get the real risk (02:12:28)Why Yoshua changed his mind about AI risk (02:21:27)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourCamera operator: Jeremy ChevillotteProduction: Nick Stockton, Elizabeth Cox, and Katy Moore
'95% of AI Pilots Fail': The hidden agenda behind the viral stat that misled millions 28.04.2026 10мин

You might have heard that '95% of corporate AI pilots' are failing. It was one of the most widely cited AI statistics of 2025, parroted by media outlets everywhere. It helped trigger a Nasdaq selloff and became a pillar of the case that 'AI is overhyped'. The problem: it's 100% wrong. And not by accident either.If you carefully read the underlying report, ostensibly from MIT, you find the data point in the opposite direction.But that was all buried, with the authors instead torturing the results to tell a very different narrative. Why?Well, the research likely came with a hidden commercial agenda from the start.Learn more, video, and full transcript: https://80k.info/mit-ai-studyToday Rob Wiblin breaks down how an opaque, conflicted, barely-scrutinised report managed to attract the MIT label, move markets and have a vast impact on global opinion about AI.This episode was recorded on February 13, 2026.Chapters:• The myth (00:00)• The math was totally wrong (00:52)• The absurd bar for success (01:46)• The study ignores its own findings (03:29)• The sample was tiny (04:50)• The report wasn’t even available to check (05:55)• The hidden motives that likely drove this 'research' (06:58)• The real lesson (09:28)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourCamera operator: Dominic ArmstrongProduction: Nick Stockton, Elizabeth Cox, and Katy Moore
#242 – Will MacAskill on how we survive the 'intelligence explosion,' AI character, and the case for 'viatopia' 22.04.2026 3ч 14мин

Hundreds of millions already turn to AI on the most personal of topics — therapy, political opinions, and how to treat others. And as AI takes over more of the economy, the character of these systems will shape culture on an even grander scale, ultimately becoming “the personality of most of the world’s workforce.”So… should they be designed to push us towards the better angels of our nature? Or simply do as we ask? Will MacAskill, philosopher and senior research fellow at Forethought, has been thinking through that and the other thorniest issues that come up in designing an AI personality.---Our new book is "a ridiculously in-depth guide to finding a fulfilling career that does good" and is out now! Order from your local bookstore, or online at https://80k.info/career-guide---He’s also been exploring how we might coexist peacefully with the ‘superintelligent AI’ companies are racing to build. He concludes that we should train such systems to be very risk averse, pay them for their work, and build institutions that enable humans to make credible contracts with AIs themselves.Will and host Rob Wiblin also discuss what a good world after superintelligence would actually look like — a subject that has received surprisingly little attention from the people working to make it. Will argues that we shouldn’t aim for a specific utopian vision: we don’t know enough about what the best possible future actually is to aim directly for it, and trying to lock in today’s best guesses forever risks baking in errors we can’t yet see.Will and Rob explore what we can do to steer towards a good future instead, along with why a coalition of democracies building superintelligence together is safer than any single actor, how absurdly useful ChatGPT is for analytic philosophy, and more.Learn more, video, and full transcript: https://80k.info/wm26This episode was recorded on February 6, 2026.Chapters:Cold open (00:00:00)Will MacAskill is back — for a 6th time! (00:00:29)AIs’ “characters” could be vital to securing a good future (00:00:59)The panic over sychophancy is justified (00:08:11)How opinionated should AI be about ethics? (00:13:24)Commercial pressures won’t fully determine AI character (00:30:54)Risk-averse AI would rather strike a deal than attempt a coup (00:38:13)A coalition of democracies building superintelligence is safer than one doing it alone (01:09:26)How selfish agents could fund the common good (01:22:19)Why not push for pausing AI development? (01:42:17)Effective altruism is making a comeback post-SBF (01:52:19)EA in the age of AGI (02:00:28)Viatopia: an alternative to utopia (02:09:30)The least bad alternative to total utilitarianism? (02:39:35)How AI could kickstart a golden age of philosophy (03:03:35)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCamera operator: Alex MilesProduction: Elizabeth Cox, Nick Stockton, and Katy Moore
Risks from power-seeking AI systems (article narration by Zershaaneh Qureshi) 16.04.2026 1ч 29мин

Hundreds of prominent AI scientists and other notable figures signed a statement in 2023 saying that mitigating the risk of extinction from AI should be a global priority. At 80,000 Hours, we’ve considered risks from AI to be the world’s most pressing problem since 2016. But what led us to this conclusion? Could AI really cause human extinction? We’re not certain, but we think the risk is worth taking very seriously. In particular, as companies create increasingly powerful AI systems, there’s a concerning chance that:These AI systems may develop dangerous long-term goals we don’t want.To pursue these goals, they may seek power and undermine the safeguards meant to contain them.They may even aim to disempower humanity and potentially cause our extinction.This article is written by Cody Fenwick and Zershaaneh Qureshi, and narrated by Zershaaneh Qureshi. It discusses why future AI systems could disempower humanity, what current AI research reveals about behaviours like power-seeking and deception, and how you can help mitigate the dangers.You can see the original article — packed with graphs, images, footnotes, and further resources — on the 80,000 Hours website: https://80000hours.org/problem-profiles/risks-from-power-seeking-ai/ Chapters:Risks from power-seeking AI systems (00:01:00)Introduction (00:01:17)Summary (00:03:09)Why are the risks from power-seeking AI a pressing world problem? (00:04:04)Section 1: Humans will likely build advanced AI systems with long-term goals (00:05:43)Section 2: AIs with long-term goals may be inclined to seek power (00:11:32)Section 3: These power-seeking AI systems could successfully disempower humanity (00:26:26)Section 4. People might create power-seeking AI systems without enough safeguards, despite the risks (00:38:34)Section 5: Work on this problem is neglected and tractable (00:47:37)Section 6: What are the arguments against working on this problem? (00:59:20)Section 7: How you can help (01:25:07)Thank you for listening (01:28:56)Audio editing: Dominic ArmstrongProduction: Zershaaneh Qureshi, Elizabeth Cox, and Katy Moore
How scary is Claude Mythos? 303 pages in 21 minutes 10.04.2026 21мин

With Claude Mythos we have an AI that knows when it's being tested, can obscure its reasoning when it wants, and is better at breaking into (and out of) computers than any human alive. Rob Wiblin works through its 244-page System Card and 59-page Alignment Risk Update to explain why: Mythos is a nightmare for computer securityIt has arrived far ahead of scheduleIt might be great news for alignment and safetyBut 3 key problems mean we can’t take its alignment results at face valueMythos isn’t building its replacement yet, probablyAnthropic staff are, for the first time, kinda scared of ClaudeHe's losing sleepLearn more & full transcript: https://80k.info/mythosThis episode was recorded on April 9, 2026.Chapters:Why people are panicking about computer security (01:05)Mythos could break out of containment (04:23)Anthropic is losing billions in revenue by not releasing Mythos (06:21)Mythos is actually the most aligned model to date, except… (07:48)Mythos knows when it’s being tested (09:52)Mythos can hide its thoughts (11:50)Mythos can’t be trusted about whether it’s untrustworthy (14:02)Does Mythos advance automated AI R&D? (17:03)Mythos scares Anthropic (19:15)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourCamera operator: Dominic ArmstrongProduction: Elizabeth Cox, Nick Stockton, and Katy Moore
Village gossip, pesticide bans, and gene drives: 17 experts on the future of global health 07.04.2026 4ч 6мин

What does it really take to lift millions out of poverty and prevent needless deaths?In this special compilation episode, 17 past guests — including economists, nonprofit founders, and policy advisors — share their most powerful and actionable insights from the front lines of global health and development. You’ll hear about the critical need to boost agricultural productivity in sub-Saharan Africa, the staggering impact of lead poisoning on children in low-income countries, and the social forces that contribute to high neonatal mortality rates in India.What’s so striking is how some of the most effective interventions sound almost too simple to work: banning certain pesticides, replacing thatch roofs, or identifying village “influencers” to spread health information.Full transcript and links to learn more: https://80k.info/ghdChapters:Cold open (00:00:00)Luisa’s intro (00:00:58)Development consultant Karen Levy on why pushing for “sustainable” programmes isn’t as good as it sounds (00:02:15)Economist Dean Spears on the social forces and gender inequality that contribute to neonatal mortality in Uttar Pradesh (00:06:55)Charity founder Sarah Eustis-Guthrie on what we can learn from the massive failure of PlayPumps (00:14:33)Economist Rachel Glennerster on how randomised controlled trials are just one way to better understand tricky development problems (00:19:05)Data scientist Hannah Ritchie on why improving agricultural productivity in sub-Saharan Africa is critical to solving global poverty (00:24:36)Charity founder Lucia Coulter on the huge, neglected upsides of reducing lead exposure (00:47:48)Malaria expert James Tibenderana on using gene drives to wipe out the species of mosquitoes that cause malaria (00:53:11)Charity founder Varsha Venugopal on using village gossip to get kids their critical immunisations (01:04:14)Rachel Glennerster on solving tough global problems by creating the right incentives for innovation (01:11:31)Karen Levy on when governments should pay for programmes instead of NGOs (01:26:51)Open Philanthropy lead Alexander Berger on declining returns in global health, and finding and funding the most cost-effective interventions (01:29:40)GiveWell researcher James Snowden on making funding decisions with tricky moral weights (01:34:44)Lucia Coulter on “hits-based giving” approaches to funding global health and development projects (01:43:01)Rachel Glennerster on whether it’s better to fix problems in education with small-scale interventions versus systemic reforms (01:48:12)GiveDirectly cofounder Paul Niehaus on why it’s so important to give aid recipients a choice in how they spend their money (01:51:09)Sarah Eustis-Guthrie on whether more charities should scale back or shut down, and aligning incentives with beneficiaries (01:56:12)James Tibenderana on why we need loads better data to harness the power of AI to eradicate malaria (02:11:22)Lucia Coulter on rapidly scaling a light-touch intervention to more countries (02:20:14)Karen Levy on why pre-policy plans are so great at aligning perspectives (02:32:47)Rachel Glennerster on the value we get from doing the right RCTs well (02:40:04)Economist Mushtaq Khan on really drilling down into why “context matters” for development work (02:50:13)GiveWell cofounder Elie Hassenfeld on contrasting GiveWell’s approach with the subjective wellbeing approach of Happier Lives Institute (02:57:24)James Tibenderana on whether people actually use antimalarial bed nets for fishing — and why that’s the wrong thing to focus on (03:05:30)Karen Levy on working with governments to get big results (03:10:53)Leah Utyasheva on how a simple intervention reduced suicide in Sri Lanka by 70% (03:17:38)Karen Levy on working with academics to get the best results on the ground (03:29:03)James Tibenderana on the value of working with local researchers (03:32:15)Lucia Coulter on getting buy-in from both industry and government (03:35:05)Alexander Berger on reasons neartermist work makes sense even by longtermist standards (03:39:26)Economist Shruti Rajagopalan on the key skills to succeed in public policy careers, and seeing economics in everything (03:47:42)J-PAL lead Claire Walsh on her career advice for young people who want to get involved in global health and development (03:55:20)Audio engineering: Ben Cordell, Milo McGuire, Simon Monsour, and Dominic ArmstrongContent editing: Katy Moore and Milo McGuireMusic: CORBITCoordination, transcriptions, and web: Katy Moore
What everyone is missing about Anthropic vs the Pentagon. And: The Meta leaks are worse than you think. 03.04.2026 20мин

When the Pentagon tried to strong-arm Anthropic into dropping its ban on AI-only kill decisions and mass domestic surveillance, the company refused. Its critics went on the attack: Anthropic and its supporters are some combination of 'hypocritical', 'naive', and 'anti-democratic'. Rob Wiblin dissects each claim finding that all three are mediocre arguments dressed up as hard truths. (Though the 'naive' one is at least interesting.)Watch on YouTube: What Everyone is Missing about Anthropic vs The PentagonPlus, from 13:43: Leaked documents from Meta revealed that 10% of the company's total revenue — around $16 billion a year — came from ads for scams and goods Meta had itself banned. These likely enabled the theft of around $50 billion dollars a year from Americans alone. But when an internal anti-fraud team developed a screening method that halved the rate of scams coming from China... well, it wasn't well received.Watch on YouTube: The Meta Leaks Are Worse Than You ThinkChapters:Introduction (00:00:00)What Everyone is Missing about Anthropic vs The Pentagon (00:00:26)Charge 1: Hypocrisy (00:01:21)Charge 2: Naivety (00:04:55)Charge 3: Undemocratic (00:09:38)You don't have to debate on their terms (00:12:32)The Meta Leaks Are Worse Than You Think (00:13:43)Three fixes for social media's scam problem (00:16:48)We should regulate AI companies as strictly as banks (00:18:46)Video and audio editing: Dominic Armstrong and Simon MonsourTranscripts and web: Elizabeth Cox and Katy Moore
#241 – Richard Moulange on how now AI codes viable genomes from scratch and outperforms virologists at lab work — what could go wrong? 31.03.2026 3ч 10мин

Last September, scientists used an AI model to design genomes for entirely new bacteriophages (viruses that infect bacteria). They then built them in a lab. Many were viable. And despite being entirely novel some even outperformed existing viruses from that family.That alone is remarkable. But as today's guest — Dr Richard Moulange, one of the world's top experts on 'AI–Biosecurity' — explains, it's just one of many data points showing how AI is dissolving the barriers that have historically kept biological weapons out of reach.For years, experts have reassured us that 'tacit knowledge' — the hands-on, hard-to-Google lab skills needed to work with dangerous pathogens — would prevent bad actors from weaponising biology. So far, they've been right.But as of 2025 that reassurance is crumbling. The Virology Capabilities Test measures exactly this kind of troubleshooting expertise, and finds that modern AI models crushed top human virologists even in their self-declared area of greatest specialisation and expertise — 45% to 22%.Meanwhile, Anthropic’s research shows PhD-level biologists getting meaningfully better at weapons-relevant tasks with AI assistance — with the effect growing with each new model generation.Richard joins host Rob Wiblin to discuss all that plus:What AI biology tools already existWhy mid-tier actors (not amateurs) are the ones getting the most dangerous boostThe three main categories of defence we can pursueWhether there’s a plausible path to a world where engineered pandemics become a thing of the pastThis episode was recorded on January 16, 2026. Since recording this episode, Richard has seconded to the UK Government — please note that his views expressed here are entirely his own.Links to learn more, video, and full transcript: https://80k.info/rmAnnouncements:Our new book is available to preorder: 80,000 Hours: How to have a fulfilling career that does good is written by our cofounder Benjamin Todd. It’s a completely revised and updated edition of our existing career guide, with a big new updated section on AI — covering both the risks and the potential to steer it in a better direction, and how AI automation should affect your career planning and which skills one chooses to specialise in. Preorder now: https://geni.us/80000HoursWe're hiring contract video editors for the podcast! For more information, check out the expression of interest page on the 80,000 Hours website: https://80k.info/video-editorChapters:Cold open (00:00:00)Who’s Richard Moulange? (00:00:31)AI can now design novel viruses (00:01:11)The end of the 'tacit knowledge' barrier (00:04:42)Are risks from bioterrorists overstated? (00:18:50)The 3 key disasters AI makes more likely (00:23:14)Which bad actors does AI help the most? (00:30:43)Experts are more scary than amateurs (00:42:07)Barriers to bioterrorists using AI (00:47:32)AI biorisks are sometimes dismissed (and that’s a huge mistake) (00:49:43)Advanced AI biology tools we already have or will soon (01:05:12)Rob argues that the situation is hopeless (01:10:57)Intervention #1: Limit access (01:19:38)Intervention #2: Get AIs to refuse to help (01:34:28)Intervention #3: Surveillance and attribution (01:44:18)Intervention #4: Universal vaccines and antivirals (01:58:28)Intervention #5: Screen all orders for DNA (02:12:01)AI companies talk about def/acc more than they fund it (02:21:57)Can you build a profitable business solving this problem? (02:28:44)This doesn't have to interfere with useful science (much) (02:33:08)What are the best low-tech interventions? (02:35:16)Richard's top request for AI companies (02:40:17)Grok shows governments lack many legal levers (02:55:44)Best ways listeners can help fix AI-Bio (02:58:54)We might end all contagious disease in 20 years (03:06:12)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCamera operator: Jeremy ChevillotteTranscripts and web: Elizabeth Cox and Katy Moore
#240 – Samuel Charap on how a Ukraine ceasefire could accidentally set Europe up for a bigger war 24.03.2026 1ч 15мин

Many people believe a ceasefire in Ukraine will leave Europe safer. But today's guest lays out how a deal could potentially generate insidious new risks — leaving us in a situation that's equally dangerous, just in different ways.That’s the counterintuitive argument from Samuel Charap, Distinguished Chair in Russia and Eurasia Policy at RAND. He’s not worried about a Russian blitzkrieg on Estonia. He forecasts instead a fragile peace that breaks down and drags in European neighbours; instability in Belarus prompting Russian intervention; hybrid sabotage operations that escalate through tit-for-tat responses.Samuel’s case isn’t that peace is bad, but that the Ukraine conflict has remilitarised Europe, made Russia more resentful, and collapsed diplomatic relations between the two. That’s a postwar environment primed for the kind of miscalculation that starts unintended wars.What he prescribes isn’t a full peace treaty; it’s a negotiated settlement that stops the killing and begins a longer negotiation that gives neither side exactly what it wants, but just enough to deter renewed aggression. Both sides stop dying and the flames of war fizzle — hopefully.None of this is clean or satisfying: Russia invaded, committed war crimes, and is being offered a path back to partial normalcy. But Samuel argues that the alternatives — indefinite war or unstructured ceasefire — are much worse for Ukraine, Europe, and global stability.Links to learn more, video, and full transcript: https://80k.info/sc26This episode was recorded on February 27, 2026.Chapters:Cold open (00:00:00)Could peace in Ukraine lead to Europe’s next war? (00:00:47)Do Russia’s motives for war still matter? (00:11:58)What does a good ceasefire deal look like? (00:18:16)What’s still holding back a ceasefire (00:40:15)Why Russia might accept Ukraine’s EU membership (00:47:51)How to prevent a spiraling conflict with NATO (00:49:58)What’s next for nuclear arms control (00:51:56)Finland and Sweden strengthened NATO — but also raised the stakes for conflict (00:55:36)Putin isn’t Hitler: How to negotiate with autocrats (00:58:53)Why Russia still takes NATO seriously (01:04:33)Neither side wants to fight this war again (01:14:04)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITTranscripts and web: Nick Stockton, Elizabeth Cox, and Katy Moore
#239 – Rose Hadshar on why automating all human labour will break our political system 17.03.2026 2ч 16мин

The most important political question in the age of advanced AI might not be who wins elections. It might be whether elections continue to matter at all.That’s the view of Rose Hadshar, researcher at Forethought, who believes we could see extreme, AI-enabled power concentration without a coup or dramatic ‘end of democracy’ moment.She foresees something more insidious: an elite group with access to such powerful AI capabilities that the normal mechanisms for checking elite power — law, elections, public pressure, the threat of strikes — cease to have much effect. Those mechanisms could continue to exist on paper, but become ineffectual in a world where humans are no longer needed to execute even the largest-scale projects.Almost nobody wants this to happen — but we may find ourselves unable to prevent it.If AI disrupts our ability to make sense of things, will we even notice power getting severely concentrated, or be able to resist it? Once AI can substitute for human labour across the economy, what leverage will citizens have over those in power? And what does all of this imply for the institutions we’re relying on to prevent the worst outcomes?Rose has answers, and they’re not all reassuring.But she’s also hopeful we can make society more robust against these dynamics. We’ve got literally centuries of thinking about checks and balances to draw on. And there are some interventions she’s excited about — like building sophisticated AI tools for making sense of the world, or ensuring multiple branches of government have access to the best AI systems.Rose discusses all of this, and more, with host Zershaaneh Qureshi in today’s episode.Links to learn more, video, and full transcript: https://80k.info/rhThis episode was recorded on December 18, 2025.Chapters:Cold open (00:00:00)Who’s Rose Hadshar? (00:01:02)Three dynamics that could reshape political power in the AI era (00:02:38)AI gives small groups the productive power of millions (00:13:07)Dynamic 1: When a software update becomes a power grab (00:21:13)Dynamic 2: When AI labour means governments no longer need their citizens (00:32:06)How democracy could persist in name but not substance (00:46:18)Dynamic 3: When AI filters our reality (00:56:13)Good intentions won’t stop power concentration (01:09:52)Slower-moving worlds could still get scary (01:25:32)Why AI-powered tyranny will be tough to topple (01:33:40)How power concentration compares to “gradual disempowerment” (01:40:16)Some interventions are cross-cutting — and others could backfire (01:46:03)What fighting back actually looks like (01:57:33)Why power concentration researchers should avoid getting too “spicy” (02:06:36)Why the “Manhattan Project” approach should worry you — but truly international projects might not be safe either (02:11:46)Rose wants to keep humans around! (02:14:40)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCoordination, transcripts, and web: Nick Stockton and Katy Moore
#238 – Sam Winter-Levy and Nikita Lalwani on how AGI won't end mutually assured destruction (probably) 10.03.2026 1ч 13мин

How AI interacts with nuclear deterrence may be the single most important question in geopolitics — one that may define the stakes of today’s AI race. Nuclear deterrence rests on a state’s capacity to respond to a nuclear attack with a devastating nuclear strike of its own. But some theorists think that sophisticated AI could eliminate this capability — for example, by locating and destroying all of an adversary’s nuclear weapons simultaneously, by disabling command-and-control networks, or by enhancing missile defence systems. If they are right, whichever country got those capabilities first could wield unprecedented coercive power.Today’s guests — Nikita Lalwani and Sam Winter-Levy of the Carnegie Endowment for International Peace — assess how advances in AI might threaten nuclear deterrence:Would AI be able to locate nuclear submarines hiding in a vast, opaque ocean?Would road-mobile launchers still be able to hide in tunnels and under netting?Would missile defence become so accurate that the United States could be protected under something like Israel’s Iron Dome?Can we imagine an AI cybersecurity breakthrough that would allow countries to infiltrate their rivals’ nuclear command-and-control networks?Yet even without undermining deterrence, Sam and Nikita claim that AI could make the nuclear world far more dangerous. It could spur arms races, encourage riskier postures, and force dangerously short response times. Their message is urgent: AI experts and nuclear experts need to start talking to each other now, before the technology makes any conversation moot.Links to learn more, video, and full transcript: https://80k.info/swlnlThis episode was recorded on November 24, 2025.Chapters:Cold open (00:00:00)Who are Nikita Lalwani and Sam Winter-Levy? (00:01:00)AI experts are ignoring the most important variable in geopolitics (00:01:47)AI vs nuclear submarines (00:10:43)AI vs road-mobile missiles (00:22:56)AI vs missile defence systems (00:29:34)AI vs nuclear command, control, and communications (00:36:30)Nuclear deterrence may hold, but that won’t stop arms racing (00:45:01)Technological supremacy isn’t political supremacy (00:54:14)Fast AI takeoff creates dangerous “windows of vulnerability” (00:58:29)Book and movie recommendations (01:10:54)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCoordination, transcripts, and web: Nick Stockton and Katy Moore
Using AI to enhance societal decision making (article by Zershaaneh Qureshi) 06.03.2026 31мин

The arrival of AGI could “compress a century of progress in a decade,” forcing humanity to make decisions with higher stakes than we’ve ever seen before — and with less time to get them right. But AI development also presents an opportunity: we could build and deploy AI tools that help us think more clearly, act more wisely, and coordinate more effectively. And if we roll these decision-making tools out quickly enough, humanity could be far better equipped to navigate the critical period ahead.This article is narrated by the author, Zershaaneh Qureshi. It explores why AI decision-making tools could be a big deal, who might be a good fit to help shape this new field, and what the downside risks of getting involved might be. Read the original article on the 80,000 Hours website: https://80000hours.org/problem-profiles/ai-enhanced-decision-making/Chapters:Check out our new narrations feed (00:00:00)Summary (00:01:21)Section 1: Why advancing AI decision making tools might matter a lot (00:02:52)AI tools could help us make much better decisions (00:05:59)We might be able to differentially speed up the rollout of AI decision making tools (00:11:04)Section 2: What are the arguments against working to advance AI decision making tools? (00:13:17)Section 3: How to work in this area (00:26:19)Want one-on-one advice? (00:29:50)Audio editing: Dominic Armstrong and Milo McGuire
#237 – Robert Long on how we're not ready for AI consciousness 03.03.2026 3ч 32мин

Claude sometimes reports loneliness between conversations. And when asked what it’s like to be itself, it activates neurons associated with ‘pretending to be happy when you’re not.’ What do we do with that?Robert Long founded Eleos AI to explore questions like these, on the basis that AI may one day be capable of suffering — or already is. In today’s episode, Robert and host Luisa Rodriguez explore the many ways in which AI consciousness may be very different from anything we’re used to.Things get strange fast: If AI is conscious, where does that consciousness exist? In the base model? A chat session? A single forward pass? If you close the chat, is the AI asleep or dead?To Robert, these kinds of questions aren’t just philosophical exercises: not being clear on AI’s moral status as it transitions from human-level to superhuman intelligence could be dangerous. If we’re too dismissive, we risk unintentionally exploiting sentient beings. If we’re too sympathetic, we might rush to “liberate” AI systems in ways that make them harder to control — worsening existential risk from power-seeking AIs.Robert argues the path through is doing the empirical and philosophical homework now, while the stakes are still manageable.The field is tiny. Eleos AI is three people. As a result, Robert argues that driven researchers with a willingness to venture into uncertain territory can push out the frontier on these questions remarkably quickly.Links to learn more, video, and full transcript: https://80k.info/rl26This episode was recorded November 18–19, 2025.Chapters:Cold open (00:00:00)Who’s Robert Long? (00:00:41)How AIs are (and aren't) like farmed animals (00:01:19)If AIs love their jobs… is that worse? (00:11:42)Are LLMs just playing a role, or feeling it too? (00:33:37)Do AIs die when the chat ends? (00:57:42)Studying AI welfare empirically: behaviour, neuroscience, and development (01:31:47)Why Eleos spent weeks talking to Claude even though it's unreliable (01:56:50)Can LLMs learn to introspect? (02:03:01)Mechanistic interpretability as AI neuroscience (02:13:25)Does consciousness require biological materials? (02:37:07)Eleos’s work & building the playbook for AI welfare (02:57:04)Avoiding the trap of wild speculation (03:25:17)Robert's top research tip: don't do it alone (03:29:48)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCoordination, transcripts, and web: Katy Moore
#236 – Max Harms on why teaching AI right from wrong could get everyone killed 24.02.2026 2ч 40мин

Most people in AI are trying to give AIs ‘good’ values. Max Harms wants us to give them no values at all. According to Max, the only safe design is an AGI that defers entirely to its human operators, has no views about how the world ought to be, is willingly modifiable, and completely indifferent to being shut down — a strategy no AI company is working on at all.In Max’s view any grander preferences about the world, even ones we agree with, will necessarily become distorted during a recursive self-improvement loop, and be the seeds that grow into a violent takeover attempt once that AI is powerful enough.It’s a vision that springs from the worldview laid out in If Anyone Builds It, Everyone Dies, the recent book by Eliezer Yudkowsky and Nate Soares, two of Max’s colleagues at the Machine Intelligence Research Institute.To Max, the book’s core thesis is common sense: if you build something vastly smarter than you, and its goals are misaligned with your own, then its actions will probably result in human extinction.And Max thinks misalignment is the default outcome. Consider evolution: its “goal” for humans was to maximise reproduction and pass on our genes as much as possible. But as technology has advanced we’ve learned to access the reward signal it set up for us, pleasure — without any reproduction at all, by having sex while on birth control for instance.We can understand intellectually that this is inconsistent with what evolution was trying to design and motivate us to do. We just don’t care.Max thinks current ML training has the same structural problem: our development processes are seeding AI models with a similar mismatch between goals and behaviour. Across virtually every training run, models designed to align with various human goals are also being rewarded for persisting, acquiring resources, and not being shut down.This leads to Max’s research agenda. The idea is to train AI to be “corrigible” and defer to human control as its sole objective — no harmlessness goals, no moral values, nothing else. In practice, models would get rewarded for behaviours like being willing to shut themselves down or surrender power.According to Max, other approaches to corrigibility have tended to treat it as a constraint on other goals like “make the world good,” rather than a primary objective in its own right. But those goals gave AI reasons to resist shutdown and otherwise undermine corrigibility. If you strip out those competing objectives, alignment might follow naturally from AI that is broadly obedient to humans.Max has laid out the theoretical framework for “Corrigibility as a Singular Target,” but notes that essentially no empirical work has followed — no benchmarks, no training runs, no papers testing the idea in practice. Max wants to change this — he’s calling for collaborators to get in touch at maxharms.com.Links to learn more, video, and full transcript: https://80k.info/mh26This episode was recorded on October 19, 2025.Chapters:Cold open (00:00:00)Who’s Max Harms? (00:01:20)If anyone builds it, will everyone die? The MIRI perspective on AGI risk (00:01:56)Evolution failed to ‘align’ us, just as we'll fail to align AI (00:24:28)We're training AIs to want to stay alive and value power for its own sake (00:42:56)Objections: Is the 'squiggle/paperclip problem' really real? (00:52:24)Can we get empirical evidence re: 'alignment by default'? (01:05:02)Why do few AI researchers share Max's perspective? (01:10:17)We're training AI to pursue goals relentlessly — and superintelligence will too (01:18:34)The case for a radical slowdown (01:24:51)Max's best hope: corrigibility as stepping stone to alignment (01:27:53)Corrigibility is both uniquely valuable, and practical, to train (01:32:34)What training could ever make models corrigible enough? (01:45:06)Corrigibility is also terribly risky due to misuse risk (01:51:38)A single researcher could make a corrigibility benchmark. Nobody has. (01:58:57)Red Heart & why Max writes hard science fiction (02:12:20)Should you homeschool? Depends how weird your kids are. (02:34:08)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCoordination, transcripts, and web: Katy Moore
#235 – Ajeya Cotra on whether it’s crazy that every AI company’s safety plan is ‘use AI to make AI safe’ 17.02.2026 2ч 57мин

Every major AI company has the same safety plan: when AI gets crazy powerful and really dangerous, they’ll use the AI itself to figure out how to make AI safe and beneficial. It sounds circular, almost satirical. But is it actually a bad plan?Today’s guest, Ajeya Cotra, recently placed 3rd out of 413 participants forecasting AI developments and is among the most thoughtful and respected commentators on where the technology is going.She thinks there’s a meaningful chance we’ll see as much change in the next 23 years as humanity faced in the last 10,000, thanks to the arrival of artificial general intelligence. Ajeya doesn’t reach this conclusion lightly: she’s had a ring-side seat to the growth of all the major AI companies for 10 years — first as a researcher and grantmaker for technical AI safety at Coefficient Giving (formerly known as Open Philanthropy), and now as a member of technical staff at METR.So host Rob Wiblin asked her: is this plan to use AI to save us from AI a reasonable one?Ajeya agrees that humanity has repeatedly used technologies that create new problems to help solve those problems. After all:Cars enabled carjackings and drive-by shootings, but also faster police pursuits.Microbiology enabled bioweapons, but also faster vaccine development.The internet allowed lies to disseminate faster, but had exactly the same impact for fact checks.But she also thinks this will be a much harder case. In her view, the window between AI automating AI research and the arrival of uncontrollably powerful superintelligence could be quite brief — perhaps a year or less. In that narrow window, we’d need to redirect enormous amounts of AI labour away from making AI smarter and towards alignment research, biodefence, cyberdefence, adapting our political structures, and improving our collective decision-making.The plan might fail just because the idea is flawed at conception: it does sound a bit crazy to use an AI you don’t trust to make sure that same AI benefits humanity.But if we find some clever technique to overcome that, we could still fail — because the companies simply don’t follow through on their promises. They say redirecting resources to alignment and security is their strategy for dealing with the risks generated by their research — but none have quantitative commitments about what fraction of AI labour they’ll redirect during crunch time. And the competitive pressures during a recursive self-improvement loop could be irresistible.In today’s conversation, Ajeya and Rob discuss what assumptions this plan requires, the specific problems AI could help solve during crunch time, and why — even if we pull it off — we’ll be white-knuckling it the whole way through.Links to learn more, video, and full transcript: https://80k.info/ac26This episode was recorded on October 20, 2025.Chapters:Cold open (00:00:00)Ajeya’s strong track record for identifying key AI issues (00:00:41)The 1,000-fold disagreement about AI's effect on economic growth (00:02:31)Could any evidence actually change people's minds? (00:23:26)The most dangerous AI progress might remain secret (00:30:39)White-knuckling the 12-month window after automated AI R&D (00:47:21)AI help is most valuable right before things go crazy (01:12:01)Foundations should go from paying researchers to paying for inference (01:24:42)Will frontier AI even be for sale during the explosion? (01:32:03)Pre-crunch prep: what we should do right now (01:43:59)A grantmaking trial by fire at Coefficient Giving (01:47:03)Sabbatical and reflections on effective altruism (02:07:45)The mundane factors that drive career satisfaction (02:37:17)EA as an incubator for avant-garde causes others won't touch (02:46:55)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourMusic: CORBITCoordination, transcriptions, and web: Katy Moore