Academic Papers Archives - Reflective altruism

Revisiting the shutdown problem (Part 2: Informal arguments)

Informal arguments for Catastrophic Shutdown Difficulty include the Argument from Instrumental Convergence and the Empirical Argument.

June 26, 2026

Revisiting the shutdown problem (Part 1: Introduction)

The catastrophic shutdown problem is (roughly) the problem of designing AI systems that can be shut down when their acts would lead to existential catastrophe.

June 12, 2026

Papers I learned from (Part 8: Somebody should do something)

Brownstein, Madva and Kelly examine the role of personal choices in systemic change.

February 20, 2026

Instrumental convergence and power-seeking (Part 4: Conclusion)

This post draws lessons from our discussion of instrumental convergence and power-seeking

December 12, 2025

Instrumental convergence and power-seeking (Part 3: Turner et al.)

The most-discussed modern power-seeking theorem, due to Alex Turner and colleagues, also won’t do the trick

October 4, 2025

Papers I learned from (Part 7: Essays on longtermism)

Essays on longtermism brings together over four dozen scholars and practitioners to discuss longtermism.

September 5, 2025

Papers I learned from (Part 6: A timing problem for instrumental convergence)

Should we expect means-end rational agents to preserve their goals? Southan, Ward and Semler are skeptical.

August 22, 2025

Instrumental convergence and power-seeking (Part 2: Benson-Tilsen and Soares)

A leading power-seeking theorem due to Benson-Tilsen and Soares does not ground the needed form of instrumental convergence

June 27, 2025

Instrumental convergence and power-seeking (Part 1: Introduction)

Power-seeking theorems aim to formally demonstrate that artificial agents are likely to seek power in problematic ways. I argue that leading power-seeking theorems do not succeed.

May 16, 2025

Category: Academic Papers