Tag: AI safety

Instrumental convergence and power-seeking (Part 2: Benson-Tilsen and Soares)
A leading power-seeking theorem due to Benson-Tilsen and Soares does not ground the needed form of instrumental convergence.

Papers I learned from (Part 5: Language agents reduce the risk of existential catastrophe)
Simon Goldstein and Cameron Domenico Kirk-Giannini argue that language agents reduce the risk of existential catastrophe from artificial intelligence.


