The Alignment Problem Is a Human Problem
ai · 6 min read
AI alignment research assumes humans can specify what they want. Behavioural science says otherwise.
