15 November 2023
Where Do Features Come From?
A story of sinusoids and inductive biases
By: Ben Edelman, Depen Morwani, Costin Oncescu, and Rosie Zhao
Mechanic interpretability results explained using known inductive biases.
9 November 2023
Watermarking in the Sand
By: Ben Edelman, Hanlin Zhang and Boaz Barak
Robust watermarking in AI is impossible under natural assumptions.