16 April 2024

Distinguishing the Knowable from the Unknowable with Language Models

By: Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, and Ben Edelman

A new way to label different types of uncertainty in unconstrained text and simple methods to predict those labels, including a completely unsupervised approach.

5 February 2024

Repeat After Me: Transformers are Better than State Space Models at Copying

By: Samy Jelassi, David Brandfonbrener, Sham Kakade, and Eran Malach

The improved efficiency of State Space Models sacrifices some capabilities that are core to modern LLMs.

7 December 2023

A Next-Generation Architecture for Elastic and Conditional Computation
The Matryoshka Way

By: Aditya Kusupati, Sneha Kudugunta, Devvrit, and Tim Dettmers

Introducing an algorithmic method to elastically deploy large models: the #MatFormer.

15 November 2023

Where Do Features Come From?
A story of sinusoids and inductive biases

By: Ben Edelman, Depen Morwani, Costin Oncescu, and Rosie Zhao

Mechanistic interpretability results explained using known inductive biases.

9 November 2023

Watermarking in the Sand

By: Ben Edelman, Hanlin Zhang, and Boaz Barak

Robust watermarking in AI is impossible under natural assumptions.