Deeper Learning

A research blog from the Kempner Institute for the Study of Natural and Artificial Intelligence at Harvard University.

Blog List

2025

29 October 2025

Boomerang Distillation Enables Zero-Shot Model Size Interpolation

By: Sara Kangaslahti, Nihal Nayak, Jonathan Geuter

The authors identify a novel phenomenon, Boomerang Distillation, which occurs when distilling a large language model into a smaller one. In this blog post, they describe how Boomerang Distillation can be used to create, from a single student-teacher pair, entire families of LLMs of fine-grained sizes without any additional training.

9 October 2025

From Models to Scientists: Building AI Agents for Scientific Discovery

By: Shanghua Gao, Richard Zhu, Marinka Zitnik

ToolUniverse is a framework for developing AI agents for science, often referred to as “AI scientists.” It provides an environment where LLMs interact with more than six hundred scientific tools, including machine learning models, databases, and simulators. ToolUniverse standardizes how AI models access and combine these tools, allowing researchers to develop, test, and evaluate AI agents for science.

4 August 2025

ANN-like Synapses in the Brain Mediate Online Reinforcement Learning

By: Shun Li

The authors show that a type of synapse in the brain challenges a long-held assumption about synaptic plasticity rules. These synapses switch between being more excitatory and more inhibitory in an experience-dependent manner, and contribute to online dopamine updates during reinforcement learning.

28 July 2025

Solvable Model of In-Context Learning Using Linear Attention

By: Mary Letey

This work provides a sharp characterization of in-context learning (ICL) in an analytically solvable model, which offers insights into the sample complexity and data quality requirements for ICL to emerge. These insights can be applied to more complex, realistic architectures.