Hadas Orgad
Kempner Research Fellow

Contact Information
About
Hadas Orgad is a research fellow at the Kempner Institute at Harvard University.
Research Focus
Hadas Orgad investigates the internal mechanisms of AI models to better understand and mitigate failures in safety, fairness, and reliability. Her research bridges interpretability and practical deployment, focusing on harmful model behaviors such as hallucinations, bias, privacy violations, and unsafe outputs. By analyzing the internal structure of models, she develops actionable tools and interventions to improve model behavior and better align it with human values and incentives. Her long-term goal is to advance interpretability and control techniques so that AI systems are fully transparent, trustworthy, and steerable.