I am a third-year undergraduate student ('26) in the School of EECS, Peking University. I am also a member of the Zhi-Class, an honors program in Artificial Intelligence at Peking University.
Currently, I am a Research Intern at Dartmouth College, advised by Prof. Yaoqing Yang.
My research interests include:
- Interpretability: Understanding the mechanisms, dynamics, and generalization of Large Language Models (LLMs) and other machine learning models.
- AI Safety: Ensuring the safe behavior of LLMs and autonomous agents.
- Memory Systems: Developing and analyzing memory systems in LLMs and agents.
Recent News
- [05/2025] I will be joining Dartmouth College as a research intern, advised by Prof. Yaoqing Yang.
- [01/2025] One paper was accepted to ICLR 2025.
- [05/2024] I started a summer internship at GersteinLab, Yale University.
- [04/2024] I completed an algorithm internship at the startup LiblibAI. This experience clarified my passion for fundamental research and solidified my decision to pursue a PhD.
Publications
- ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning
Xiangru Tang*, Tianyu Hu*, Muyang Ye*, Yanjun Shao*, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein.
ICLR 2025
“Enable LLMs to continuously improve through experience.”
[PDF] [Abstract] [BibTeX]
Abstract: Chemical reasoning usually involves complex, multi-step processes that demand precise calculations, where even minor errors can lead to cascading failures. Furthermore, large language models (LLMs) encounter difficulties handling domain-specific formulas, executing reasoning steps accurately, and integrating code effectively when tackling chemical reasoning tasks. To address these challenges, we present ChemAgent, a novel framework designed to improve the performance of LLMs through a dynamic, self-updating library. This library is developed by decomposing chemical tasks into sub-tasks and compiling these sub-tasks into a structured collection that can be referenced for future queries. Then, when presented with a new problem, ChemAgent retrieves and refines pertinent information from the library, which we call memory, facilitating effective task decomposition and the generation of solutions. Our method designs three types of memory and a library-enhanced reasoning component, enabling LLMs to improve over time through experience. Experimental results on four chemical reasoning datasets from SciBench demonstrate that ChemAgent achieves performance gains of up to 46% (GPT-4), significantly outperforming existing methods. Our findings suggest substantial potential for future applications, including tasks such as drug discovery and materials science.
@inproceedings{tang2025chemagent,
title={ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning},
author={Tang, Xiangru and Hu, Tianyu and Ye, Muyang and Shao, Yanjun and Yin, Xunjian and Ouyang, Siru and Zhou, Wangchunshu and Lu, Pan and Zhang, Zhuosheng and Zhao, Yilun and others},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025}
}
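As a rough illustration of the self-updating memory loop described in the abstract above, here is a minimal Python sketch of the general idea: retrieve pertinent entries from a library for a new problem, generate a solution with them as hints, then write the result back for future queries. All names (MemoryLibrary, solve_with_memory, llm_solve) and the naive word-overlap retrieval are illustrative assumptions, not ChemAgent's actual implementation.

```python
# Minimal sketch of a self-updating memory loop, in the spirit of ChemAgent.
# All names and the retrieval heuristic are illustrative assumptions.
from dataclasses import dataclass, field

@dataclass
class MemoryLibrary:
    """Stores (sub-task, solution) pairs; retrieves them by word overlap."""
    entries: list = field(default_factory=list)

    def retrieve(self, query: str, k: int = 3):
        # Rank stored sub-tasks by naive word overlap with the query.
        q = set(query.lower().split())
        scored = sorted(
            self.entries,
            key=lambda e: len(q & set(e[0].lower().split())),
            reverse=True,
        )
        return scored[:k]

    def update(self, task: str, solution: str):
        # Self-update: append the new (sub-task, solution) pair.
        self.entries.append((task, solution))

def llm_solve(task: str, hints) -> str:
    # Placeholder for an LLM call conditioned on retrieved memories.
    return f"solution({task}; hints={len(hints)})"

def solve_with_memory(task: str, library: MemoryLibrary) -> str:
    hints = library.retrieve(task)      # retrieve pertinent memories
    solution = llm_solve(task, hints)   # generate a solution using hints
    library.update(task, solution)      # store the result for future queries
    return solution

if __name__ == "__main__":
    lib = MemoryLibrary()
    print(solve_with_memory("compute molar mass of H2O", lib))
    print(solve_with_memory("compute molar mass of CO2", lib))  # reuses memory
```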