• Hi there! I'm Zhenyu

    I'm a final-year Ph.D. student in the Department of Electrical and Computer Engineering at UT Austin, working with Prof. Atlas Wang. Previously, I had the opportunity to broaden my practical AI skills while collaborating with Prof. Beidi Chen at CMU and Dr. Yuandong Tian at Meta.

    My research aims to develop Efficient, Scalable, and Steerable machine learning algorithms and systems, with specific interests in: (i) Scalable Optimization for GenAI Model Training, (ii) Efficient Inference Algorithms, (iii) Long-Context Multimodal Modeling and Generation, and (iv) Mechanistic Interpretability & Reasoning Enhancement of Foundation Models.

    I was recognized as an ML and Systems Rising Star in 2025 and received the MLSys'25 Outstanding Paper Award (Honorable Mention), along with several travel and reviewer awards from prestigious conferences.

    I will be on the job market starting Fall 2025 and am actively seeking full-time research positions in industry. Please feel free to reach out if you believe I would be a good fit.

Education

The University of Texas at Austin
Sep. 2022 - Present
Ph.D. in Electrical and Computer Engineering
Advised by Prof. Atlas Wang.

University of Science and Technology of China
Sep. 2019 - May 2022
M.E. in Electrical and Computer Engineering
Advised by Prof. Bin Li.

University of Science and Technology of China
Sep. 2015 - May 2019
B.S. in Applied Physics
Yan Ji-Ci Talent Program in Physics

Selected Publications

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference

Zhenyu Zhang, Zechun Liu, Yuandong Tian, Harshit Khaitan, Zhangyang Wang, Steven Li

International Conference on Learning Representations (ICLR), 2025

[Paper] [Project] [Code]

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen

Neural Information Processing Systems (NeurIPS), 2023

[Paper] [Project] [Code] [Meta/llama-recipes] [Answer.AI] [Talk] [新智元]

Efficient Lottery Ticket Finding: Less Data is More

Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang (* Equal Contribution)

International Conference on Machine Learning (ICML), 2021

[Paper] [Project] [Code]

Robust Overfitting may be mitigated by properly learned smoothening

Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, Shiyu Chang, Zhangyang Wang (* Equal Contribution)

International Conference on Learning Representations (ICLR), 2021

[Paper] [Project] [Code]

Work Experience

  • Conduct research on enhancing the context awareness of long-context LLMs.
  • Work with Dr. Zhewei Yao and Dr. Xiaoxia Wu.