GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian
ICML 2024 Oral / 
Paper / 
Code / 
Hacker News / 
HuggingFace / 
LLaMA-Factory / 
Axolotl / 
AICoffeeBreak
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen
NeurIPS 2023 / 
Paper / 
Blog / 
Code / 
llama-recipes / 
Media (AI era/新智元)
Q-Hitter: A Better Token Oracle for Efficient LLM Inference via Sparse-Quantized KV Cache
Zhenyu Zhang*, Shiwei Liu*, Runjin Chen, Bhavya Kailkhura, Beidi Chen, Zhangyang Wang
MLSys 2024 / 
Paper / 
Code
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Pingzhi Li, Zhenyu Zhang, Prateek Yadav, Yi-Lin Sung, Yu Cheng, Mohit Bansal, Tianlong Chen
ICLR 2024 Spotlight / 
Paper / 
Code
JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention
Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Du
ICLR 2024 / 
Paper
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal, Shiwei Liu, Zhangyang Wang
ICLR 2023 Spotlight / 
Paper / 
Code
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu*, Tianlong Chen*, Zhenyu Zhang, Xuxi Chen, Tianjin Huang, Ajay Jaiswal, Zhangyang Wang
ICLR 2023 Spotlight / 
Paper / 
Code
Sparse Winning Tickets are Data-Efficient Image Recognizers
Mukund Varma T, Xuxi Chen, Zhenyu Zhang, Tianlong Chen, Subhashini Venugopalan, Zhangyang Wang
NeurIPS 2022 Spotlight / 
Paper / 
Code
Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets
Ruisi Cai*, Zhenyu Zhang*, Tianlong Chen, Xiaohan Chen, Zhangyang Wang
NeurIPS 2022 / 
Paper / 
Code
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free
Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu Chang, Sijia Liu, Zhangyang Wang
CVPR 2022 / 
Paper / 
Code
Sparsity Winning Twice: Better Robust Generalization from More Efficient Training
Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang Wang
ICLR 2022 / 
Paper / 
Code
Efficient Lottery Ticket Finding: Less Data is More
Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang
ICML 2021 / 
Paper / 
Code
Robust Overfitting May be Mitigated by Properly Learned Smoothening
Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, Shiyu Chang, Zhangyang Wang
ICLR 2021 / 
Paper / 
Code
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, Shiyu Chang, Zhangyang Wang
ICLR 2021 / 
Paper / 
Code
GANs Can Play Lottery Tickets Too
Xuxi Chen*, Zhenyu Zhang*, Yongduo Sui, Tianlong Chen
ICLR 2021 / 
Paper / 
Code
|