I am a PhD student at The Hong Kong University of Science and Technology, Department of Computer Science and Engineering. I am fortunate to be advised by Junxian He. I am generally interested in large language models and currently working on model merging.
📝 Publications
Most recent publications on Google Scholar.
* denotes co-first authors, $^\dagger$ denotes corresponding author/main advisor
COLM 2024
Compression Represents Intelligence Linearly
Yuzhen Huang*, Jinghan Zhang*, Zifei Shan, Junxian He$^\dagger$
paper | leaderboard | dataset |
- Abstract: In this paper, we study the relationship between compression rate and intelligence of LLMs.
NeurIPS 2023
Composing Parameter-Efficient Modules with Arithmetic Operations
Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He$^\dagger$
paper |
- Abstract: In this paper, we study model merging on parameter-efficient modules like LoRA and (IA)^3.
NeurIPS 2023 D&B
NeurIPS 2023 D&B
🌟 Service
Reviewer: NeurIPS, NLPCC
🎖 Awards
- 2023.10 NeurIPS 2023 Scholar Award
- 2023.06 Outstanding Undergraduate Thesis in SEU (top 3%)
- 2021.12 National Scholarship
📖 Education
- 2024.02 - now PhD student, Department of CSE, HKUST, Hong Kong SAR, China.
- 2023.11 - 2024.01 Research Assistant, Department of CSE, HKUST, Hong Kong SAR, China.
- 2019.09 - 2023.06 Undergraduate, Artificial Intelligence, Southeast University, Nanjing, China.