I am a PhD student at The Hong Kong University of Science and Technology, Department of Computer Science and Engineering. I am fortunate to be advised by Junxian He. I am generally interested in large language models and currently working on model merging.

📝 Publications

Most recent publications on Google Scholar.
* denotes co-first authors, $^\dagger$ denotes corresponding author/main advisor

COLM 2024
sym

Compression Represents Intelligence Linearly

Yuzhen Huang*, Jinghan Zhang*, Zifei Shan, Junxian He$^\dagger$

paper | leaderboard | dataset |

  • Abstract: In this paper, we study the relationship between compression rate and intelligence of LLMs.
NeurIPS 2023
sym

Composing Parameter-Efficient Modules with Arithmetic Operations

Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He$^\dagger$

paper |

  • Abstract: In this paper, we study model merging on parameter-efficient modules like LoRA and (IA)^3.
NeurIPS 2023 D&B
sym

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Yuzhen Huang*, Yuzhuo Bai*, Zhihao Zhu, Junlei Zhang, Jinghan Zhang, Tangjun Su, Junteng Liu, Chuancheng Lv, Yikai Zhang, Jiayi Lei, Yao Fu, Maosong Sun, Junxian He$^\dagger$

paper | website | dataset |

NeurIPS 2023 D&B
sym

FELM: Benchmarking Factuality Evaluation of Large Language Models

Shiqi Chen, Yiran Zhao, Jinghan Zhang, I-Chun Chern, Siyang Gao, Pengfei Liu, Junxian He$^\dagger$

paper | website | dataset |

🌟 Service

Reviewer: NeurIPS, NLPCC

🎖 Awards

  • 2023.10 NeurIPS 2023 Scholar Award
  • 2023.06 Outstanding Undergraduate Thesis in SEU (top 3%)
  • 2021.12 National Scholarship

📖 Education

  • 2024.02 - now PhD student, Department of CSE, HKUST, Hong Kong SAR, China.
  • 2023.11 - 2024.01 Research Assistant, Department of CSE, HKUST, Hong Kong SAR, China.
  • 2019.09 - 2023.06 Undergraduate, Artificial Intelligence, Southeast University, Nanjing, China.