I am a second-year PhD student at The Hong Kong University of Science and Technology, Department of Computer Science and Engineering. I am fortunate to be advised by Junxian He. I am generally interested in large language models and vision language models, with experiences in model merging, long context modeling and multiturn reasoning.

📝 Publications

Most recent publications on Google Scholar.
* denotes co-first authors, $^\dagger$ denotes corresponding author/main advisor

ICML 2025
sym

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Shiqi Chen*, Jinghan Zhang*, Tongyao Zhu, Wei Liu, Siyang Gao, Miao Xiong, Manling Li, Junxian He$^\dagger$

paper |

  • Abstract: We enhance VLM reasoning via model merging and understand perception and reasoning ability inside model.
ICML 2025
sym

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Shiqi Chen, Tongyao Zhu, Ruochen Zhou, Jinghan Zhang, Siyang Gao, Juan Carlos Niebles, Mor Geva, Junxian He, Jiajun Wu, Manling Li$^\dagger$

paper | dataset |

  • Abstract: A training-free decoding method called AdaptVis boosts VLM’s spatial reasoning by dynamically sharpening or broadening attention based on confidence, yielding up to 50-point accuracy gains on benchmarks.
COLM 2024
sym

Compression Represents Intelligence Linearly

Yuzhen Huang*, Jinghan Zhang*, Zifei Shan, Junxian He$^\dagger$

paper | leaderboard | dataset |

  • Abstract: In this paper, we study the relationship between compression rate and intelligence of LLMs.
NeurIPS 2023
sym

Composing Parameter-Efficient Modules with Arithmetic Operations

Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He$^\dagger$

paper |

  • Abstract: In this paper, we study model merging on parameter-efficient modules like LoRA and (IA)^3.
NeurIPS 2023 D&B
sym

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Yuzhen Huang*, Yuzhuo Bai*, Zhihao Zhu, Junlei Zhang, Jinghan Zhang, Tangjun Su, Junteng Liu, Chuancheng Lv, Yikai Zhang, Jiayi Lei, Yao Fu, Maosong Sun, Junxian He$^\dagger$

paper | website | dataset |

NeurIPS 2023 D&B
sym

FELM: Benchmarking Factuality Evaluation of Large Language Models

Shiqi Chen, Yiran Zhao, Jinghan Zhang, I-Chun Chern, Siyang Gao, Pengfei Liu, Junxian He$^\dagger$

paper | website | dataset |

🌟 Service

Reviewer: NeurIPS, ICLR, ICML, COLM, ACL Demo, NLPCC

🎖 Awards

  • 2024.9 COLM 2024 DEI Scholarship
  • 2023.10 NeurIPS 2023 Scholar Award
  • 2023.06 Outstanding Undergraduate Thesis in SEU (top 3%)
  • 2021.12 National Scholarship

📖 Education

  • 2024.02 - now PhD student, Department of CSE, HKUST, Hong Kong SAR, China.
  • 2023.11 - 2024.01 Research Assistant, Department of CSE, HKUST, Hong Kong SAR, China.
  • 2019.09 - 2023.06 Undergraduate, Artificial Intelligence, Southeast University, Nanjing, China.