Songyang Zhang

I am now with Hunyuan, Tencent, focusing on post-training for vision-language models (VLMs), including data, reinforcement learning, and evaluation. Previously, I was a Young Scientist at Shanghai AI Laboratory, where I collaborated with Dr. Kai Chen, and before that, a postdoctoral researcher supervised by Prof. Dahua Lin. I also led work on foundation model research and open-source platforms, and initiated the OpenCompass project for foundation model evaluation and analysis.

My team also contributes to the InternLM and InternVL, working on the research and open-source of large language model/vision-language model. We also developed OpenMMLab projects MMPreTrain.

I obtained Ph.D. in Computer Science at the University of Chinese Academy of Science(UCAS), in the joint program at PLUS Lab, ShanghaiTech Univeristy in 2022. I got my B.Sc. degree in 2017 from Beihang University.

Open positions include full-time researchers/engineers and interns, feel free to contact me through the email. Research directions include: VLM post-training (data, reinforcement learning, and evaluation), Post-training and Alignment of LLMs/VLMs, Evaluation and Analysis of Foundation Model, Data-centric AI, etc.

News

Sep 24, 2025	I was honored to be inivited to serve as the area chair(AC) for ICLR 2026. Looking forward to an exciting conference!
Aug 24, 2025	1 paper on long-context capability(NeedleBench) has been accepted by Transactions on Machine Learning Research(TMLR).
Aug 20, 2025	CompassVerifier has been accepted by EMNLP 2025.
Aug 2, 2025	Our paper “Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law” has been selected as Outstanding Paper Award of ACL 2025.
Jul 20, 2025	1 paper on RL for LLM(OREAL) has been accepted by COLM 2025.
Jun 29, 2025	1 paper on diffusion model for image generation accepted to ICCV 2025.
May 21, 2025	4 papers (2 main + 2 findings) have been accepted to ACL 2025. One of the papers has been selected as an oral presentation.
Dec 28, 2024	I am honored to serve as the Area Chair (AC) for the ACL Rolling Review (ARR) in December 2024.
Dec 9, 2024	We are excited to announce that one of our papers has been accepted for presentation at AAAI 2025!
Sep 27, 2024	Three papers(2 conference papers and 1 DB track paper) are accepted by NeurIPS 2024!

Selected Publications(Full List)

ACL

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law

Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, and 6 more authors

In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Bib

@inproceedings{ge2025capability,
  title = {Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law},
  author = {Ge, Qiming and Xing, Shuhao and Gao, Songyang and Zhou, Yunhua and Zou, Yicheng and Zhang, Songyang and Chen, Zhi and Yan, Hang and Zhang, Qi and Guo, Qipeng and Chen, Kai},
  booktitle = {Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL),},
  year = {2025}
}

ACL

Are Your LLMs Capable of Stable Reasoning?

Junnan Liu, Hongwei Liu, Linchen Xiao, Ziyi Wang, Kuikun Liu, and 4 more authors

In Findings of the Association for Computational Linguistics (ACL), 2025

Bib

@inproceedings{liu2025stable,
  title = {Are Your LLMs Capable of Stable Reasoning?},
  author = {Liu, Junnan and Liu, Hongwei and Xiao, Linchen and Wang, Ziyi and Liu, Kuikun and Gao, Songyang and Zhang, Wenwei and Zhang, Songyang and Chen, Kai},
  booktitle = {Findings of the Association for Computational Linguistics (ACL),},
  year = {2025}
}

ACL

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Maosong Cao, Taolin Zhang, Mo Li, Chuyu Zhang, Yunxin Liu, and 3 more authors

In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Bib

@inproceedings{cao2025condor,
  title = {Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement},
  author = {Cao, Maosong and Zhang, Taolin and Li, Mo and Zhang, Chuyu and Liu, Yunxin and Duan, Haodong and Zhang, Songyang and Chen, Kai},
  booktitle = {Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL),},
  year = {2025}
}

NeurIPS

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, and 19 more authors

In Proceeding of Advances in Neural Information Processing Systems (NeurIPS), 2024

Bib

@inproceedings{dong2024internlmxcomposer2_4khd,
  title = {InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD},
  author = {Dong, Xiaoyi and Zhang, Pan and Zang, Yuhang and Cao, Yuhang and Wang, Bin and Ouyang, Linke and Zhang, Songyang and Duan, Haodong and Zhang, Wenwei and Li, Yining and Yan, Hang and Gao, Yang and Chen, Zhe and Zhang, Xinyue and Li, Wei and Li, Jingwen and Wang, Wenhai and Chen, Kai and He, Conghui and Zhang, Xingcheng and Dai, Jifeng and Qiao, Yu and Lin, Dahua and Wang, Jiaqi},
  booktitle = {Proceeding of Advances in Neural Information Processing Systems (NeurIPS),},
  year = {2024}
}

EMNLP

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Zhuo Jingming, Zhang Songyang, Fang Xinyu, Duan Haodong, Lin Dahua, and 1 more author

In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Bib

@inproceedings{zhuo2024prosa,
  title = {ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs},
  author = {Jingming, Zhuo and Songyang, Zhang and Xinyu, Fang and Haodong, Duan and Dahua, Lin and Kai, Chen},
  booktitle = {Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP),},
  year = {2024},
}

EMNLP

LawBench: Benchmarking Legal Knowledge of Large Language Models

Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, and 4 more authors

In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Bib Code

@inproceedings{fei2023lawbench,
  title = {LawBench: Benchmarking Legal Knowledge of Large Language Models},
  author = {Fei, Zhiwei and Shen, Xiaoyu and Zhu, Dawei and Zhou, Fengzhe and Han, Zhuo and Zhang, Songyang and Chen, Kai and Shen, Zongwen and Ge, Jidong},
  booktitle = {Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP),},
  year = {2024},
}

ACL

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, and 5 more authors

In Findings of the Association for Computational Linguistics (ACL), 2024

Bib

@inproceedings{liu2024mathbench,
  title = {MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark},
  author = {Liu, Hongwei and Zheng, Zilong and Qiao, Yuxuan and Duan, Haodong and Fei, Zhiwei and Zhou, Fengzhe and Zhang, Wenwei and Zhang, Songyang and Lin, Dahua and Chen, Kai},
  booktitle = {Findings of the Association for Computational Linguistics (ACL),},
  year = {2024}
}

ACL

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, and 6 more authors

In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Bib Code

@inproceedings{chen2024t,
  title = {T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step},
  author = {Chen, Zehui and Du, Weihua and Zhang, Wenwei and Liu, Kuikun and Liu, Jiangning and Zheng, Miao and Zhuo, Jingming and Zhang, Songyang and Lin, Dahua and Chen, Kai and Zhao, Feng},
  booktitle = {Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL),},
  year = {2024}
}

ArXiv

InternLM2 Technical Report

Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, and 95 more authors

2024

Bib Code

@misc{cai2024internlm2,
  title = {InternLM2 Technical Report},
  author = {Cai, Zheng and Cao, Maosong and Chen, Haojiong and Chen, Kai and Chen, Keyu and Chen, Xin and Chen, Xun and Chen, Zehui and Chen, Zhi and Chu, Pei and Dong, Xiaoyi and Duan, Haodong and Fan, Qi and Fei, Zhaoye and Gao, Yang and Ge, Jiaye and Gu, Chenya and Gu, Yuzhe and Gui, Tao and Guo, Aijia and Guo, Qipeng and He, Conghui and Hu, Yingfan and Huang, Ting and Jiang, Tao and Jiao, Penglong and Jin, Zhenjiang and Lei, Zhikai and Li, Jiaxing and Li, Jingwen and Li, Linyang and Li, Shuaibin and Li, Wei and Li, Yining and Liu, Hongwei and Liu, Jiangning and Hong, Jiawei and Liu, Kaiwen and Liu, Kuikun and Liu, Xiaoran and Lv, Chengqi and Lv, Haijun and Lv, Kai and Ma, Li and Ma, Runyuan and Ma, Zerun and Ning, Wenchang and Ouyang, Linke and Qiu, Jiantao and Qu, Yuan and Shang, Fukai and Shao, Yunfan and Song, Demin and Song, Zifan and Sui, Zhihao and Sun, Peng and Sun, Yu and Tang, Huanze and Wang, Bin and Wang, Guoteng and Wang, Jiaqi and Wang, Jiayu and Wang, Rui and Wang, Yudong and Wang, Ziyi and Wei, Xingjian and Weng, Qizhen and Wu, Fan and Xiong, Yingtong and Xu, Chao and Xu, Ruiliang and Yan, Hang and Yan, Yirong and Yang, Xiaogui and Ye, Haochen and Ying, Huaiyuan and Yu, Jia and Yu, Jing and Zang, Yuhang and Zhang, Chuyu and Zhang, Li and Zhang, Pan and Zhang, Peng and Zhang, Ruijie and Zhang, Shuo and Zhang, Songyang and Zhang, Wenjian and Zhang, Wenwei and Zhang, Xingcheng and Zhang, Xinyue and Zhao, Hui and Zhao, Qian and Zhao, Xiaomeng and Zhou, Fengzhe and Zhou, Zaida and Zhuo, Jingming and Zou, Yicheng and Qiu, Xipeng and Qiao, Yu and Lin, Dahua},
  year = {2024},
  booktitle = {arXiv Preprint,},
}

ArXiv

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, and 18 more authors

2024

Bib Code

@article{dong2024internlmxcomposer2,
  title = {InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model},
  author = {Dong, Xiaoyi and Zhang, Pan and Zang, Yuhang and Cao, Yuhang and Wang, Bin and Ouyang, Linke and Wei, Xilin and Zhang, Songyang and Duan, Haodong and Cao, Maosong and Zhang, Wenwei and Li, Yining and Yan, Hang and Gao, Yang and Zhang, Xinyue and Li, Wei and Li, Jingwen and Chen, Kai and He, Conghui and Zhang, Xingcheng and Qiao, Yu and Lin, Dahua and Wang, Jiaqi},
  year = {2024},
  booktitle = {arXiv Preprint,},
}

CVPR

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, and Xuming He

In Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

T-PAMI

SGTR+: End-to-end Scene Graph Generation with Transformer

Rongjie Li, Songyang Zhang, and Xuming He

In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

Bib

@inproceedings{li2023sgtrplus,
  title = {SGTR+: End-to-end Scene Graph Generation with Transformer},
  author = {Li, Rongjie and Zhang, Songyang and He, Xuming},
  booktitle = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI),},
  year = {2024},
}

ECCV

MMBench: Is Your Multi-modal Model an All-around Player?

Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, and 7 more authors

In Proceeding of the European Conference on Computer Vision (ECCV), 2024

Bib HTML Code

@inproceedings{liu2023mmbench,
  title = {MMBench: Is Your Multi-modal Model an All-around Player?},
  author = {Liu, Yuan and Duan, Haodong and Zhang, Yuanhan and Li, Bo and Zhang, Songyang and Zhao, Wangbo and Yuan, Yike and Wang, Jiaqi and He, Conghui and Liu, Ziwei and Chen, Kai and Lin, Dahua},
  booktitle = {Proceeding of the European Conference on Computer Vision (ECCV),},
  year = {2024},
}

ICCV

Improving Pixel-based MIM by Reducing Wasted Modeling Capability

Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, and 1 more author

In Proceedings of the IEEE/CVF International Conference on Computer Vision(ICCV), 2023

Bib HTML Code

@inproceedings{liu2023mff,
  title = {Improving Pixel-based MIM by Reducing Wasted Modeling Capability},
  author = {Liu, Yuan and Zhang, Songyang and Chen, Jiacheng and Yu, Zhaohui and Chen, Kai and Lin, Dahua},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision(ICCV),},
  year = {2023},
}

CVPR

RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer

Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, and 4 more authors

In Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Bib HTML Code

@inproceedings{wang2022riformer,
  title = {RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer},
  author = {Wang, Jiahao and Zhang, Songyang and Liu, Yong and Wu, Taiqiang and Yang, Yujiu and Liu, Xihui and Chen, Kai and Luo, Ping and Lin, Dahua},
  booktitle = {Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),},
  year = {2023},
}

NeurIPS

Dynamic Grained Encoder for Vision Transformers

Lin Song*, Songyang Zhang*, Songtao Liu, Zeming Li, Xuming He, and 3 more authors

In Proceeding of Advances in Neural Information Processing Systems (NeurIPS), 2021

Bib HTML Code

@inproceedings{lin2021dynamic,
  author = {Song*, Lin and Zhang*, Songyang and Liu, Songtao and Li, Zeming and He, Xuming and Sun, Hongbin and Sun, Jian and Zheng, Nanning},
  booktitle = {Proceeding of Advances in Neural Information Processing Systems (NeurIPS),},
  year = {2021},
}

CVPR

Distribution Alignment: A Unified Framework for Long-tail Visual Recognition

Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, and Jian Sun

In Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Bib HTML Code

@inproceedings{zhang2021distribution,
  title = {Distribution Alignment: A Unified Framework for Long-tail Visual Recognition},
  author = {Zhang, Songyang and Li, Zeming and Yan, Shipeng and and He, Xuming and Sun, Jian},
  booktitle = {Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),},
  year = {2021},
}

ICML

LatentGNN: Learning Efficient Non-local Relations for Visual Recognition

Songyang Zhang, Shipeng Yan, and Xuming He

In Proceeding of the 36th International Conference on Machine Learning (ICML),, 2019

Bib HTML Code

@inproceedings{zhang2019latent,
  title = {LatentGNN: Learning Efficient Non-local Relations for Visual Recognition},
  author = {Zhang, Songyang and Yan, Shipeng and He, Xuming},
  booktitle = {Proceeding of the 36th International Conference on Machine Learning (ICML),,},
  year = {2019},
}

Awards

ACL Outstanding Paper Award, 2025

WAIC Yunfan Award Rising Star(15 AI researchers in total), 2024

Webly-supervised Fine-grained Image Classification , Champion, ACCV 2022. [Code-P1][Code-P2][Certificate]

System Design Contest(SDC) ,Champion, DAC 2021

LVIS Challenge, 2nd Place, ICCV 2021

Workshop on Autonomous Driving(WAD) streaming detection challenge,Champion, CVPR 2021