Songyang Zhang

Young Scientist at Shanghai AI Laboratory, Shanghai, China.

I am a Young Scientist at Shanghai AI Laboratory, collaborating with Dr. Kai Chen. Previously, I was a postdoctoral researcher supervised by Prof. Dahua Lin. I currently lead a team working on multi-modality models and large language models, covering both research and open-source platforms. My team develops and maintains OpenCompass, an evaluation platform for foundation models, and the OpenMMLab project MMPreTrain.

OpenCompass MMPreTrain

I obtained my Ph.D. in Computer Science from the University of Chinese Academy of Sciences (UCAS) in 2022, through the joint program at PLUS Lab, ShanghaiTech University. I received my B.Sc. degree from Beihang University in 2017.

Open positions include full-time researchers/engineers and interns; feel free to contact me by email. Research directions include the evaluation and application of large language models (tool use, reasoning, safety, robustness, etc.) and multi-modality learning (vision-/audio-language learning), among others.

News

Jul 20, 2024 I have finished my postdoctoral research project and joined Shanghai AI Lab as a Young Scientist.
Jul 1, 2024 MMBench is accepted by ECCV 2024! Congratulations to Yuan Liu.
Jun 16, 2024 I was awarded the WAIC 2024 Rising Star (15 AI researchers in total).
May 16, 2024 4 papers (2 main + 2 findings) accepted to ACL 2024.
Mar 26, 2024 The InternLM2 technical report has been released; see InternLM2 for more details.
Mar 14, 2024 Three papers accepted by NAACL 2024: Fake Alignment, BotChat, and AdaEval.
Feb 29, 2024 One paper on “SGG with Vision-language Model” is accepted by CVPR 2024; congratulations to Rongjie.
Nov 10, 2023 One paper on “T-Eval” is on arXiv; see T-Eval for more details.
Nov 10, 2023 One paper on “Scene Graph Generation” is accepted by T-PAMI; congratulations to Rongjie. See SGTR+ for more details.
Oct 30, 2023 One paper on “Evaluating LLMs’ Multi-round Chatting Capability” is on arXiv; see BotChat for more details.

Selected Publications (Full List)

  1. ACL
    Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations
    Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, and 3 more authors
    In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2024
  2. ACL
    MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
    Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, and 5 more authors
    In Findings of the Association for Computational Linguistics (ACL), 2024
  3. ACL
    T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
    Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, and 6 more authors
    In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2024
  4. ArXiv
    InternLM2 Technical Report
    Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, and 95 more authors
    2024
  5. ArXiv
    InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
    Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, and 18 more authors
    2024
  6. CVPR
    From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
    Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, and Xuming He
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  7. NAACL
    Fake Alignment: Are LLMs Really Aligned Well?
    Yixu Wang, Yan Teng, Kexin Huang, Chengqi Lyu, Songyang Zhang, and 3 more authors
    In Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
  8. T-PAMI
    SGTR+: End-to-end Scene Graph Generation with Transformer
    Rongjie Li, Songyang Zhang, and Xuming He
    In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
  9. NAACL
    BotChat: Evaluating LLMs’ Capabilities of Having Multi-Turn Dialogues
    Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, and 3 more authors
    In Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
  10. ArXiv
    MMBench: Is Your Multi-modal Model an All-around Player?
    Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, and 7 more authors
    In arXiv Preprint, 2023
  11. ICCV
    Improving Pixel-based MIM by Reducing Wasted Modeling Capability
    Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
  12. CVPR
    RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer
    Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, and 4 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  13. CVPR
    SGTR: End-to-end Scene Graph Generation with Transformer
    Rongjie Li, Songyang Zhang, and Xuming He
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  14. NeurIPS
    Dynamic Grained Encoder for Vision Transformers
    Lin Song*, Songyang Zhang*, Songtao Liu, Zeming Li, Xuming He, and 3 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2021
  15. CVPR
    Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
    Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, and Jian Sun
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
  16. ICML
    LatentGNN: Learning Efficient Non-local Relations for Visual Recognition
    Songyang Zhang, Shipeng Yan, and Xuming He
    In Proceedings of the 36th International Conference on Machine Learning (ICML), 2019