Publications

Please see Google Scholar for more recent works and arXiv papers.

* : Equal contribution †: Corresponding author.

2025

  1. arXiv
    mmcot.png
    Look Shallow, Think Deep: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do
    Zhuoran Jin*, Kejian Zhu*, Hongbang Yuan, Yupu Hao, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    arXiv preprint (arXiv), 2025
  2. arXiv
    omni.png
    Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
    Zhuoran Jin*, Hongbang Yuan*, Kejian Zhu*, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    arXiv preprint (arXiv), 2025
  3. arXiv
    mmrv.png
    MMR-V: What’s Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
    Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    arXiv preprint (arXiv), 2025
  4. arXiv
    rule.png
    RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality
    Chenlong Zhang, Zhuoran Jin, Hongbang Yuan, Jiaheng Wei, Tong Zhou, Kang Liu, Jun Zhao, and Yubo Chen
    arXiv preprint (arXiv), 2025
  5. ACL
    Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents
    Tianyi Men, Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2025
  6. ACL
    A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns
    Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2025
  7. ACL
    Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity
    Yupu Hao, Pengfei Cao, Zhuoran Jin, Huanxuan Liao, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2025
  8. ACL
    Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
    Kejian Zhu, Shangqing Tu, Zhuoran Jin, Lei Hou, Juanzi Li, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2025
  9. ACL Findings
    ragreward.png
    RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
    Zhuoran Jin, Hongbang Yuan, Tianyi Men, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL Findings) , 2025
  10. ICLR
    MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
    Jiachun Li, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, and Jun Zhao
    In International Conference on Learning Representations (ICLR) , 2025
  11. NAACL
    DTELS: Towards Dynamic Granularity of Timeline Summarization
    Chenlong Zhang, Tong Zhou, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL) , 2025
  12. NAACL Findings
    Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models
    Hongbang Yuan, Yubo Chen, Pengfei Cao, Zhuoran Jin, and Kang Liu
    In Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL Findings) , 2025
  13. AAAI
    lau.png
    Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models
    Hongbang Yuan*Zhuoran Jin*, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual AAAI Conference on Artificial Intelligence (AAAI) , 2025
  14. AAAI
    CITI: Enhancing Tool Utilizing Ability in Large Language Models Without Sacrificing General Performance
    Yupu Hao, Pengfei Cao, Zhuoran Jin, Huanxuan Liao, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual AAAI Conference on Artificial Intelligence (AAAI) , 2025

2024

  1. NeurIPS
    rwku.png
    RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
    Zhuoran Jin, Pengfei Cao, Chenhao Wang, Zhitao He, Hongbang Yuan, Jiachun Li, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual Conference on Neural Information Processing Systems (NeurIPS) , 2024
  2. EMNLP
    Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models
    Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2024
  3. EMNLP
    Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models
    Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2024
  4. EMNLP Findings
    LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
    Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) , 2024
  5. EMNLP Findings
    AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation
    Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) , 2024
  6. ACL
    Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
    Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2024
  7. ACL
    MULFE: A Multi-Level Benchmark for Free Text Model Editing
    Chenhao Wang, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL) , 2024
  8. ACL Findings
    ph3.png
    Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models
    Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL Findings) , 2024
  9. COLING
    tug.png
    Tug-of-War between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
    Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, and Jun Zhao
    In International Conference on Computational Linguistics (COLING) , 2024
  10. COLING
    Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
    Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, and Jun Zhao
    In International Conference on Computational Linguistics (COLING) , 2024

2023

  1. EMNLP Findings
    instructor.png
    InstructoR: Instructing Unsupervised Conversational Dense Retrieval with Large Language Models
    Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) , 2023
  2. EMNLP Findings
    Alignment Precedes Fusion: Open-Vocabulary Named Entity Recognition as Context-Type Semantic Matching
    Zhuoran Jin, Pengfei Cao, Zhitao He, Yubo Chen, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) , 2023
  3. AAAI
    Zero-Shot Cross-Lingual Event Argument Extraction with Language-Oriented Prefix-Tuning
    Pengfei Cao*Zhuoran Jin*, Yubo Chen, Kang Liu, and Jun Zhao
    In Annual AAAI Conference on Artificial Intelligence (AAAI) , 2023

2022

  1. EMNLP
    A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing
    Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2022
  2. EMNLP Demo
    cogktr.png
    CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding
    Zhuoran Jin*, Tianyi Men*, Hongbang Yuan*, Yuyang Zhou, Pengfei Cao, Yubo Chen, Zhipeng Xue, Kang Liu, and Jun Zhao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP Demo) , 2022
  3. ACL Demo
    cogkge.png
    CogKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge
    Zhuoran Jin*, Tianyi Men*, Hongbang Yuan*, Zhitao He, Dianbo Sui, Chenhao Wang, Zhipeng Xue, Yubo Chen, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL Demo) , 2022

2021

  1. ACL Demo
    cogie.png
    CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet
    Zhuoran Jin, Yubo Chen, Dianbo Sui, Chenhao Wang, Zhipeng Xue, and Jun Zhao
    In Annual Meeting of the Association for Computational Linguistics (ACL Demo) , 2021