Publications

See also: Google Scholar, DBLP

Open-source Codes

WebAgent-R1  AdaDecode  InstructRAG  CasRel  BERT-NER 

Preprints / In Submission

  1. [Preprint] TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
    Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, Mohammad Kachuee, Teja Gollapudi, Tony Liao, Nicolas Scheffer, Rakesh Wanga, Anuj Kumar, Yu Meng, Wen-tau Yih, Xin Luna Dong

  2. [Preprint] Do LLM Evaluators Prefer Themselves for a Reason?
    Wei-Lin Chen, Zhepei Wei, Xinyu Zhu, Shi Feng, Yu Meng

  3. [Preprint] Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents
    Yiding Wang, Zhepei Wei, Xinyu Zhu, Yu Meng

  4. [Preprint] Aligning Large Language Models via Fully Self-Synthetic Data
    Shangjian Yin, Zhepei Wei, Xinyu Zhu, Wei-Lin Chen, Yu Meng

Conference Papers

  1. [NeurIPS 2025] The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning [code]
    Xinyu Zhu, Mengzhou Xia, Zhepei Wei, Wei-Lin Chen, Danqi Chen, Yu Meng

  2. [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning [code]
    Zhepei Wei, Wenlin Yao, Yao Liu, Weizhi Zhang, Qin Lu, Liang Qiu, Changlong Yu, Puyang Xu, Chao Zhang, Bing Yin, Hyokun Yun, Lihong Li

  3. [ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism [code]
    Zhepei Wei, Wei-Lin Chen, Xinyu Zhu, Yu Meng. Previously presented at NeurIPS 2024 AFM Workshop (Oral: 8/157)

  4. [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales [code] [webpage]
    Zhepei Wei, Wei-Lin Chen, Yu Meng

  5. [ICLR 2024] Incentivized Truthful Communication for Federated Bandits
    Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang

  6. [NeurIPS 2023] Incentivized Communication for Federated Bandits
    Zhepei Wei, Chuanhao Li, Haifeng Xu, Hongning Wang

  7. [EMNLP 2022] Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables
    Erxin Yu, Lan Du, Yuan Jin, Zhepei Wei, Yi Chang

  8. [AACL 2022] Towards Unified Representations of Knowledge Graph and Expert Rules for Machine Learning and Reasoning
    Zhepei Wei, Yue Wang, Jinnan Li, Zhining Liu, Erxin Yu, Yuan Tian, Xin Wang, Yi Chang

  9. [IJCAI 2022] AttExplainer: Explain Transformer via Attention by Reinforcement Learning [code]
    Runliang Niu, Zhepei Wei, Yan Wang, Qi Wang

  10. [ACL 2020] A Novel Cascade Binary Tagging Framework for Relational Triple Extraction [code]
    Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian, Yi Chang