Publications
See also: Google Scholar, DBLP
Open-source Code
Preprints / In Submission
[Preprint] The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning. Xinyu Zhu, Mengzhou Xia, Zhepei Wei, Wei-Lin Chen, Danqi Chen, Yu Meng
[Preprint] Do LLM Evaluators Prefer Themselves for a Reason? Wei-Lin Chen, Zhepei Wei, Xinyu Zhu, Shi Feng, Yu Meng
[In Submission] Align Large Language Model with Human Preference via Extremely Self-Synthetic Data. Shangjian Yin, Zhepei Wei, Xinyu Zhu, Wei-Lin Chen, Yu Meng
Conference Papers
[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning. Zhepei Wei, Wenlin Yao, Yao Liu, Weizhi Zhang, Qin Lu, Liang Qiu, Changlong Yu, Puyang Xu, Chao Zhang, Bing Yin, Hyokun Yun, Lihong Li
[ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism [code]. Zhepei Wei, Wei-Lin Chen, Xinyu Zhu, Yu Meng. Previously presented at NeurIPS 2024 AFM Workshop (Oral: 8/157)
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales [code] [webpage]. Zhepei Wei, Wei-Lin Chen, Yu Meng
[ICLR 2024] Incentivized Truthful Communication for Federated Bandits. Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang
[NeurIPS 2023] Incentivized Communication for Federated Bandits. Zhepei Wei, Chuanhao Li, Haifeng Xu, Hongning Wang
[EMNLP 2022] Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables. Erxin Yu, Lan Du, Yuan Jin, Zhepei Wei, Yi Chang
[AACL 2022] Towards Unified Representations of Knowledge Graph and Expert Rules for Machine Learning and Reasoning. Zhepei Wei, Yue Wang, Jinnan Li, Zhining Liu, Erxin Yu, Yuan Tian, Xin Wang, Yi Chang
[IJCAI 2022] AttExplainer: Explain Transformer via Attention by Reinforcement Learning [code]. Runliang Niu, Zhepei Wei, Yan Wang, Qi Wang
[ACL 2020] A Novel Cascade Binary Tagging Framework for Relational Triple Extraction [code]. Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian, Yi Chang