Publications

More details in

2026

Zhenyu Bi, Gaurav Srivastava, Yang Li, Swastik Roy, Meng Lu, Morteza Ziyadi, Xuan Wang.
JudgeBoard: Benchmarking and Enhancing Small Language Models for Reasoning Evaluation.
In Proceedings of the 40th AAAI Conference on Artificial Intelligence (AAAI Oral), 2026. (acceptance rate = 17.6%)

Yang Li, Han Meng, Chenan Wang, Zhenyu Bi, Xuan Wang, Haipeng Chen.
DIP: Dynamic In-Context Planner For Diffusion Language Models.
Under review.

2025

Zhenyu Bi, Meng Lu, Yang Li, Swastik Roy, Weijie Guan, Morteza Ziyadi, Xuan Wang.
OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning.
In Findings of the 14th International Joint Conference on Natural Language Processing & the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL Findings), 2025.

Syna, Bhavana Kunisetty, Chuyi Zhang, Yang Li, Anna Simos, Agatha Scheideman, Mandy Shao, Yanfu Zhang, David Klonoff, Helge Rader, Marina Basina, Michael Snyder, Haipeng Chen, Tao Wang.
SWEET: Large Language Model Benchmark for Scalable Diabetes Patient Education.
In The Diabetes Technology Meeting (DTM), 2025.

Han Xu, Yang Li, Yanhai Xiong, Robert Mintern, Amir Louka, and Haipeng Chen.
AutoRuleSQL: Hybrid Text-to-SQL via Rule-Driven Fast Paths and LLM Bootstrapping.
In the 34th ACM International Conference on Information and Knowledge Management (CIKM Industry Day track, short paper), 2025.

Yang Li, Han Meng, Zhenyu Bi, Ingolv T. Urnes, and Haipeng Chen.
Population Aware Diffusion for Time Series Generation.
In Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI), 2025. (acceptance rate = 23.4%)