Publications

More details on Google Scholar.

  1. Zhenyu Bi, Gaurav Srivastava, Yang Li, Swastik Roy, Meng Lu, Morteza Ziyadi, Xuan Wang.
    JudgeBoard: Benchmarking and Enhancing Small Language Models for Reasoning Evaluation. [arXiv]
    In Proceedings of the 40th AAAI Conference on Artificial Intelligence (AAAI Oral), 2026 (acceptance rate = 17.6%).

  2. Zhenyu Bi, Meng Lu, Yang Li, Swastik Roy, Weijie Guan, Morteza Ziyadi, Xuan Wang.
    OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning. [arXiv]
    In Findings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL Findings), 2025.

  3. Syna, Bhavana Kunisetty, Chuyi Zhang, Yang Li, Anna Simos, Agatha Scheideman, Mandy Shao, Yanfu Zhang, David Klonoff, Helge Rader, Marina Basina, Michael Snyder, Haipeng Chen, Tao Wang.
    SWEET: Large Language Model Benchmark for Scalable Diabetes Patient Education.
    In the Diabetes Technology Meeting (DTM), 2025.

  4. Han Xu, Yang Li, Yanhai Xiong, Robert Mintern, Amir Louka, and Haipeng Chen.
    AutoRuleSQL: Hybrid Text-to-SQL via Rule-Driven Fast Paths and LLM Bootstrapping. [ACM]
    In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM Industry Day track, short paper), 2025.

  5. Yang Li, Han Meng, Zhenyu Bi, Ingolv T. Urnes, and Haipeng Chen.
    Population Aware Diffusion for Time Series Generation. [arXiv] [GitHub]
    In Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI), 2025 (acceptance rate = 23.4%).