figure.jpg

Xiaobao Wu

Research Scientist
College of Computing and Data Science
Nanyang Technological University

Hi, I received my Ph.D. degree from CCDS, Nanyang Technological University in 2024., working with Prof. Anh Tuan Luu. I received my Master’s degree from Tsinghua University (2018 - 2021) and my Bachelor’s degree from Southeast University (2014 - 2018).

My research interests lie mostly in the area of machine learning and natural language processing. I’m interested in LLMs recently, specially the following topics:

  • Reasoning: How/why can LLMs consistently perform logical and faithful reasoning?
  • Efficiency: How can LLMs efficiently produce reliable information?
  • Evaluation: How to fairly and extensively evaluate LLMs?
  • Deployment: How to build robust LLM systems in practice?
📢 Annoncement

I'm looking for highly self-motivated Ph.D./master/undergraduate students to collaborate on various interesting topics related to LLMs. If you have interest, please feel free to reach out to me via email.




News

  May, 2025   6 papers accepted to ACL 2025.
We release a survey on learning from rewards, including reinforcement learning (in RLHF, DPO, and GRPO), reward-guided decoding, and post-hoc correction.
One paper accepted to ICML 2025.
  Feb, 2025   Invited to serve as Area Chair for ACL 2025.
One paper accepted to NAACL 2025.
  Dec, 2024   Succesfully defend my PhD thesis!
Two papers accepted to AAAI 2025!
One paper accepted to COLING 2025!
One paper accepted to ACM/SAC 2025!
One paper accepted to TMLR!
  Sep, 2024   One paper accepted to NeurIPS 2024 and Three papers accepted to EMNLP 2024 main conference!
  Jul, 2024   One paper accepted to ECCV 2024!
  Jun, 2024   Two papers accepted to ACL 2024 (one findings and one demo)!
  Mar, 2024   One paper accepted to NAACL 2024!
  Jan, 2024   Our Neural Topic Modeling Survey Paper got accepted to Artificial Intelligence Review!
  Dec, 2023   Dec 2023: Two papers accepted to AAAI 2024!


Selected Publications

  1. arxiv
    Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
    preprint, 2025
  2. ACL
    AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
    Xiaobao Wu, Liangming Pan, Yuxi Xie, Ruiwen Zhou, Shuai Zhao, Yubo Ma, Mingzhe Du, Rui Mao, Anh Tuan Luu, and William Yang Wang
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2025
  3. FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model
    Xiaobao Wu, Thong Nguyen, Delvin Ce Zhang, William Yang Wang, and Anh Tuan Luu
    In Neural Information Processing Systems (NeurIPS), 2024
  4. AKEW: Assessing Knowledge Editing in the Wild
    Xiaobao Wu, Liangming Pan, William Yang Wang, and Anh Tuan Luu
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
  5. Are LLMs Good Zero-shot Fallacy Classifiers?
    Fengjun Pan#Xiaobao Wu#, Zongrui Li, and Anh Tuan Luu
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
  6. AIR
    A Survey on Neural Topic Models: Methods, Applications, and Challenges
    Xiaobao Wu, Thong Nguyen, and Anh Tuan Luu
    Artificial Intelligence Review (AIR), 2024
  7. ACL Demo
    Towards the TopMost: A Topic Modeling System Toolkit
    Xiaobao Wu, Fengjun Pan, and Anh Tuan Luu
    In The Annual Meeting of Association for Computational Linguistics: System Demonstration Track, 2024
  8. Effective Neural Topic Modeling with Embedding Clustering Regularization
    Xiaobao Wu, Xinshuai Dong, Thong Nguyen, and Anh Tuan Luu
    In International Conference on Machine Learning (ICML), 2023
  9. ACL
    Fact-Checking Complex Claims with Program-Guided Reasoning
    Liangming Pan,  Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan, and Preslav Nakov
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2023
  10. Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning
    Xiaobao Wu, Anh Tuan Luu, and Xinshuai Dong
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), Dec 2022
  11. Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder
    Xiaobao Wu, Chunping Li, Yan Zhu, and Yishu Miao
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), Nov 2020