Xubin Wang

Ph.D. Candidate in Computer Science · Large Language Models · Data-Centric AI · Deployable Intelligence

I study how better selection of data, demonstrations, and deployment strategies can make AI systems more reliable, efficient, and useful. My work spans in-context learning, reinforcement learning, evolutionary optimization, on-device/edge AI, and robust learning for high-dimensional data.

About me


Xubin Wang
Xubin Wang
Selection-centric AI researcher working across data-centric learning, large language models, and deployable edge intelligence.
B.Sc. (TJNU) · M.Sc. (JLU) · Ph.D. (HKBU, defense passed)
Degree Status
Ph.D. defense passed; degree to be conferred by Hong Kong Baptist University in 2026. Jointly trained with BNU-BNBU Institute of Artificial Intelligence and Future Networks. Advisor: Prof. Weijia Jia (Chair Professor, IEEE Fellow).
Visiting Research
The Hong Kong Polytechnic University, Department of Computing, hosted by Prof. Qing Li (Chair Professor, IEEE Fellow), with work on cognitive edge computing and geometric learning for imbalanced structured data.
Research Appointment
Research Assistant, Institute of Artificial Intelligence and Future Networks, Beijing Normal University at Zhuhai, focusing on LLM in-context learning, collaborative prompting, on-device AI, and research-to-deployment tooling.
Academic Profile

Selection-Centric AI for Reliable, Efficient, and Deployable Intelligence

My research asks one central question: what should an AI system select before it learns, reasons, and acts? I frame this through Selection-Centric AI, where reliability and efficiency are determined not only by model architecture, but by principled choices over data, context, tools, and execution policy under constraints of accuracy, latency, memory, privacy, energy, and cost. This perspective treats selection as a first-class scientific variable rather than a purely engineering heuristic.

Methodologically, I integrate reinforcement learning, evolutionary optimization, and robust representation learning across three layers: data-centric selection (feature/sample quality control for high-dimensional and imbalanced data), reasoning-context optimization (demonstration/context selection for LLMs), and system-level deployment (quality-latency-resource trade-offs for edge and on-device intelligence). This connects statistical learning, foundation-model reasoning, and deployable AI systems in a unified research program. Across these layers, my goal is to develop methods that are both theoretically grounded and practically reproducible.

Representative results include RL-based demonstration selection for in-context learning (ICML 2025), multi-task evolutionary feature selection (IEEE TKDE 2024), and a comprehensive survey of on-device intelligence (ACM Computing Surveys 2025, ESI Hot Paper). My recent and ongoing work appears in ICML, TKDE, TCBB, CSUR, KBS, and related venues, with applications in biomedical data analysis, RAG and knowledge-base construction, reasoning/agent workflows, and resource-constrained deployment. I am especially interested in research that bridges algorithmic advances with measurable impact in real-world AI systems.

Recent Highlights


ICML 2025 (CCF A)
First-author: Demonstration Selection for In-Context Learning via Reinforcement Learning; featured by MIT Technology Review China.
ACM Computing Surveys 2025 (IF 28, ESI Hot Paper)
"Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models" (200+ citations)
Citation Impact
580+ total citations (Google Scholar); one first-author paper >200 citations, three papers >100 each
Open-Source & Deployment
RAG-QA-Generator (270+ GitHub stars) deployed at Beijing Normal University Intelligent Supercomputing Center
International Collaboration
Visiting Research Student, The Hong Kong Polytechnic University (Department of Computing, 2025)

I welcome research collaborators and prospective students interested in future opportunities. Strong interest in LLMs, agents, data-centric AI, or edge intelligence is a plus. Contact me for details.

Research Topics

Integrated framework from completed work and trend-informed next directions (Agentic AI, efficient deployment, trustworthy evaluation)


Integrated research framework: completed workstreams, scholar-derived topic clusters, applications, and future AI directions
Google Scholar Label

Learning & Optimization

Core methodological layer for selection, search, and adaptation under constraints, spanning evolutionary optimization, reinforcement learning, and efficient decision-making.

Google Scholar Label

Data-Centric AI

Feature/sample selection and geometry-aware robust learning for high-dimensional, noisy, and imbalanced structured data, especially in biomedical and scientific settings.

Google Scholar Label

Large Language Models

In-context example selection and prompting strategy optimization for LLM reasoning quality, robustness, and cost-efficiency, with auditable and reproducible selection mechanisms.

Scholar + Current Program

Deployable Intelligence

Model-system co-optimization for on-device AI, edge LLMs, agent execution, and research translation into deployable tooling and real-world services.

Projects & Software


I maintain open-source tools and research code for RAG, edge AI, evolutionary learning, and reproducible machine learning.

RAG / LLM Systems

RAG-QA-Generator

Automated knowledge-base construction and QA-pair generation for retrieval-augmented generation systems. The project supports document ingestion, LLM-based QA generation, collection management, and Streamlit-based interactive workflows.

Impact: GitHub stars; deployed at the Intelligent Supercomputing Center, Beijing Normal University at Zhuhai.

Edge AI

Awesome-EdgeAI

A curated resource collection on edge AI and on-device intelligence, covering papers, systems, benchmarks, and practical references for deploying AI beyond the cloud.

Impact: GitHub stars and growing community usage.

Evolutionary Optimization

SaWDE / MEL / EODE / HSNOE

Open-source repositories for evolutionary feature selection and ensemble optimization on high-dimensional biomedical data, supporting reproducibility for KBS, TKDE, TCBB, and ESWA publications.

Academic Service


Conferences

Reviewer

ICLR, ICML, NeurIPS, KDD.

Journals

Reviewer

IEEE Computational Intelligence Magazine, Artificial Intelligence Review, Machine Learning, npj Artificial Intelligence, IEEE TCSVT, Neurocomputing, Scientific Reports, Knowledge and Information Systems, and 20+ others.

Mentoring / Talks

Research Mentoring and Seminars

Co-authored with undergraduate, M.S., and junior Ph.D. students on in-submission work; provided guidance on experimental design, scientific writing, and rebuttal preparation. Delivered internal and cross-institution seminars on in-context learning, on-device AI, cognitive edge computing, and RL-based selection methods.

Teaching


Experience

Teaching Assistantships

Teaching Assistant (2018-2026) for Data Structures and Algorithms, Software Engineering and Workshop, Artificial Intelligence Ethics and Governance, Introduction to Artificial Intelligence, and Operating Systems.

Recent roles: Artificial Intelligence Ethics and Governance (Beijing Normal University at Zhuhai, Spring 2026); Software Engineering and Workshop (Beijing Normal-Hong Kong Baptist University, Spring 2025).

Courses

Prepared to Teach

Machine Learning, Data-Centric AI, Generative AI / Large Language Models, Data Structures and Algorithms, Software Engineering, AI Ethics and Governance, and Computational Methods for Data Science.

Teaching orientation: connect mathematical foundations with reproducible implementation, research awareness, and application-driven problem solving.

Selected Publications


* marks corresponding author(s).

  • 1
    Demonstration Selection for In-Context Learning via Reinforcement Learning
    Xubin Wang, Jianfei Wu, Yichen Yuan, Deyu Cai, Mingzhe Li, Weijia Jia
    Forty-Second International Conference on Machine Learning (ICML), 2025 CCF A
  • 2
    Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
    Xubin Wang, Zhiqing Tang, Jianxiong Guo, Tianhui Meng, Chenhao Wang, Tian Wang, Weijia Jia
    ACM Computing Surveys, 57(9):1-39, 2025 SCI I IF 2025: 28 ESI Hot Paper
  • 3
    MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection
    Xubin Wang, Haojiong Shangguan, Fengyi Huang, Shangrui Wu, Weijia Jia
    IEEE Transactions on Knowledge and Data Engineering, 36(08):4020-4033, 2024 CCF A IF 2024: 10.4
  • 4
    Exhaustive Exploitation of Nature-inspired Computation for Cancer Screening in an Ensemble Manner
    Xubin Wang, Yunhe Wang, Zhiqiang Ma, Ka-Chun Wong, Xiangtao Li
    IEEE/ACM Transactions on Computational Biology and Bioinformatics, 21(5):1366-1379, 2024 CCF B IF 2024: 3.4
  • 5
    Evolving Pathway Activation from Cancer Gene Expression Data using Nature-inspired Ensemble Optimization
    Xubin Wang, Yunhe Wang, Zhiqiang Ma, Ka-Chun Wong, Xiangtao Li
    Expert Systems with Applications, 248:123469, 2024 SCI I IF 2024: 7.5
  • 6
    A self-adaptive weighted differential evolution approach for large-scale feature selection
    Xubin Wang, Yunhe Wang, Ka-Chun Wong, Xiangtao Li
    Knowledge-Based Systems, 235:107633, 2022 SCI I IF 2022: 8.139 100+ Citations
  • 7
    A Niching Archive-Assisted Evolutionary Algorithm for Multimodal Feature Selection in High-Dimensional Data Classification
    Yunhe Wang, Zhengyu Du, Zeming Zhou, Xubin Wang*, Shengxiang Yang
    Knowledge-Based Systems, 115870, 2026 SCI I IF 2026: 7.6

Contact


Email wangxubin at ieee.org

I welcome collaboration on large language models, data-centric AI, edge intelligence, evolutionary optimization, and deployable AI systems.