Ph.D. Candidate in Computer Science · Large Language Models · Data-Centric AI · Deployable Intelligence
I study how better selection of data, demonstrations, and deployment strategies can make AI systems more reliable, efficient, and useful. My work spans in-context learning, reinforcement learning, evolutionary optimization, on-device/edge AI, and robust learning for high-dimensional data.
My research asks one central question: what should an AI system select before it learns, reasons, and acts? I frame this through Selection-Centric AI, where reliability and efficiency are determined not only by model architecture, but by principled choices over data, context, tools, and execution policy under constraints of accuracy, latency, memory, privacy, energy, and cost. This perspective treats selection as a first-class scientific variable rather than a purely engineering heuristic.
Methodologically, I integrate reinforcement learning, evolutionary optimization, and robust representation learning across three layers: data-centric selection (feature/sample quality control for high-dimensional and imbalanced data), reasoning-context optimization (demonstration/context selection for LLMs), and system-level deployment (quality-latency-resource trade-offs for edge and on-device intelligence). This connects statistical learning, foundation-model reasoning, and deployable AI systems in a unified research program. Across these layers, my goal is to develop methods that are both theoretically grounded and practically reproducible.
Representative results include RL-based demonstration selection for in-context learning (ICML 2025), multi-task evolutionary feature selection (IEEE TKDE 2024), and a comprehensive survey of on-device intelligence (ACM Computing Surveys 2025, ESI Hot Paper). My recent and ongoing work appears in ICML, TKDE, TCBB, CSUR, KBS, and related venues, with applications in biomedical data analysis, RAG and knowledge-base construction, reasoning/agent workflows, and resource-constrained deployment. I am especially interested in research that bridges algorithmic advances with measurable impact in real-world AI systems.
I welcome research collaborators and prospective students interested in future opportunities. Strong interest in LLMs, agents, data-centric AI, or edge intelligence is a plus. Contact me for details.
Integrated framework from completed work and trend-informed next directions (Agentic AI, efficient deployment, trustworthy evaluation)
Core methodological layer for selection, search, and adaptation under constraints, spanning evolutionary optimization, reinforcement learning, and efficient decision-making.
Feature/sample selection and geometry-aware robust learning for high-dimensional, noisy, and imbalanced structured data, especially in biomedical and scientific settings.
In-context example selection and prompting strategy optimization for LLM reasoning quality, robustness, and cost-efficiency, with auditable and reproducible selection mechanisms.
Model-system co-optimization for on-device AI, edge LLMs, agent execution, and research translation into deployable tooling and real-world services.
I maintain open-source tools and research code for RAG, edge AI, evolutionary learning, and reproducible machine learning.
Automated knowledge-base construction and QA-pair generation for retrieval-augmented generation systems. The project supports document ingestion, LLM-based QA generation, collection management, and Streamlit-based interactive workflows.
Impact: ; deployed at the Intelligent Supercomputing Center, Beijing Normal University at Zhuhai.
A curated resource collection on edge AI and on-device intelligence, covering papers, systems, benchmarks, and practical references for deploying AI beyond the cloud.
Impact: and growing community usage.
ICLR, ICML, NeurIPS, KDD.
IEEE Computational Intelligence Magazine, Artificial Intelligence Review, Machine Learning, npj Artificial Intelligence, IEEE TCSVT, Neurocomputing, Scientific Reports, Knowledge and Information Systems, and 20+ others.
Co-authored with undergraduate, M.S., and junior Ph.D. students on in-submission work; provided guidance on experimental design, scientific writing, and rebuttal preparation. Delivered internal and cross-institution seminars on in-context learning, on-device AI, cognitive edge computing, and RL-based selection methods.
Teaching Assistant (2018-2026) for Data Structures and Algorithms, Software Engineering and Workshop, Artificial Intelligence Ethics and Governance, Introduction to Artificial Intelligence, and Operating Systems.
Recent roles: Artificial Intelligence Ethics and Governance (Beijing Normal University at Zhuhai, Spring 2026); Software Engineering and Workshop (Beijing Normal-Hong Kong Baptist University, Spring 2025).
Machine Learning, Data-Centric AI, Generative AI / Large Language Models, Data Structures and Algorithms, Software Engineering, AI Ethics and Governance, and Computational Methods for Data Science.
Teaching orientation: connect mathematical foundations with reproducible implementation, research awareness, and application-driven problem solving.
* marks corresponding author(s).
I welcome collaboration on large language models, data-centric AI, edge intelligence, evolutionary optimization, and deployable AI systems.