Jiaqi Ma | 马家祺

Assistant Professor
School of Information Sciences
Siebel School of Computing and Data Science (Affiliation)
University of Illinois Urbana-Champaign

Contact:
Outlook: jiaqima AT illinois DOT edu
Gmail: jiaqima.mle AT gmail DOT com
[Google Scholar | GitHub | Bluesky | Twitter]

About Me

I am an Assistant Professor at the University of Illinois Urbana-Champaign (UIUC). Prior to UIUC, I was a Postdoctoral Researcher at Harvard University. I received my Ph.D. from University of Michigan and a B.Eng. from Tsinghua University.

I’m interested in the broad area of machine learning and artificial intelligence (AI). Recently, my research focuses on the data problems in AI, including data attribution, data curation, and data compensation.

For students who want to work with me, please see here for more details.

News

Giving a talk about our work on data attribution at the AI Seminar at UMich in May 2025!
Giving a talk about our work on data attribution at YouTube in Apr 2025!
Giving a talk at the New Faculty Highlight session at AAAI 2025!
I’m serving as an Area Chair for ICML 2025!
Giving a talk about our work on data attribution at RIKEN AIP in Tokyo in Jan 2025!
Giving a talk about our work on data attribution at the VASC Seminar at CMU in Dec 2024!
Giving a talk about our work on data attribution at the IDEAL Institute in Chicago in Nov 2024!
Two papers accepted by NeurIPS 2024 (with one spotlight at the D&B track)!
I’m serving as an Area Chair for AISTATS 2025!
Giving a talk about our work on data attribution at the Siebel School Colloquium at UIUC in Sep 2024!
We are organizing the 2nd Workshop on Regulatable Machine Learning in conjunction with NeurIPS 2024!
Our work on Computational Copyright received a best paper award from the DPFM Workshop at ICLR 2024!
One paper accepted by AISTATS 2024!
I’m serving as an Area Chair for ICML 2024!

Selected Papers

Please see the full list on my Google Scholar page.

Preprints

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning.
Yuzheng Hu*, Fan Wu*, Haotian Ye, David Forsyth, James Zou, Nan Jiang, Jiaqi Ma†, Han Zhao†.
[ArXiv]
GraSS: Scalable Influence Function with Sparse Gradient Compression.
Pingbang Hu, Joseph Melkonian, Weijing Tang, Han Zhao, Jiaqi Ma.
[ArXiv]
Measuring Fine-Grained Relatedness in Multitask Learning via Data Attribution.
Yiwen Tu*, Ziqi Liu*, Jiaqi Ma, Weijing Tang.
[ArXiv]
Efficient Ensembles Improve Training Data Attribution.
Junwei Deng*, Ting Wei Li*, Shichang Zhang, Jiaqi Ma.
[ArXiv]
Towards Reliable Empirical Machine Unlearning Evaluation: A Game-Theoretic View.
Yiwen Tu*, Pingbang Hu*, Jiaqi Ma.
[ArXiv]
Computational Copyright: Towards A Royalty Model for AI Music Generation Platforms.
Junwei Deng, Jiaqi Ma.
[ArXiv]

Conference Publications

DCA-Bench: A Benchmark for Dataset Curation Agents.
Benhao Huang, Yingzhuo Yu, Jin Huang, Xingjian Zhang, Jiaqi Ma.
KDD 2025 (Datasets and Benchmarks Track, Oral).
[ArXiv]
A Versatile Influence Function for Data Attribution with Non-Decomposable Loss.
Junwei Deng, Weijing Tang, Jiaqi Ma.
ICML 2025.
[ArXiv]
Adversarial Attacks on Data Attribution.
Xinhe Wang, Pingbang Hu, Junwei Deng, Jiaqi Ma.
ICLR 2025.
[ArXiv]
dattri: A Library for Efficient Data Attribution.
Junwei Deng*, Ting Wei Li*, Shiyuan Zhang, Shixuan Liu, Yijun Pan, Hao Huang, Xinhe Wang, Pingbang Hu, Xingjian Zhang, Jiaqi Ma.
NeurIPS 2024 (Datasets and Benchmarks Track, Spotlight).
[ArXiv][GitHub]
Most Influential Subset Selection: Challenges, Promises, and Beyond.
Yuzheng Hu, Pingbang Hu, Han Zhao, Jiaqi Ma.
NeurIPS 2024.
[ArXiv]
A Metadata-Driven Approach to Understand Graph Neural Networks.
Ting Wei Li, Qiaozhu Mei, Jiaqi Ma.
NeurIPS 2023.
[ArXiv]
Towards Bridging the Gaps Between the Right to Explanation and the Right to be Forgotten.
Satyapriya Krishna*, Jiaqi Ma*, Himabindu Lakkaraju.
ICML 2023.
[OpenReview]
Subgroup Generalization and Fairness of Graph Neural Networks.
Jiaqi Ma*, Junwei Deng*, Qiaozhu Mei.
NeurIPS 2021 (Spotlight).
[ArXiv][Code]
Learning-to-Rank with Partitioned Preference: Fast Estimation for the Plackett-Luce Model.
Jiaqi Ma, Xinyang Yi, Weijing Tang, Zhe Zhao, Lichan Hong, Ed H. Chi, Qiaozhu Mei.
AISTATS 2021.
[ArXiv][SlidesLive]
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts.
Jiaqi Ma, Zhe Zhao, Xinyang Yi, Jilin Chen, Lichan Hong, Ed H. Chi.
KDD 2018 (with oral presentation).
[Proceedings][Video][Presentation]

Journal Publications

SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks.
Weijing Tang*, Jiaqi Ma*, Qiaozhu Mei, Ji Zhu.
JMLR 2022.
[ArXiv][Code]

(* Equal Contribution; † Euqal Advising)

(Note: I publish under the name Jiaqi W. Ma, starting in Sep 2024.)

Back Home

PhD Students

Back Home

Teaching

Instructor, IS 527, SP24/SP25, University of Illinois Urbana-Champaign.
Network Analysis.
Instructor, IS 327, FA23/SP25, University of Illinois Urbana-Champaign.
Concepts of Machine Learning.
Co-Instructor, COMPSCI 282BR, SP23, Harvard University.
Explainable AI: From Simple Rules to Complex Generative Models.

Back Home

Misc

Pronunciation of my first name: Jia-Chi.