Jaeseung Heo

Hi, I’m Jaeseung Heo, a Ph.D. student at POSTECH ML Lab under the supervision of Prof. Dongwoo Kim. My research aims to identify which training data causes specific model behaviors, with a longer-term goal of extending this to problematic behaviors relevant to safety and alignment. I use training data attribution (TDA), particularly influence functions, to characterize how training examples drive model behavior, and translate these signals into data-centric interventions such as augmentation, label smoothing, and selection. Methodologically, I develop influence functions that capture dependence between training examples, both explicit (as in graph neural networks) and implicit (arising from joint loss minimization). Going forward, I aim to connect TDA with mechanistic interpretability, and ultimately to trace behaviors such as subliminal learning back to their origins in training data.

News

Jun, 2026	A new preprint “Long Live the Librarian! A Persistent Search Sub-Agent for Energy-Efficient Multi-Agent Software Engineering Systems” is now on arXiv. We introduce the Librarian, a persistent search sub-agent that reduces redundant retrieval and lowers the energy cost of multi-agent software engineering systems.
May, 2026	A new preprint “Interaction-Aware Influence Functions for Group Attribution” is now on arXiv. We propose an interaction-aware attribution method and apply it to instruction-tuning data selection on Llama-3.1-8B.
Nov, 2025	Our paper “Posterior Label Smoothing for Node Classification” has been accepted to AAAI 2026 (Oral).
Sep, 2025	Our paper “Influence Functions for Edge Edits in Non-Convex Graph Neural Networks” has been accepted to NeurIPS 2025.
Jun, 2025	I’m delighted to share that I recently got married and began a new chapter in my life.

Selected Publications

Interaction-Aware Influence Functions for Group Attribution

Jaeseung Heo, Kyeongheung Yun, Youngbin Choi, Sehyun Hwang, Jungseul Ok, and Dongwoo Kim

Mechanistic Interpretability Workshop @ ICML, 2026

Influence functions approximate how removing a training example changes a quantity of interest, called the target function, such as a held-out loss. To estimate the influence of a group of examples, the standard practice is to sum the individual influences of its members. However, this sum does not capture how examples jointly affect the target: a pair of examples may be redundant or complementary, but the sum cannot distinguish these cases. We propose an interaction-aware influence function that characterizes how interactions between examples influence the target. By expanding the target to second order around the trained parameters, we obtain an estimator that augments the standard sum with a pairwise interaction term that captures the alignment between two examples’ effects on the target. We empirically evaluate our estimator in two settings. First, on six dataset-model pairs spanning logistic regression, MLPs, and ResNet-9, our estimator tracks leave-group-out retraining substantially better than first-order influence across all settings. Second, when used as a greedy selection rule for instruction-tuning data on Llama-3.1-8B, it beats prior influence-based and representation-similarity baselines on five of seven downstream tasks, in a regime where standard influence-based selection underperforms random selection.
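
The second-order idea in the abstract above can be illustrated with a toy sketch. This is not the paper's estimator: it assumes a hypothetical quadratic target, and it assumes each example's removal is already summarized by a given parameter shift (the `shifts` argument, a name introduced here only for illustration). Under those assumptions, the change in the target splits into the usual first-order sum, per-example curvature terms, and a pairwise term that is positive when two shifts align under the target's curvature and negative when they oppose each other.

```python
from itertools import combinations


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(M, v):
    return [dot(row, v) for row in M]


def target(theta, A, b):
    """Toy quadratic target f(theta) = 0.5 * theta^T A theta + b^T theta (A symmetric)."""
    return 0.5 * dot(theta, matvec(A, theta)) + dot(b, theta)


def group_effect(theta, A, b, shifts):
    """Split the second-order change in the target into three parts.

    `shifts` holds one hypothetical parameter shift per removed example.
    """
    grad = [g + bi for g, bi in zip(matvec(A, theta), b)]  # gradient of f at theta
    first_order = sum(dot(grad, d) for d in shifts)        # standard sum of influences
    self_terms = sum(0.5 * dot(d, matvec(A, d)) for d in shifts)
    pairwise = sum(dot(di, matvec(A, dj)) for di, dj in combinations(shifts, 2))
    return first_order, self_terms, pairwise


theta = [1.0, -1.0]
A = [[2.0, 0.0], [0.0, 1.0]]
b = [0.5, 0.0]
aligned = [[0.1, 0.0], [0.1, 0.0]]    # two examples pushing the same way
opposed = [[0.1, 0.0], [-0.1, 0.0]]   # two examples cancelling each other
print(group_effect(theta, A, b, aligned))
print(group_effect(theta, A, b, opposed))
```

Both groups can be compared by their pairwise term alone: the first-order sum cannot tell whether two examples reinforce or cancel each other, while the pairwise term changes sign between the two cases.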

Posterior Label Smoothing for Node Classification

Jaeseung Heo, MoonJeong Park, and Dongwoo Kim

AAAI Conference on Artificial Intelligence (AAAI), 2026

Oral presentation

Soft labels can improve the generalization of neural network classifiers in many domains, such as image classification. Despite this success, the literature has largely overlooked the efficacy of label smoothing for node classification on graph-structured data. In this work, we propose a simple yet effective label smoothing method for the transductive node classification task. We design the soft label to encapsulate the local context of the target node through the neighborhood label distribution. We apply the smoothing method to seven baseline models to show its effectiveness. In most cases, it improves classification accuracy across 10 node classification datasets. Our analysis finds that incorporating global label statistics in the posterior computation is the key to the success of label smoothing. Further investigation reveals that the soft labels mitigate overfitting during training, leading to better generalization performance.
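
One plausible reading of the abstract above is sketched below. It is an assumption-laden toy, not the paper's implementation: it assumes every node label is known, treats neighbor labels as conditionally independent given the node's class, estimates the class prior and the neighbor-label conditionals from global counts, and mixes the resulting posterior with the one-hot label using a hypothetical weight `alpha`. The function name, `alpha`, and `eps` are introduced here only for illustration.

```python
from collections import Counter


def posterior_soft_labels(edges, labels, num_classes, alpha=0.5, eps=1e-6):
    """Hypothetical sketch: soft label = (1 - alpha) * one-hot + alpha * posterior."""
    # Undirected adjacency list.
    neighbors = {v: [] for v in labels}
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)

    # Global statistics: class prior and P(neighbor label = m | node label = k).
    prior_counts = Counter(labels.values())
    pair_counts = Counter()
    for v, nbrs in neighbors.items():
        for u in nbrs:
            pair_counts[(labels[v], labels[u])] += 1

    n = len(labels)
    prior = [prior_counts[k] / n for k in range(num_classes)]
    cond = []
    for k in range(num_classes):
        row_total = sum(pair_counts[(k, m)] for m in range(num_classes))
        # eps keeps every conditional probability strictly positive.
        cond.append([(pair_counts[(k, m)] + eps) / (row_total + eps * num_classes)
                     for m in range(num_classes)])

    soft = {}
    for v, nbrs in neighbors.items():
        # Unnormalized posterior over the node's class given its neighbors' labels.
        scores = []
        for k in range(num_classes):
            s = prior[k]
            for u in nbrs:
                s *= cond[k][labels[u]]
            scores.append(s)
        z = sum(scores)
        posterior = [s / z for s in scores]
        soft[v] = [(1 - alpha) * (1.0 if labels[v] == k else 0.0) + alpha * posterior[k]
                   for k in range(num_classes)]
    return soft


edges = [(0, 1), (1, 2), (2, 3)]
labels = {0: 0, 1: 0, 2: 1, 3: 1}
for node, dist in posterior_soft_labels(edges, labels, num_classes=2).items():
    print(node, dist)
```

In a real transductive setting only the training labels are available, so the counts would be restricted to labeled nodes; that detail is left out of this sketch.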

Influence Functions for Edge Edits in Non-Convex Graph Neural Networks

Jaeseung Heo, Kyeongheung Yun, Seokwon Yoon, MoonJeong Park, Jungseul Ok, and Dongwoo Kim

Advances in Neural Information Processing Systems (NeurIPS), 2025

Understanding how individual edges influence the behavior of graph neural networks (GNNs) is essential for improving their interpretability and robustness. Graph influence functions have emerged as promising tools to efficiently estimate the effects of edge deletions without retraining. However, existing influence prediction methods rely on strict convexity assumptions, exclusively consider the influence of edge deletions while disregarding edge insertions, and fail to capture changes in message propagation caused by these modifications. In this work, we propose a proximal Bregman response function specifically tailored for GNNs, relaxing the convexity requirement and enabling accurate influence prediction for standard neural network architectures. Furthermore, our method explicitly accounts for message propagation effects and extends influence prediction to both edge deletions and insertions in a principled way. Experiments with real-world datasets demonstrate accurate influence predictions for different characteristics of GNNs. We further demonstrate that the influence function is versatile in applications such as graph rewiring and adversarial attacks.

EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost

Jaeseung Heo*, Seungbeom Lee*, Sungsoo Ahn, and Dongwoo Kim

International Joint Conference on Artificial Intelligence (IJCAI), 2024

Data augmentation plays a critical role in improving model performance across various domains, but it becomes challenging with graph data due to their complex and irregular structure. To address this issue, we propose EPIC (Edit Path Interpolation via learnable Cost), a novel interpolation-based method for augmenting graph datasets. To interpolate between two graphs lying in an irregular domain, EPIC leverages the concept of graph edit distance, constructing an edit path that represents the transformation process between two graphs via edit operations. Moreover, our method introduces a context-sensitive cost model that accounts for the importance of specific edit operations formulated through a learning framework. This allows for a more nuanced transformation process, where the edit distance is not merely count-based but reflects meaningful graph attributes. With randomly sampled graphs from the edit path, we enrich the training set to enhance the generalization capability of classification models. Experimental evaluations across several benchmark datasets demonstrate that our approach outperforms existing augmentation techniques in many tasks.
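
The edit-path idea can be pictured with a deliberately simplified sketch. It assumes the two graphs share a node set with a known correspondence, so the path reduces to deleting and inserting edges; computing a real graph edit distance and the learnable cost model from the abstract are both left out. The function names and the returned mixing ratio (a value one might use to interpolate labels) are hypothetical.

```python
def edit_operations(edges_a, edges_b):
    """List the edge edits that turn graph A into graph B (same node set assumed)."""
    a = {frozenset(e) for e in edges_a}
    b = {frozenset(e) for e in edges_b}
    deletions = [("delete", e) for e in sorted(a - b, key=sorted)]
    insertions = [("insert", e) for e in sorted(b - a, key=sorted)]
    return a, deletions + insertions


def interpolate(edges_a, edges_b, k):
    """Apply the first k edit operations on the path from graph A to graph B."""
    current, ops = edit_operations(edges_a, edges_b)
    for kind, edge in ops[:k]:
        if kind == "delete":
            current.discard(edge)
        else:
            current.add(edge)
    # Fraction of the path traversed; 0.0 means graph A, 1.0 means graph B.
    mix = k / len(ops) if ops else 0.0
    return current, mix


graph_a = [(0, 1), (1, 2)]
graph_b = [(1, 2), (2, 3)]
for k in range(3):
    edges, mix = interpolate(graph_a, graph_b, k)
    print(k, sorted(tuple(sorted(e)) for e in edges), mix)
```

Sampling `k` at random yields an intermediate graph that lies between the two endpoints, which is the sense in which graphs on the path can enrich a training set.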

Mitigating Oversmoothing through Reverse Process of GNNs for Heterophilic Graphs

MoonJeong Park, Jaeseung Heo, and Dongwoo Kim

International Conference on Machine Learning (ICML), 2024

Graph neural networks (GNNs) resemble a diffusion process, which leads to over-smoothing of the learned representations when many layers are stacked. The reverse process of message passing can therefore produce distinguishable node representations by inverting the forward message propagation. Such representations help classify neighboring nodes with different labels, as in heterophilic graphs. In this work, we apply the design principle of the reverse process to three GNN variants. Through experiments on heterophilic graph data, where adjacent nodes need to have different representations for successful classification, we show that the reverse process significantly improves prediction performance in many cases. Additional analysis reveals that the reverse mechanism can mitigate over-smoothing over hundreds of layers.
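
A minimal sketch of what inverting a propagation step could look like is given below. It is an illustration under stated assumptions, not the paper's architecture: the forward step is a hypothetical diffusion-style update `h + tau * (A_hat h - h)` on scalar node features with a row-normalized adjacency `A_hat`, and the reverse step solves for its input by fixed-point iteration, which converges here because `tau` is kept below 0.5.

```python
def normalized_adjacency(edges, n):
    """Row-normalized adjacency: each row averages over a node's neighbors."""
    rows = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        rows[u][v] = rows[v][u] = 1.0
    for r in rows:
        deg = sum(r)
        if deg:
            for j in range(n):
                r[j] /= deg
    return rows


def diffuse(adj, h):
    """Smoothing direction (A_hat h - h) for scalar node features h."""
    return [sum(a * x for a, x in zip(row, h)) - hi for row, hi in zip(adj, h)]


def forward_step(adj, h, tau=0.2):
    """One diffusion-like propagation step: neighbors pull features together."""
    return [hi + tau * d for hi, d in zip(h, diffuse(adj, h))]


def reverse_step(adj, h_next, tau=0.2, iters=50):
    """Invert forward_step by iterating h = h_next - tau * (A_hat h - h)."""
    h = list(h_next)
    for _ in range(iters):
        h = [hn - tau * d for hn, d in zip(h_next, diffuse(adj, h))]
    return h


# Path graph 0-1-2-3 where the two halves carry opposite features.
adj = normalized_adjacency([(0, 1), (1, 2), (2, 3)], 4)
h = [1.0, 1.0, -1.0, -1.0]
smoothed = forward_step(adj, h)
recovered = reverse_step(adj, smoothed)
sharpened = reverse_step(adj, h)
print(smoothed, recovered, sharpened)
```

In this toy, the forward step shrinks the gap between the adjacent nodes 1 and 2, while applying the reverse step directly to the input widens it, which matches the intuition that reversing propagation keeps differently labeled neighbors apart.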