Jaeseung Heo

jsheo12304@postech.ac.kr

prof_pic.jpg

Hi, I’m Jaeseung Heo, a Ph.D. student at POSTECH ML Lab under the supervision of Prof. Dongwoo Kim. My research aims to identify which training data causes specific model behaviors, with a longer-term goal of extending this to problematic behaviors relevant to safety and alignment. I use training data attribution (TDA), particularly influence functions, to characterize how training examples drive model behavior, and translate these signals into data-centric interventions such as augmentation, label smoothing, and selection. Methodologically, I develop influence functions that capture dependence between training examples, both explicit (as in graph neural networks) and implicit (arising from joint loss minimization). Going forward, I aim to connect TDA with mechanistic interpretability, and ultimately to trace behaviors such as subliminal learning back to their origins in training data.

News

May, 2026 :page_facing_up: A new preprint “Interaction-Aware Influence Functions for Group Attribution” is now on arXiv. We propose an interaction-aware attribution method and apply it to instruction-tuning data selection on Llama-3.1-8B.
Nov, 2025 :page_facing_up: A paper has been accepted to AAAI 2026 (Oral).
Sep, 2025 :page_facing_up: A paper has been accepted to NeurIPS 2025.
Jun, 2025 :ring: I’m delighted to share that I recently got married and began a new chapter in my life.
May, 2024 :page_facing_up: A paper has been accepted to ICML 2024.

Selected Publications

  1. Interaction-Aware Influence Functions for Group Attribution
    Jaeseung Heo, Kyeongheung Yun, Youngbin Choi, Sehyun Hwang, Jungseul Ok, and Dongwoo Kim
    arXiv preprint, 2026
  2. Posterior Label Smoothing for Node Classification
    Jaeseung Heo, Moonjeong Park, and Dongwoo Kim
    AAAI Conference on Artificial Intelligence (AAAI), 2026
  3. Influence Functions for Edge Edits in Non-Convex Graph Neural Networks
    Jaeseung Heo, Kyeongheung Yun, Seokwon Yoon, MoonJeong Park, Jungseul Ok, and Dongwoo Kim
    Advances in Neural Information Processing Systems (NeurIPS), 2025
  4. EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost
    Jaeseung Heo*, Seungbeom Lee*, Sungsoo Ahn, and Dongwoo Kim
    International Joint Conference on Artificial Intelligence (IJCAI), 2024
  5. Mitigating Oversmoothing through Reverse Process of GNNs for Heterophilic Graphs
    MoonJeong Park, Jaeseung Heo, and Dongwoo Kim
    International Conference on Machine Learning (ICML), 2024