I am an Assistant Professor at KAIST Graduate School of AI, where I direct the Structured and Probabilistic Machine Learning (SPML) Lab.
My research focuses on developing machine learning algorithms for molecules, with applications to drug discovery and material design. I enjoy bringing a machine learning perspective—especially a probabilistic one—to scientific problems, and diving deep into the underlying physics, chemistry, and biology.
Members
Yunhui Jang, Hyosoon Jang, Hyomin Kim, Seonghyun Park, Seongsu Kim, Kiyoung Seong, Minkyu Kim, Dongyeop Woo, Nayoung Kim, Minsu Kim (Postdoc), Taewon Kim, Hyunjin Seo, Yinhua Piao (Postdoc), Yoonho Kim, Honghui Kim (Postdoc)
Alumni: Haeji Ko, Juwon Hwang
Publications
- Learning Adaptive Perturbation-Conditioned Contexts for Robust Transcriptional Response Prediction[arxiv]
- Boltz is a Strong Baseline for Atom-level Representation Learning[arxiv]
- Riemannian MeanFlow[arxiv]
- Progressive Multi-Agent Reasoning for Biological Perturbation Prediction[arxiv]
- AtomMOF: All-Atom Flow Matching for MOF-Adsorbate Structure Prediction[arxiv]
- CatFlow: Co-generation of Slab-Adsorbate Systems via Flow Matching[arxiv]
- INDIBATOR: Diverse and Fact-Grounded Individuality for Multi-Agent Debate in Molecular Discovery[arxiv]
- Latent Veracity Inference for Identifying Errors in Stepwise Reasoning (ICLR 2026)[arxiv]
- Learning Collective Variables from BioEmu with Time-Lagged Generation (ICLR 2026)[arxiv]
- DNACHUNKER: Learnable Tokenization for DNA Language Models[arxiv]
- Self-Training Large Language Models with Confident Reasoning (EMNLP 2025)[arxiv]
- MT-Mol: Multi Agent System with Tool-based Reasoning for Molecular Optimization (EMNLP 2025)[arxiv]
- Generative Flows on Synthetic Pathway for Drug Design (ICLR 2025)[arxiv]
- Adaptive Teachers for Amortized Samplers (ICLR 2025)[arxiv]
- Can LLMs Generate Diverse Molecules? Towards Alignment with Structural Diversity[arxiv]
- Iterated Energy-based Flow Matching for Sampling from Boltzmann Densities[arxiv]
- Multi-resolution Spectral Coherence for Graph Generation with Score-based Diffusion (NeurIPS 2023)[paper]
- Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning (ICML 2022)[paper]
- Spanning Tree-based Graph Generation for Molecules (ICLR 2022)[paper]
- Maximum Weight Matching using Odd-sized Cycles: Max-Product Belief Propagation and Half-Integrality (IEEE TIT 2018)[paper]