I am a third-year Integrated M.S./Ph.D. student at KAIST, advised by Professor Joon Son Chung. My research focuses on multimodal learning and generative modeling, particularly for building conversational agents that interact naturally through speech and facial expressions.
I am particularly interested in developing dialogue-aware talking head generation models, expressive speech synthesis, and multimodal interaction systems that integrate audio, text, and visual cues. My broader goal is to advance the capabilities of multimodal large language models (MLLMs) for real-time, human-like interaction.