Seongjun Yang

Hi, I am a NLP Research Engineer at KRAFTON AI. Prior to joining KRAFTON AI, I earned my Bachelor's degree in Computer Science from Yonsei University and completed my Master's at the Graduate School of AI at Korea Advanced Institute of Science and Technology (KAIST), where I was fortunate to be advised by Prof. Edward Choi.
I aim to ensure that AI systems are trustworthy and socially responsible. My research will focus on (1) developing evaluation frameworks to address risks such as bias, privacy, and accountability in generative AI, and (2) creating robust methods to mitigate vulnerabilities, enabling the safe and ethical use of AI in high-stakes fields like healthcare and law.

Email / CV / Google Scholar / Github / Twitter

News

(February 2024): Our work on Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM Decoding is accepted to TMLR 2024.
(April 2023): Our work on Towards a Practical Utility of Federated Learning in the Medical Domain is accepted to CHIL 2023.
(Nov 2022): Joined KRAFTON AI as a NLP Research Engineer
(August 2022): Earned my M.S. in AI from KAIST, where I was mentored by Prof. Edward Choi.
(August 2022): Our work on TAPUDD: Task Agnostic and Post-hoc Unseen Distribution Detection is accepted to WACV 2023.
(September 2022): Our work on Ehrsql: A practical text-to-sql benchmark for electronic health records is accepted to NeurIPS 2022 Datasets and Benchmarks.
(May 2021): Our work on Improving lexically constrained neural machine translation with source-conditioned masked span prediction is accepted to ACL 2021.
(Sep 2020): Began my M.S. studies at the Graduate School of AI at KAIST.
(Oct 2018): Our team wins third prize in 6th Industrial Engineering Competition.

Conference Publications

You can view all papers by visiting Google Scholar or the Publications section of the CV.
	Predictive pipelined decoding: A compute-latency trade-off for exact LLM decoding Seongjun Yang ^, Gibbeum Lee ^, Jaewoong Cho, Dimitris Papailiopoulos, Kangwook Lee TMLR 2024 Paper This paper presents Predictive Pipelined Decoding (PPD), an approach that speeds up decoding in Large Language Models (LLMs) while maintaining the exact same output as the original decoding.
	Towards the Practical Utility of Federated Learning in the Medical Domain Seongjun Yang ^, Hyeonji Hwang ^, Daeyoung Kim, Radhika Dua, Jong-Yeup Kim, Eunho Yang, Edward Choi CHIL 2023 Paper / Code The study introduces federated learning benchmarks for three medical datasets to aid adoption in healthcare. We evaluate six algorithms and a hybrid method (FedPxN), it finds simpler methods often outperform complex ones, with the hybrid consistently performing well.
	Task agnostic and post-hoc unseen distribution detection. Radhika Dua , Seongjun Yang, Yixuan Li, Edward Choi WACV 2023 Paper / Code This study designs a novel clustering-based ensembling method, called Task Agnostic and Post-hoc Unseen Distribution Detection (TAPUDD) that utilizes the features extracted from a model trained on a specific task.
	Ehrsql: A practical text-to-sql benchmark for electronic health records Gyubok Lee , Hyeonji Hwang, Seongsu Bae, Yeonsu Kwon, Woncheol Shin, Seongjun Yang, Minjoon Seo, Jong-Yeup Kim, Edward Choi NeurIPS 2022 Datasets and Benchmarks Paper / Code The study introduces a new EHR text-to-SQL dataset with time expressions and unanswerable questions, collected from 222 hospital staff and linked to the MIMIC-III and eICU databases. It challenges models to generate SQL queries for diverse hospital needs, handle time-sensitive questions, and determine question answerability based on prediction confidence.
	Improving lexically constrained neural machine translation with source-conditioned masked span prediction Gyubok Lee ^, Seongjun Yang ^, Edward Choi ACL 21 Paper / Code The paper proposes a benchmark and training strategy inspired by masked span prediction models to improve neural machine translation of domain-specific terms. This approach enhances terminology accuracy and sentence-level translation across three specialized datasets in two language pairs.

Experience

	NLP Research Engineer, KRAFTON AI Division director: Prof. Kangwook Lee Instruct-tune LLMs, such as LLaMA, and develop prompting strategies for in-game applications.
	AI Researcher, NHN Cloud Designed tutorials for benchmarking Korean Language Models.
	Graduate Student Researcher, KAIST Advisor: Prof. Edward Choi Chosen as a Graduate Student Researcher at KAIST's Graduate School of AI, under Prof. Edward Choi, focusing on Federated Learning and Natural Language Processing.

Teaching

KAIST-AI612: Machine Learning for Healthcare
Instructor: Prof. Edward Choi

KAIST-AI504: Programming for AI
Instructor: Prof. Edward Choi

Project

Industry-academia collaboration project
Supervisor: Prof. Wooju Kim
Mentor: Prof. Haemin Jung
Sponsor: HYUNDAI NGV
Period: June 2018 – October 2019
Project Name: Research on methodologies for developing an information classification system and prototype

I conducted text data extraction and semantic parsing from various file types (PDF, text).

Design and source code from Jon Barron and Radhika Dua's website.