Pang Wei Koh

I'm interested in making machine learning systems more useful, responsible, and reliable in the real world. For example:

Access. How can we expand our access to foundation models, so that we can better understand, build upon, and adapt them? We develop new methods, architectures, and data for efficiently training and deploying fully open models.
Reliability. How do we make our models more reliable and trustworthy? We design new approaches to evaluation and are working on the next generation of retrieval-based models that can reason directly over data.
Impact. What can we do with AI that we could not do before, e.g., accelerate scientific discovery or provide universal access to medical advice?

I received my PhD in Computer Science from Stanford, advised by Percy Liang. Before that, I was the 3rd employee and Director of Partnerships at Coursera. I was also an undergraduate at Stanford, advised by Andrew Ng and Daphne Koller.

I'm part of the UW ML and NLP groups, and I'm also a visiting research scientist at AI2. If you're interested in joining our group, please read this. This cycle, I'm also looking for prospective students/postdocs interested in AI for science.

Current students

Scott Geng
PhD student
(with Ranjay Krishna)

Jacqueline He
PhD student
(with Luke Zettlemoyer)

Rulin Shao
PhD student
(with Luke Zettlemoyer)

Rui Xin
PhD student
(with Sewoong Oh)

Ian Magnusson
PhD student
(with Noah Smith)

Zhiyuan Zeng
PhD student
(with Hanna Hajishirzi)

Molly Park
Undergrad

Alumni

Qiao Rui (Visiting PhD 2024, now PhD student at the National University of Singapore)
Irena Gao (MS 2023, now PhD student at Stanford University)
Kendrick Shen (MS 2022, now ML research engineer at Genesis Therapeutics)
Henrik Marklund (MS 2021, now PhD student at Stanford University)
Kai-Siang Ang (MS 2021, now Technical Lead Manager at Nuro)
Erik Jones (MS 2020, now researcher at Anthropic)
Hubert Teo (MS 2019, now senior software engineer at CodeSignal)
Thao Nguyen (BS 2019, now PhD student at the University of Washington)
Yew-Siang Tang (BS 2019, now staff software engineer at You.com)

Publications

* = equal contribution.

Spurious rewards: Rethinking training signals in RLVR

Rulin Shao*, Shuyue Stella Li*, Rui Xin*, Scott Geng*, Yiping Wang, Sewoong Oh, Simon Shaolei Du, Nathan Lambert, Sewon Min, Ranjay Krishna, Yulia Tsvetkov, Hannaneh Hajishirzi, Pang Wei Koh, and Luke Zettlemoyer

arXiv 2025

(paper) (code)

Precise information control in long-form text generation

Jacqueline He, Howard Yen, Margaret Li, Shuyue Stella Li, Zhiyuan Zeng, Weijia Shi, Yulia Tsvetkov, Danqi Chen, Pang Wei Koh, and Luke Zettlemoyer

arXiv 2025

(paper) (code)

ReasonIR: Training retrievers for reasoning tasks

Rulin Shao*, Rui Qiao*, Varsha Kishore, Niklas Muennighoff, Xi Victoria Lin, Daniela Rus, Bryan Kian Hsiang Low, Sewon Min, Wen-tau Yih, Pang Wei Koh, and Luke Zettlemoyer

arXiv 2025

(paper) (code)

A false sense of privacy: Evaluating textual data sanitization beyond surface-level privacy leakage

Rui Xin*, Niloofar Mireshghallah*, Shuyue Stella Li, Michael Duan, Hyunwoo Kim, Yejin Choi, Yulia Tsvetkov, Sewoong Oh, and Pang Wei Koh

arXiv 2025