Scott Lundberg

Member of Technical Staff

Microsoft AI

Biography

I am a Member of Technical Staff at Microsoft AI (Health), and an Affiliate Assistant Professor at the University of Washington. Previously I was a Staff Research Scientist at Google DeepMind, and a Senior Researcher at Microsoft Research. My work focuses on large language models, explainable artificial intelligence, and their application to problems in medicine and healthcare. This has led to the development of broadly applicable methods and tools for complex machine learning models that are now used in banking, logistics, manufacturing, cloud services, economics, sports, and other areas. I did my Ph.D. studies at the Paul G. Allen School of Computer Science & Engineering of the University of Washington working with Su-In Lee.

Interests

Language Models
Explainable AI
Machine Learning
Healthcare
Genomics

Education

PhD in Computer Science, 2019
University of Washington

MS in Computer Science, 2008
Colorado State University

BS in Computer Science, 2005
Colorado State University

Selected Projects

Google Gemini

Contributed to Google’s Gemini model family during my tenure at Google DeepMind, specifically to its coding and reasoning capabilities.

Guidance: A Language for Controlling Large Language Models

A popular open-source library for controlling large language models. Enables reliable structured generation, token healing, and fast execution. Used by OpenAI for JSON structured output mode.

Explain any machine learning model

A popular package that uses SHAP values (theoretically grounded feature attributions) to explain the output of any machine learning model. It is actively used by thousands of data scientists representing a diverse set of organizations, including startups, non-profits, major tech companies, NBA teams, banks, and medical providers. It has high-speed algorithm integrations with XGBoost, LightGBM, CatBoost, scikit-learn, TensorFlow, and PyTorch.

Prescience

Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Prescience is a machine-learning-based system that predicts the risk of hypoxaemia and provides explanations of the risk factors in real time during general anaesthesia. The system improved the performance of anaesthesiologists by providing interpretable hypoxaemia risks and contributing factors.

Exact game-theoretic explanations for trees

Consistent individualized feature attribution for tree ensembles. While computing the classic Shapley values from game theory is NP-hard in general, we show how to compute them exactly in low-order polynomial time for tree ensembles. This enables us to provide explanations of individual machine learning predictions that come with strong theoretical guarantees and no sampling variability.
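
To illustrate why an exact polynomial-time algorithm matters, here is a minimal sketch of the brute-force baseline: classic Shapley values computed by enumerating every coalition of features, which costs time exponential in the number of features. This is not the tree algorithm described above; `shapley_values` and `toy_value` are hypothetical names chosen for this illustration, and the toy value function is made up.

```python
from itertools import combinations
from math import factorial


def shapley_values(n_features, value):
    """Exact Shapley values by enumerating all coalitions (exponential cost).

    `value` maps a frozenset of feature indices to a number, playing the
    role of the model output when only those features are "present".
    """
    players = range(n_features)
    phi = [0.0] * n_features
    for i in players:
        others = [j for j in players if j != i]
        for size in range(len(others) + 1):
            # Classic Shapley weight: |S|! (M - |S| - 1)! / M!
            weight = (
                factorial(size)
                * factorial(n_features - size - 1)
                / factorial(n_features)
            )
            for subset in combinations(others, size):
                s = frozenset(subset)
                # Marginal contribution of feature i to coalition s
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_value(coalition):
    """Hypothetical value function: two main effects plus one interaction."""
    total = 0.0
    if 0 in coalition:
        total += 1.0  # main effect of feature 0
    if 1 in coalition:
        total += 2.0  # main effect of feature 1
    if 1 in coalition and 2 in coalition:
        total += 3.0  # interaction, split evenly between features 1 and 2
    return total


phi = shapley_values(3, toy_value)
```

The attributions sum to the difference between the value of the full feature set and the value of the empty set, which is the property that makes them usable as additive explanations. The nested loops visit every subset of the other features, so the cost doubles with each added feature.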

ChromNet

Learning the human chromatin network from all ENCODE ChIP-seq data. A cell’s epigenome arises from interactions among regulatory factors (transcription factors and histone modifications) co-localized at particular genomic regions. We developed ChromNet to infer a network of these interactions, the chromatin network, by estimating conditional-dependence relationships among a large number of ChIP-seq data sets from the ENCODE project.

Unifying Explanation Methods

Understanding why a model makes a certain prediction can be as crucial as the prediction’s accuracy in many applications. However, the highest accuracy for large modern datasets is often achieved by complex models that even experts struggle to interpret, such as ensemble or deep learning models, creating a tension between accuracy and interpretability. In response, various methods have recently been proposed to help users interpret the predictions of complex models, but it is often unclear how these methods are related and when one method is preferable over another. To address this problem, we presented a unified framework for interpreting predictions, SHAP (SHapley Additive exPlanations).
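
As a brief sketch of what the framework looks like in symbols: an explanation is an additive model over simplified binary inputs, with each feature's attribution given by its Shapley value. The notation here is assumed for this sketch: $M$ is the number of features, $z' \in \{0,1\}^M$ marks which features are present, and $f_x(S)$ is the expected model output when only the features in $S$ are known.

```latex
% Additive feature attribution: the explanation model g
g(z') = \phi_0 + \sum_{i=1}^{M} \phi_i z'_i

% Shapley value of feature i for the prediction f(x)
\phi_i = \sum_{S \subseteq \{1,\dots,M\} \setminus \{i\}}
         \frac{|S|!\,(M - |S| - 1)!}{M!}
         \left[ f_x(S \cup \{i\}) - f_x(S) \right]
```

With $\phi_0 = f_x(\emptyset)$, the attributions sum to the model output for the instance being explained: $f(x) = \phi_0 + \sum_{i=1}^{M} \phi_i$.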

Publications and Patents

For the most up-to-date list of publications, please visit my Google Scholar profile. Older publications are below.

S. Lundberg. Explaining Quantitative Measures of Fairness. Fair & Responsible AI Workshop @ CHI2020, 2020.


S. Lundberg, G. Erion, H. Chen, A. DeGrave, J. Prutkin, B. Nair, R. Katz, J. Himmelfarb, N. Bansal, S. Lee. From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, volume 2, pages 56–67, 2020. (selected to be the cover article)


N. Hiranuma, S. Lundberg, S. Lee. AIControl: Replacing matched control experiments with machine learning improves ChIP-seq peak identification. Nucleic Acids Research, 2019.


S. Lundberg, B. Nair, M. Vavilala, M. Horibe, M. Eisses, T. Adams, D. Liston, D. Low, S. Newman, J. Kim, S. Lee. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nature Biomedical Engineering, volume 2, pages 749–760, 2018. (selected to be the cover article)


S. Lundberg, G. Erion, S. Lee. Consistent Individualized Feature Attribution for Tree Ensembles. arXiv, 2018.


S. Lee, S. Celik, B. Logsdon, S. Lundberg, T. Martins, V. Oehler, E. Estey, C. Miller, S. Chien, J. Dai, A. Saxena. A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia. Nature Communications, 2018.


S. Lundberg, S. Lee. A unified approach to interpreting model predictions. NeurIPS, 2017. (selected for oral presentation)


G. Erion, H. Chen, S. Lundberg, S. Lee. Anesthesiologist-level forecasting of hypoxemia with only SpO2 data using deep learning. NeurIPS Workshop ML4H: Machine Learning for Health, 2017.


H. Chen, S. Lundberg, S. Lee. Hybrid Gradient Boosting Trees and Neural Networks for Forecasting Operating Room Data. NeurIPS Workshop ML4H: Machine Learning for Health, 2017.


S. Lundberg, S. Lee. An unexpected unity among methods for interpreting model predictions. NeurIPS Workshop on Interpretable Machine Learning in Complex Systems, 2016. (best paper award)


N. Hiranuma, S. Lundberg, S. Lee. CloudControl: Leveraging many public ChIP-seq control experiments to better remove background noise. In Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM, 2016.


S. Lundberg, W. Tu, B. Raught, L. Penn, M. Hoffman, S. Lee. ChromNet: Learning the human chromatin network from all ENCODE ChIP-seq data. Genome Biology, 2016. (F1000Prime recommended)

