Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
- Leander van Eekelen 1, Joey Spronck 2, Monika Looijen-Salamon 2, Shoko Vos 2, Enrico Munari 3, Ilaria Girolami 4, Albino Eccher 5, Balazs Acs 6, Ceren Boyaci 6, Gabriel Silva de Souza 2, Muradije Demirel-Andishmand 2, Luca Dulce Meesters 2, Daan Zegers 2, Lieke van der Woude 2, Willemijn Theelen 7, Michel van den Heuvel 8, Katrien Grünberg 2, Bram van Ginneken 9, Jeroen van der Laak 2, Francesco Ciompi 2
- 1Department of Pathology, Radboud University Medical Center, P.O.Box 9101, 6500 HB, Nijmegen, The Netherlands. leander.vaneekelen@radboudumc.nl.
- 2Department of Pathology, Radboud University Medical Center, P.O.Box 9101, 6500 HB, Nijmegen, The Netherlands.
- 3Pathology Unit, Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy.
- 4Department of Pathology, Provincial Hospital of Bolzano (SABES-ASDAA), Bolzano-Bozen, Italy.
- 5Department of Pathology and Diagnostics, University and Hospital Trust of Verona, Verona, Italy.
- 6Department of Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden.
- 7Department of Thoracic Oncology, Netherlands Cancer Institute, Amsterdam, The Netherlands.
- 8Respiratory Diseases Department, Radboud University Medical Center, Nijmegen, The Netherlands.
- 9Department of Radiology, Radboud University Medical Center, Nijmegen, The Netherlands.
- 0Department of Pathology, Radboud University Medical Center, P.O.Box 9101, 6500 HB, Nijmegen, The Netherlands. leander.vaneekelen@radboudumc.nl.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.
View abstract on PubMed
Summary
This summary is machine-generated.A new deep learning algorithm detects programmed death-ligand 1 (PD-L1) expression at the cellular level. This AI shows promise for improving immune-checkpoint inhibitor eligibility assessment by quantifying tumor proportion score (TPS).
Area Of Science
- Computational pathology
- Artificial intelligence in oncology
- Immunohistochemistry analysis
Background
- Programmed death-ligand 1 (PD-L1) expression, assessed by tumor proportion score (TPS), is crucial for immune-checkpoint inhibitor eligibility.
- Current clinical assessment of PD-L1 TPS suffers from high interobserver variability, limiting treatment efficacy.
- Existing automated quantification systems often focus on slide-level TPS and lack cellular-level PD-L1 expression analysis.
Purpose Of The Study
- To develop and evaluate a deep learning algorithm for cell-level detection of PD-L1 negative and positive tumor cells.
- To establish a cell-level reference standard for benchmarking computer vision algorithms in PD-L1 assessment.
- To evaluate the algorithm's performance in slide-level tumor proportion score (TPS) quantification against pathologist agreement.
Main Methods
- A deep learning algorithm was developed for cell-level PD-L1 expression detection.
- The algorithm was evaluated on a multi-centric dataset with a cell-level reference standard created by six expert readers.
- Slide-level TPS quantification was assessed by comparing algorithm predictions with those of six pathologists.
Main Results
- The algorithm achieved a mean reader-AI F1 score of 0.55, slightly below the interobserver agreement (mean reader-reader F1 score = 0.68), particularly on external clinical data.
- Despite cell-level challenges, the AI demonstrated good agreement with pathologists in quantifying TPS (mean reader-AI Cohen's kappa = 0.49), comparable to interobserver agreement (mean reader-reader kappa = 0.54).
- The study established a novel benchmark for computer vision algorithms using a multi-reader, multi-assay cell-level PD-L1 dataset.
Conclusions
- The developed deep learning algorithm shows potential for accurate cell-level PD-L1 expression detection.
- The AI exhibits favorable agreement with pathologists in TPS quantification, offering a promising tool to reduce interobserver variability.
- The study's models are publicly released on the Grand-Challenge platform to facilitate further research and development.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.

