Loading...
Publications
1 min read

Towards Training Large-Scale Pathology Foundation Models: from TCGA to Hospital Scale

Initial work towards our first pathology vision FM preceding and laying the groundwork for our most recent Midnight family of models.
Written by:kaiko.ai,Nanne Aben,Edwin D. de Jong,Ioannis Gatopoulos,Nicolas Känzig,Mikhail Karasikov,Axel Lagré,Roman Moser,Joost van Doorn,Fei Tang
Published on: arXiv

Abstract

Driven by the recent advances in deep learning methods and, in particular, by the development of modern self-supervised learning algorithms, increased interest and efforts have been devoted to build foundation models (FMs) for medical images.In this work, we present our scalable training pipeline for large pathology imaging data, and a comprehensive analysis of various hyperparameter choices and training techniques for building pathology FMs. We release and make publicly available the first batch of our pathology FMs1 trained on open-access TCGA whole slide images, a commonly used collection of pathology images.

The experimental evaluation shows that our models reach state-of-the-art performance on various patch-level downstream tasks, ranging from breast cancer subtyping to colorectal nuclear segmentation. Finally, to unify the evaluation approaches used in the field and to simplify future comparisons of different FMs, we present an open-source framework2 designed for the consistent evaluation of pathology FMs across various downstream task

View full article