Send us Fan Mail
Are pathology foundation models actually ready for labs, or are they still stronger on paper than in practice?
In this episode of DigiPath Digest #49, I unpack a timely review on pathology foundation models and ask the question that matters most to me: not just what these models can do, but what has to be true before they are genuinely useful in real pathology workflows.
I walk through how pathology AI moved from narrow, task-specific models into the era of transformer-based foundation models. That shift matters because pathology is no longer only about looking at H&E in isolation. Today, pathologists are expected to integrate morphology, immunohistochemistry, molecular assays, genomics, and clinical context. That growing complexity is one reason foundation models are getting so much attention.
In this discussion, I explain how transformers entered pathology, why image patches are treated like tokens, and how shared embeddings can support classification, regression, segmentation, and multimodal retrieval. I also go through the major pathology foundation models mentioned in the paper, including Virchow/Virchow2, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, GigaPath, and TITAN, and why scale alone is not the full story.
A big part of this episode is about the gap between benchmark performance and clinical readiness. I talk about the persistent limitations in training data diversity, the overuse of TCGA, and why public benchmarks can still miss what real pathology practice looks like. I also cover where foundation models still struggle, especially in cytopathology, hematopathology, and underrepresented disease areas, along with the real-world problems of artifacts, domain shift, concept drift, infrastructure burden, regulatory complexity, and workflow disruption.
For me, one of the most important themes is this: AI in pathology should augment, not replace, pathologists. The future is not about handing diagnosis to a model. It is about building tools that support pathologists better, fit real workflows, and can be validated in ways that deserve trust.
I also spend time on what comes next: explainable AI, counterfactual explanations, conversational interfaces, retrieval-augmented systems, multimodal fusion, and the need for deployment-centric validation rather than paper-only excitement.
If you are trying to understand where pathology foundation models really stand today, this episode will help you separate the promise from the practical barriers.
Episode Highlights
00:01 – Why I chose this paper, what is changing at Digital Pathology Place, and why foundation models are worth paying attention to now.
02:15 – The core questions: what pathology foundation models are, where they are, and how difficult they are to apply in pathology.
04:50 – Why pathology is becoming more cognitively demanding, and how multimodal complexity is driving interest in scalable AI.
07:02 – From narrow AI to transformers: how pathology moved beyond single-task CNN models.
10:16 – How transformers work in pathology: image patches as tokens, self-attention, embeddings, and downstream tasks.
14:16 – Why multimodality matters, and what kinds of data foundation models may eventually integrate.
15:27 – Timeline of key model developments, from “Attention Is All You Need” to gigapixel-scale pathology foundation models.
17:13 – The leading models and what scale really looks like: Virchow, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, and GigaPath.
19:51 – Why dataset diversity matters more than sheer volume, and why TCGA is not enough.
23:17 – Where foundation models still struggle: cytopathology, hematopathology, rare disease, artifacts, scanner shifts, and pen marks.
28:06 – Explainability, counterfactual explanations, and why trust in pathology AI needs more than attention maps.
30:17 – The real deployment hurdles: regulation, infrastructure, workflow fit, and economics.
36:32 – Why AI should augment pathologists, not replace them, and which tedious tasks pathologists would gladly hand over.
38:36 – Retrieval-augmented and conversational AI in pathology: where interactive systems may actually help.
40:51 – Vision-language models and multimodal fusion with histology, radiology, genomics, and clinical notes.
42:16 – The path forward: deployment-centric design, prospective multi-site validation, and human-AI collaboration.
44:08 – Closing thoughts on AI literacy, community learning, and what needs to happen next.
Resources Mentioned
Support the show
Get the "Digital Pathology 101" FREE E-book and join us!