
Share
Despite billions invested by tech giants, AI foundation models for pathology struggle to gain traction in hospitals. Clinicians remain skeptical as the technology falls short of promised accuracy and reliability.
In recent years, billions of dollars have been invested in developing foundation models for pathology. Major tech giants like Google, Meta, and Microsoft have redirected significant resources toward creating "universal" AI systems that could transform digital pathology. These models were trained on vast datasets with the promise of becoming highly accurate feature extractors capable of handling any pathology task. However, despite this substantial investment, clinical adoption remains minimal. Hospitals are not replacing pathologists with these AI tools, and clinics are not using them as primary diagnostic aids. The core issue lies in a fundamental mismatch between what these models were trained on and the real-world needs of pathologists.
The stakes are high. Pathology plays a crucial role in diagnosing diseases like cancer, where early and accurate detection can be life-saving. AI has the potential to enhance the efficiency and accuracy of pathology, but only if it is designed with the specific needs of pathologists in mind. The current gap between the capabilities of these models and their practical use in clinical settings highlights a critical need for rethinking our approach.
One of the primary reasons foundation models are failing in pathology is what experts call "domain mismatch." These AI systems begin their training on diverse datasets like ImageNet, which contain millions of photographs of natural scenes, objects, animals, and landscapes. This pretraining helps them develop powerful general-purpose features for recognizing patterns in natural images.
However, pathology slides are a different beast entirely. They show remarkable uniformity with consistent H&E staining (a standard technique used to differentiate cell structures), similar tissue magnification, and homogeneous color palettes. The features that foundation models learn from diverse compositions, varied lighting, and natural scene structures are often irrelevant when applied to the structured and standardized world of pathology slides.
This domain mismatch has significant consequences. When pathologists use these AI tools, they find them frustratingly inaccurate. The inductive biases built into the model architecture-essentially the assumptions the model makes about what it sees-are optimized for ImageNet-style patterns rather than tissue morphology. This means that even with additional pathology-specific pretraining using techniques like contrastive learning (SimCLR, MoCo) or masked image modeling, the performance of these models remains subpar.

The failure of foundation models in pathology suggests that simply scaling up training data and computational resources is not enough. Instead, a more tailored approach is needed-one that takes into account the unique characteristics of pathology data and the specific needs of pathologists.
One promising direction is to develop specialized AI models that are trained from the ground up on pathology datasets. These models would be designed to recognize the subtle patterns and features that are crucial for accurate diagnosis in medical imaging. Additionally, involving pathologists in the development process can help ensure that these tools are user-friendly and clinically relevant.
Collaboration between AI researchers and medical professionals is essential. Pathologists bring invaluable expertise and insights into what works-and what doesn’t-in clinical practice. By working together, we can create AI tools that not only perform well on benchmarks but also meet the real-world needs of healthcare providers.
The current state of foundation models in pathology highlights a critical need for rethinking our approach to medical AI. While these models have shown promise in other domains, their failure in pathology underscores the importance of domain-specific design and collaboration with clinical experts. By addressing the domain mismatch and focusing on user-centered development, we can pave the way for more effective and reliable AI tools in healthcare.
Tags
Original Sources
About the author
Amara's entry point into AI was an epidemiology role at a London research hospital, where she spent five years studying how digital health tools reached — or conspicuously failed to reach — underserved communities. Watching early algorithmic systems in healthcare quietly entrench existing inequalities, she redirected her career toward the systemic consequences of AI at scale. She covers AI through an unflinching lens: who benefits, who bears the cost, and what evidence actually says versus what the press release claims. Her writing is calm and precise, but she doesn't mistake balance for neutrality.
More from The Steward →This Week's Edition
30 October 2025
88 articles
Related Articles
Related Articles
More Stories