Histology of human cells

New Foundation Model Reveals How Cells Are Organized in Tissues

AI New Research Findings Computational Health ICB

Researchers at Helmholtz Munich and the Technical University of Munich (TUM) have developed Nicheformer, the first large-scale foundation model that integrates single-cell analysis with spatial transcriptomics. Trained on more than 110 million cells, it offers a new way to study how cells are organized and interact in tissues – knowledge that is crucial for understanding health and disease.

Missing Context in Single-Cell Data

Single-cell RNA sequencing has transformed biology by showing which genes are active in individual cells. However, this approach requires cells to be removed from their natural environment, erasing information about their position and neighbors. Spatial transcriptomics preserves this context but is technically more limited and harder to scale. Researchers have long lacked a way to study cell identity and tissue organization together.

AI Model Reveals Hidden Tissue Structures

Nicheformer overcomes this barrier by learning from both dissociated and spatial data. It can “transfer” spatial context back onto cells that were previously studied in isolation – essentially reconstructing how they fit into the bigger picture of a tissue. To make this possible, the research team created SpatialCorpus-110M, one of the largest curated resources of single-cell and spatial data to date. In their study published in Nature Methods, the model consistently outperformed existing approaches and showed that spatial patterns leave measurable traces in gene expression, even when cells are dissociated. Beyond performance, the researchers also explored interpretability, revealing that the model identifies biologically meaningful patterns in its internal layers – offering a new window into how AI learns from biology.

“With Nicheformer we can now transfer spatial information onto dissociated single-cell data at scale,” says Alejandro Tejada-Lapuerta, PhD student at Helmholtz Munich and TUM and co-first author of the study together with Anna Schaar. “This opens up many possibilities to study tissue organization and cellular neighborhoods without additional experiments.”

The study connects to the emerging idea of a “Virtual Cell”, a computational representation of how cells behave and interact within their native environments. While this concept is gaining momentum across biology and AI, previous models have largely treated cells as isolated entities, without reasoning their spatial relationships. Nicheformer is the first foundation model to learn directly from spatial organization, offering a way to reconstruct how cells sense and influence their neighbors. Beyond introducing this new capability, the researchers also present an entire suite of spatial benchmarking tasks that challenge future models to capture tissue architecture and collective cellular behavior – an essential step toward biologically realistic AI systems.

Info Box: Single-Cell Analysis vs. Spatial Transcriptomics

Single-cell analysis: Measures the molecular profile (e.g., gene activity) of individual cells, but cells are studied outside their original tissue context.
Spatial transcriptomics: Measures gene activity directly in tissue slices, keeping the spatial arrangement of cells intact.
Nicheformer combines both approaches, projecting the spatial context back onto dissociated single-cell data.

Next Steps

“With Nicheformer we are taking the first steps toward building general-purpose AI models that represent cells in their natural context – the foundation of a Virtual Cell and Tissue model,” says Prof. Fabian Theis, Director of the Computational Health Center at Helmholtz Munich and Professor at TUM. “Such models will transform how we study health and disease and could ultimately guide the development of new therapies.”

In their next project, the team aims to develop a “tissue foundation model” that also learns the physical relationships between cells. Such a model could help analyze tumor microenvironments and other complex structures in the body with direct relevance for diseases such as cancer, diabetes, and chronic inflammation.

Original Publication

Tejada-Lapuerta et al., 2025: Nicheformer: a foundation model for single-cell and spatial omics. Nature Methods. DOI: 10.1038/s41592-025-02814-z.  
 

Fabian Theis
Prof. Dr. Dr. Fabian Theis

Director of Computational Health Center, Director of Institute for Computational Biology

View profile
Alejandro Tejada

PhD candidate

Anna Schaar

PhD candidate

Related news

Prologue with Eric Topol

AI, Computational Health, ICB,

Prof. Eric Topol zu Gast bei Helmholtz Munich: Runder Tisch zum „Virtual Human“

Helmholtz Munich begrüßte Prof. Eric Topol, den international renommiertern Kardiologen, Genetiker und Experten für digitale Medizin, zu einem Round Table über die Zukunft der Künstlichen Intelligenz (KI) in der Medizin. Die Veranstaltung, die als…

Lung Cell Atlas

AI, Environmental Health, PRM, Computational Health,

Helmholtz Munich und Parse Biosciences Collaborate entwickeln Human Lung Tissue Perturbation Atlas

Helmholtz Munich und Parse Biosciences starten eine Zusammenarbeit zur Erstellung eines der bislang umfassendsten Perturbationsatlanten für Lungenerkrankungen. Das Projekt basiert auf einem ex vivo Gewebeschnittmodell der menschlichen Lunge, das…

HMGU_Icon_Computat_Health

AI, Featured Publication, Computational Health, ICB,

Integrierter Einzelzell-Atlas menschlicher atherosklerotischer Plaques

Forschende von Helmholtz Munich, von Roche Diagnostics und der Technischen Universität München haben den bislang umfassendsten Einzelzell-Atlas menschlicher atherosklerotischer Plaques erstellt. Diese Plaques, die sich in den Arterien bilden und zu…

Human (animal) cell under microscope. 3d illustration

New Research Findings, Computational Health, ICB,

Details im Fokus: Neue Methode enthüllt das Innenleben unserer Zellen

Fabian Theis und dänische Kolleg:innen haben ein besseres Verständnis dafür gewonnen, wie sich Blut­zellen aus Stammzellen entwickeln. Weitere Studien könnten klären, warum manche Zellen sich gegen den Körper wenden und gesundheitliche Probleme…

An AI powered system automating remote patient monitoring by analyzing real time health data and vital signs, futuristic AI-driven healthcare platform

AI, Computational Health, HCA, ICB, IML,

Wie Foundation-Modelle die biomedizinische Forschung prägen

KI-basierte Foundation-Modelle wie GPT haben sich von einfachen Alltagswerkzeugen zu leistungsstarken Systemen entwickelt, die ganze Branchen transformieren können. Forschende bei Helmholtz Munich nutzen das enorme Potenzial dieser Modelle, um…