Gruppenbild Paolo Casale Group

Systems Genetics and Machine Learning

Francesco Paolo Casale Lab

Paolo’s research group focuses on the development and application of machine learning and statistical approaches to advance our understanding of complex trait and disease biology.

About our Research

Our research interests lie in the development and application of machine learning and statistical tools to analyze large genetic cohorts with deep molecular and phenotypic data, with the ultimate goal to further our understanding of complex trait biology. We aim to address fundamental biomedical questions such as: Which are the molecular, cellular and organ-level traits associated with disease severity and progression? Which of these are likely to drive disease pathogenesis? How does the interplay of genetic and environmental factors affect these traits?

Our approach combines principles from machine learning, statistical inference and systems genetics, with a strong focus on model scalability, robustness and interpretability. Current major research areas include the development of scalable tools for genetic association studies, deep learning models for imaging genetics, and computational methods to study gene-environment interactions and disease subtypes.

Leverage scalable machine learning and statistical tools together with large system genetics datasets to further our understanding of human disease biology.

Lab Members

Paolo Casale freigestellt
Francesco Paolo Casale

PI "Systems Genetics and Machine Learning", Helmholtz Pioneer Campus

View profile
Ayshan Alieva freigestellt
Ayshan Aliyeva
Subham Chaudhary freigestellt
Shubham Chaudhary
Jan Engelmann freigestellt
Jan Engelmann
Diyuan Lu freigestellt
Diyuan Lu
Mairi McClean freigestellt
Mairi McClean
Martin Meinel freigestellt
Martin Meinel
Anonio Nappi freigestellt
Antonio Nappi
Benedikt Roth freigestellt
Benedikt Roth

Publications

Vercellone, F. ; Kundu, S. ; Esposito, A. ; Chiariello, A.M. ; Conte, M. ; Abraham, A. ; Fontana, A. ; Pierno, F.D. ; Guha, S. ; Carluccio, C.D. ; Olimpo, M. ; Nicodemi, M. ; Casale, F.P. ; Bianco, S.

Physics-based modeling of sparse single-cell Hi-C uncovers structural and epigenetic variability.
Genome Biol. 27:122 (2026)

Chaudhary, S. ; Voigts, A. ; Bereket, M. ; Albert, M.L. ; Schwamborn, K. ; Zeggini, E. ; Casale, F.P.

HistoGWAS: An AI-enabled framework for automated genetic analysis of tissue phenotypes in histology cohorts.
2025 in
In: (20th Machine Learning in Computational Biology, MLCB 2025, 10-11 September 2025, New York). 2025. accepted ( ; 311)

Chaudhary, S. ; Voigts, A. ; Vilov, S. ; Heinig, M. ; Casale, F.P.

AI-based histopathology phenotyping reveals germline loci shaping breast cancer morphology.
Genome Res. 35, 2682-2690 (2025)

Nappi, A. ; Shilova, L. ; Karaletsos, T. ; Cai, N. ; Casale, F.P.

BayesRVAT enhances rare-variant association testing through Bayesian aggregation of functional annotations.
In: (Research in Computational Molecular Biology). 2025. 428-431 (Lect. Notes Comput. Sc. ; 15647 LNBI)

Nappi, A. ; Cai, N. ; Casale, F.P.

Bayesian aggregation of multiple annotations enhances rare variant association testing.
Alzheimers Dement. 21:e70170 (2025)

Luan, Y. ; Zheng, L. ; Denecke, J. ; Dehsarvi, A. ; Roemer-Cassiano, S.N. ; Dewenter, A. ; Steward, A. ; Shcherbinin, S. ; Svaldi, D.O. ; Kotari, V. ; Higgins, I.A. ; Pontecorvo, M.J. ; Valentim, C. ; Schnabel, J.A. ; Casale, F.P. ; Dyrba, M. ; Teipel, S. ; Franzmeier, N. ; Ewers, M.

Multimodal spatial gradients to explain regional susceptibility to fibrillar tau in Alzheimer's disease.
Nat. Commun. 16:3061 (2025)

Hölzlwimmer, F.R. ; Lindner, J. ; Tsitsiridis, G. ; Wagner, N. ; Casale, F.P. ; Yépez, V.A. ; Gagneur, J.

Aberrant gene expression prediction across human tissues.
Nat. Commun. 16:3278 (2025)

Han, S. ; Yu, S. ; Shi, M. ; Harada, M. ; Ge, J. ; Lin, J. ; Prehn, C. ; Petrera, A. ; Li, Y. ; Sam, F. ; Matullo, G. ; Adamski, J. ; Suhre, K. ; Gieger, C. ; Hauck, S.M. ; Herder, C. ; Roden, M. ; Casale, F.P. ; Cai, N. ; Peters, A. ; Wang-Sattler, R.

LEOPARD: Missing view completion for multi-timepoint omics data via representation disentanglement and temporal knowledge transfer.
Genome Res. 34, 1276-1285 (2024)

Sens, D. ; Shilova, L. ; Gräf, L. ; Grebenshchikova, M. ; Eskofier, B.M. ; Casale, F.P.

Genetics-driven risk predictions leveraging the Mendelian randomization framework.
In: (Research in Computational Molecular Biology). Gewerbestrasse 11, Cham, Ch-6330, Switzerland: Springer International Publishing Ag, 2024. 385-389 (Lect. Notes Comput. Sc. ; 14758 LNCS)

Gräf, L. ; Sens, D. ; Shilova, L. ; Casale, F.P.

Disease risk predictions with differentiable mendelian randomization.

Contact PioneerCampus

Porträt Paolo Casale
Francesco Paolo Casale

PI "Systems Genetics & Machine Learning"

View profile