Abstract
This study presents a comprehensive benchmarking of three state-of-the-art single-cell foundation models scGPT, Geneformer, and scFoundation, on cell-type classification tasks. We evaluate the models on three datasets: myeloid, human pancreas, and multiple sclerosis, examining both standard fine-tuning and few-shot learning scenarios. Our work reveals that scFoundation consistently achieves the best performance while Geneformer performs poorly, yielding results sometimes even worse than those of the baseline models. Additionally, we demonstrate that a good foundation model can generalize well even when fine-tuned with out-of-distribution data, a capability that the baseline models lack. Our work highlights the potential of foundation models for addressing challenging biomedical questions, particularly in contexts where models are trained on one population but deployed on another.
| Originalsprache | Englisch |
|---|---|
| Titel des Sammelwerks | WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining |
| Herausgeber (Verlag) | Association for Computing Machinery, Inc |
| Seiten | 1112-1113 |
| Seitenumfang | 2 |
| ISBN (elektronisch) | 9798400713293 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 10 März 2025 |
| Veranstaltung | 18th ACM International Conference on Web Search and Data Mining, WSDM 2025 - Hannover, Deutschland Dauer: 10 März 2025 → 14 März 2025 |
Konferenz
| Konferenz | 18th ACM International Conference on Web Search and Data Mining, WSDM 2025 |
|---|---|
| Kurztitel | WSDM 2025 |
| Land/Gebiet | Deutschland |
| Ort | Hannover |
| Zeitraum | 10 März 2025 → 14 März 2025 |
ASJC Scopus Sachgebiete
- Computernetzwerke und -kommunikation
- Angewandte Informatik
- Software
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver