Skip to main navigation Skip to search Skip to main content

Neuro-Symbolic Federated Research Artifact Search

  • Farhana Keya*
  • , Sören Auer
  • , Mohamad Yaser Jaradeh
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abstract

Scientific research artifacts such as datasets, software or ontologies are essential components of scientific discovery. Yet, the growing volume of such artifacts requires more efficient and relevant search and retrieval systems. We present a neuro-symbolic approach for federated research artifact search, specifically for datasets and software metadata over Resodate and Wikidata. Integrated into the ORKG ASK platform, our system processes user queries through linguistic analysis to extract key terms. These key terms are then used to retrieve and recommend relevant research artifacts from federated sources, ensuring precise and contextually relevant metadata discovery. To further enhance retrieval accuracy, we employ a ranking mechanism that organizes research artifacts based on each user query’s structure and morphological features. We evaluate various key-term extraction methods and ranking approaches, integrating both symbolic and neural techniques. We rigorously evaluate the key-term extraction using Precision, Recall, and F1-score, and assess the re-ranking effectiveness by comparing with human rankings through correlation metrics and LLM-based evaluations. Our experiments show that symbolic methods outperform the neural approach regarding accuracy and response time. As a result, our system offers users more effective and efficient research artifact recommendations.

Original languageEnglish
Title of host publicationLinking Theory and Practice of Digital Libraries - 29th International Conference on Theory and Practice of Digital Libraries, TPDL 2025, Proceedings
EditorsWolf-Tilo Balke, Koraljka Golub, Yannis Manolopoulos, Kostas Stefanidis, Zheying Zhang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages145-162
Number of pages18
ISBN (Electronic)978-3-032-05409-8
ISBN (Print)9783032054081
DOIs
Publication statusPublished - 15 Sept 2026
Event29th International Conference on Theory and Practice of Digital Libraries, TPDL 2025 - Tampere, Finland
Duration: 23 Sept 202526 Sept 2025

Publication series

NameLecture Notes in Computer Science
Volume16097 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Conference on Theory and Practice of Digital Libraries, TPDL 2025
Abbreviated titleTPDL 2025
Country/TerritoryFinland
CityTampere
Period23 Sept 202526 Sept 2025

Keywords

  • Federated Search
  • Key Term Extraction
  • Neuro-Symbolic Systems

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Cite this