Skip to main navigation Skip to search Skip to main content

DivQ: Diversification for keyword search over structured databases

  • Elena Demidova*
  • , Peter Fankhauser
  • , Xuan Zhou
  • , Wolfgang Nejdl
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abstract

Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.

Original languageEnglish
Title of host publicationSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages331-338
Number of pages8
DOIs
Publication statusPublished - 9 Jul 2010
Event33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010 - Geneva, Switzerland
Duration: 19 Jul 201023 Jul 2010

Publication series

NameSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010
Country/TerritorySwitzerland
CityGeneva
Period19 Jul 201023 Jul 2010

Keywords

  • Diversity
  • Query intent
  • Ranking in databases

ASJC Scopus subject areas

  • Information Systems

Cite this