
On the Limitations of Combining Sentiment Analysis Tools in a Cross-Platform Setting

  • Martin Obaidi*
  • Henrik Holm
  • Kurt Schneider
  • Jil Klünder

*Corresponding author for this work

Publication: Chapter in book/report/anthology/conference proceedings › Conference contribution › Research › Peer-reviewed

Abstract

A positive working climate is essential in modern software development. It enhances productivity since a satisfied developer tends to deliver better results. Sentiment analysis tools are a means to analyze and classify textual communication between developers according to the polarity of the statements. Most of these tools deliver promising results when used with test data from the domain they were developed for (e.g., GitHub). But the tools' outcomes lack reliability when used in a different domain (e.g., Stack Overflow). One possible way to mitigate this problem is to combine different tools trained in different domains. In this paper, we analyze a combination of three sentiment analysis tools in a voting classifier according to their reliability and performance. The tools are trained and evaluated using five existing polarity data sets (e.g., from GitHub). The results indicate that this kind of combination of tools is a good choice in the within-platform setting. However, a majority vote does not necessarily lead to better results when applied in a cross-platform setting. In most cases, the best individual tool in the ensemble is preferable. This is mainly due to the often large difference in performance of the individual tools, even on the same data set. However, this may also be due to the different annotated data sets.
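
The voting idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the label set, and the tie-breaking rule (falling back to "neutral" when no label has a strict majority) are assumptions made for illustration only.

```python
from collections import Counter


def majority_vote(predictions, fallback="neutral"):
    """Return the label chosen by a strict majority of tools, else `fallback`.

    The fallback rule is an assumption; the abstract does not say how
    three-way disagreements are resolved.
    """
    label, votes = Counter(predictions).most_common(1)[0]
    return label if votes > len(predictions) / 2 else fallback


def ensemble_predict(per_tool_predictions, fallback="neutral"):
    """Combine per-tool label lists (aligned by document) into one label list."""
    return [majority_vote(votes, fallback) for votes in zip(*per_tool_predictions)]


# Hypothetical outputs of three tools on three statements.
tool_a = ["positive", "negative", "neutral"]
tool_b = ["positive", "neutral", "negative"]
tool_c = ["negative", "negative", "positive"]

combined = ensemble_predict([tool_a, tool_b, tool_c])
```

With three tools and three polarity classes, all tools can disagree, so some tie-breaking rule is needed. Deferring to the best individual tool instead of a fixed label would be another reasonable choice.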
Original language: English
Title of host publication: Product-Focused Software Process Improvement
Editors: Davide Taibi, Marco Kuhrmann, Tommi Mikkonen, Pekka Abrahamsson, Jil Klünder
Place of publication: Cham
Publisher: Springer International Publishing AG
Pages: 108-123
Number of pages: 16
ISBN (electronic): 978-3-031-21388-5
ISBN (print): 978-3-031-21387-8
DOIs
Publication status: Published - 14 Nov 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13709 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
