Skip to main navigation Skip to search Skip to main content

Community Knowledge about Security: Identification and Classification of User Contributions

Fabien Patrick Viertel*, Wasja Brunotte, Yannick Evers, Kurt Schneider

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abstract

Nowadays, confidential data of users and companies are processed by various software applications. Therefore, it is necessary to protect them against security flaws in source code, which could, for example, allow the infringement of privacy. However, developers are usually not equipped with the required expertise to fulfill this task. To their rescue, there are tools like security code clone detectors to disclose vulnerable methods in source code. They try to find clones of written project code and vulnerable code fragments stored in a reference repository. Existing vulnerability databases, for instance the National Vulnerability Database (NVD), contain data on reported weaknesses, but the availability of example code for their occurrence, patch and exploit is scarce. Developers also use community websites to find help for secure implementations. In this paper, we propose a semi-automated process to extract security-related code from the Stack Exchange community network, where also the coding community Stack Overflow belongs. We classify the obtained code through artificial intelligence combined with natural language processing into the three security types: vulnerable, patch or exploit. In a twofold evaluation, we compared both parts with the manual activity of security experts. At first, for the search, our approach shows better precision than the experts as well as a moderate recall. Secondly, the results show that the classification of code fragments in security types is not quite easy. The investigated approaches and security experts perform with different strength regarding types of security.

Original languageEnglish
Title of host publicationRisks and Security of Internet and Systems
Subtitle of host publication15th International Conference, CRiSIS 2020, Paris, France, November 4–6, 2020, Revised Selected Papers
EditorsJoaquin Garcia-Alfaro, Jean Leneutre, Nora Cuppens, Reda Yaich
PublisherSpringer, Cham
Pages181-197
Number of pages17
ISBN (Electronic)978-3-030-68887-5
ISBN (Print)978-3-030-68886-8
DOIs
Publication statusPublished - 12 Feb 2021
EventThe 15th International Conference on Risks and Security of Internet and Systems - Online, France
Duration: 3 Nov 20206 Nov 2020
https://www.crisis-conference.com/

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12528 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceThe 15th International Conference on Risks and Security of Internet and Systems
Abbreviated titleCRISIS
Country/TerritoryFrance
Period3 Nov 20206 Nov 2020
Internet address

Keywords

  • Artificial intelligence
  • Clone detection
  • Community knowledge
  • Security
  • Source code

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Cite this