Description
Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources The STEM ECR v1.0 dataset has been developed to provide a benchmark for the evaluation of scientific entity extraction, classification, and resolution tasks in a domain-independent fashion. It comprises annotations for scientific entities in scientific Abstracts drawn from 10 disciplines in Science, Technology, Engineering, and Medicine. The annotated entities are further grounded to Wikipedia and Wiktionary, respectively.
What this repository contains?
The dataset is organized in the following folders:
* Scientific Entity Annotations: Contains annotations for Process, Material, Method, and Data scientific entities in the STEM dataset.
* Scientific Entity Resolution: Annotations for the STEM dataset scientific entities with Entity Linking (EL) annotations to Wikipedia and Word Sense Disambiguation (WSD) annotations to Wiktionary.
What this repository contains?
The dataset is organized in the following folders:
* Scientific Entity Annotations: Contains annotations for Process, Material, Method, and Data scientific entities in the STEM dataset.
* Scientific Entity Resolution: Annotations for the STEM dataset scientific entities with Entity Linking (EL) annotations to Wikipedia and Word Sense Disambiguation (WSD) annotations to Wiktionary.
| Date made available | 2020 |
|---|---|
| Publisher | Forschungsdaten-Repositorium der LUH |
Research output
- 1 Conference contribution
-
The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources
D'Souza, J., Hoppe, A., Brack, A., Jaradeh, M. Y., Auer, S. & Ewerth, R., May 2020, LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings. Calzolari, N., Bechet, F., Blache, P., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 2192-2203 12 p. (LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings).Research output: Chapter in book/report/conference proceeding › Conference contribution › Research
Cite this
- DataSetCite