An Overview of Knowledge Extraction Projects in the NLP group at Lund University

Pierre Nugues
Department of Computer Science, Lund University, Lund, Sweden

In: Digital Humanities 2016. From Digitization to Knowledge 2016: Resources and Methods for Semantic Processing of Digital Works/Texts, Proceedings of the Workshop, July 11, 2016, Krakow, Poland

Linköping Electronic Conference Proceedings 126:5, pp. 25--31

Published: 2016-07-08

ISBN: 978-91-7685-733-5

ISSN: 1650-3686 (print), 1650-3740 (online)


In this paper, I describe systems and prototypes that we created in the natural language processing group at Lund University to extract structured knowledge from text. Starting from syntactic and semantic parsing components, we developed applications that can handle large corpora, typically complete Wikipedia versions consisting of millions of documents, and process text to identify entities and the relations between them. I describe the overall goals of our projects, the data structure we designed to handle the documents, and three applications that extract knowledge from text.


