Skip to main content Skip to main navigation

Publication

Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows

Peter Bourgonje; Julian Moreno Schneider; Georg Rehm; Felix Sasaki
In: Aldo Gangemi and Claire Gardent (Hrsg.). Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016). International Workshop on Natural Language Generation and the Semantic Web (WebNLG-16), located at INLG 2016, September 5-8, Edinburgh, United Kingdom, Pages 13-16, The Association for Computational Linguistics, 9/2016.

Abstract

We develop a system that operates on a document collection and represents the contained information to enable the intuitive and efficient exploration of the collection. Using various NLP, IE and Semantic Web methods, we generate a semantic layer on top of the collection, from which we take the key concepts. We define templates for structured reorganisation and rearrange the information related to the key concepts to fit the respective template. The use case of the system is to support knowledge workers (journalists, editors, curators, etc.) in their task of processing large amounts of documents by summarising the information contained in these documents and suggesting potential story paths that the knowledge worker can then process further.

Projekte

Weitere Links