DFKI-LT - Publications

The Lab's scientific and technological results are documented by numerous publications every year. Besides the most recent paper list, a search facility opens up access to all Lab publications since 1988. Co-authors affiliated with the Lab have their name linked up so that the publications by that author can be displayed by a single mouse click.

2017
 
Georg Rehm
Language Technologies for Multilingual Europe: Towards a Human Language Project. Strategic Research and Innovation Agenda
in: Georg Rehm (ed.):
11/2017
 
Viviana Cotik, Darío Filippo, Roland Roller, Hans Uszkoreit, Feiyu Xu
Annotation of Entities and Relations in Spanish Radiology Reports
in: Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Ivelina Nikolova, Irina Temnikova (eds.):
Proceedings of the International Conference Recent Advances in Natural Language Processing, Varna, Bulgaria, INCOMA Ltd. Shoumen, Bulgaria, 9/2017
 
Eleftherios Avramidis
Sentence-level quality estimation by predicting HTER as a multi-component metric
Second Conference on Machine Translation, Copenhagen, Denmark, Association for Computational Linguistics, 9/2017
 
Dagmar Gromann, Thierry Declerck
Hashtag Processing for Enhanced Clustering of Tweets
in: Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Ivelina Nikolova, Irina Temnikova (eds.):
Proceedings of the INTERNATIONAL CONFERENCE RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING 2017, Varna, Bulgaria, INCOMA Ltd, University of Wolverhampton and Bulgarian Academy of Sciences, Shoumen, Bulgaria, 9/2017
 
Rajen Chatterjee, M. Amin Farajian, Matteo Negri, Marco Turchi, Ankit Srivastava, Santanu Pal
Multi-source Neural Automatic Post-Editing: FBK's participation in the WMT 2017 APE shared task
Second Conference on Machine Translation, Pages 630-638, Copenhagen, Denmark, Association for Computational Linguistics, 9/2017
 
Thierry Declerck
Software Projects for Developing Digital Humanities Resources
in: Peggy Bockwinkel, Thierry Declerck, Sandra Kübler, Heike Zinsmeister (eds.):
Proceedings of the Workshop on Teaching NLP for Digital Humanities (Teach4DH 2017) volume 1918, Berlin, Germany, CEURS, DFKI, RWTH Aachen University, 9/2017
 
Robert Schwarzenberg, Leonhard Hennig, Holmer Hemsen
In-Memory Distributed Training of Linear-Chain Conditional Random Fields, with an Application to Fine-Grained Named Entity Recognition
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, Berlin, Germany, GSCL, 9/2017
 
Carole Tiberius, Thierry Declerck
A lemon Model for the ANW Dictionary
in: Iztok Kosem, Jelena Kallas, Carole Tiberius, Simon Krek, Milo¨ Jakubíček, Vít Baisa (eds.):
Proceedings of the eLex 2017 conference, Pages 237-251, Leiden, Netherlands, Lexical Computing CZ s.r.o., INT, Trojína and Lexical Computing, Brno, Czech Republic, 9/2017
 
Philippe Thomas, Leonhard Hennig
Twitter Geolocation Prediction using Neural Networks
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, Berlin, Germany, GSCL, 9/2017
 
Eleftherios Avramidis
QE::GUI – A Graphical User Interface for Quality Estimation
The Prague Bulletin of Mathematical Linguistics volume 109, Pages 51-60, Charles University, Prague, Czech Republic, 9/2017
 
Peggy Bockwinkel, Thierry Declerck, Sandra Kübler, Heike Zinsmeister (eds.)
Proceedings of the Workshop on Teaching NLP for Digital Humanities
volume 1918, Berlin, Germany, CEURS, RWTH Aachen University, 9/2017
 
Philippe Thomas, Johannes Kirschnick, Leonhard Hennig, Renlong Ai, Sven Schmeier, Holmer Hemsen, Feiyu Xu, Hans Uszkoreit
Streaming Text Analytics for Real-time Event Recognition
Proceedings of the International Conference Recent Advances in Natural Language Processing, Varna, Bulgaria, tbd, 9/2017
 
Antonio Jimeno Yepes, Aurelie Neveol, Mariana Neves, Karin Verspoor, Ondrej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas, Saskia Trescher
Findings of the WMT 2017 Biomedical Translation Shared Task
Proceedings of the Second Conference on Machine Translation volume 2: Shared Task Papers, Pages 234-247, Copenhagen, Denmark, Association for Computational Linguistics, 9/2017
 
Thierry Declerck, Antónia Kostová, Lisa Schäfer
Towards a Linked Data Access to Folktales classified by Thompson’s Motifs and Aarne-Thompson-Uther’s Types
Proceedings of Digital Humanities 2017, Montréal, QC, Canada, ADHO, 8/2017
 
Cristina España i Bonet, Alberto Barrón-Cedeño
Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity
International Workshop on Semantic Evaluation, Pages 144-149, Vancouver, BC, Canada, Association for Computational Linguistics, Association for Computational Linguistics, 8/2017
 
Pranava Swaroop Madhyastha, Cristina España i Bonet
Learning Bilingual Projections of Embeddings for Vocabulary Expansion in Machine Translation
Proceedings of the 2nd Workshop on Representation Learning for NLP, Pages 139-145, Vancouver, BC, Canada, Association for Computational Linguistics, Association for Computational Linguistics, 8/2017
 
Ankit Srivastava, Georg Rehm, Julian Moreno Schneider
Rumour Detection and Classification Using Cascading Heuristics
Proceedings of the 11th International Workshop on Semantic Evaluation, Vancouver, Canada, In print, 8/2017
 
Thierry Declerck, Carole Tiberius, Eveline Wandl-Vogt
Encoding lexicographic Data in lemon: Lessons learned
in: John P. McCrae, Francis Bond, Paul Buitelaar, Philipp Cimiano, Thierry Declerck, Jorge Gracia, Ilan Kernerman, Elena Montiel Ponsoda, Noam Ordan, Maciej Piasecki, Jan Wieczorek (eds.):
Proceedings of the LDK workshops: OntoLex, TIAD and Challenges for Wordnets, Galway, Ireland, CEURS, 8/2017
 
Dirk Weißenborn, Georg Wiese, Laura Seiffe
Making Neural QA as Simple as Possible but not Simpler
ACL, Vacnouver, BC, Canada, ACL, 8/2017
 
Georg Rehm, Jing He, Julian Moreno Schneider, Jan Nehring, Joachim Quantz
Designing User Interfaces for Curation Technologies
19th International Conference on Human-Computer Interaction -- HCI International 2017, Vancouver, Canada, In print, 7/2017
 
Thierry Declerck, Dagmar Gromann
Porting the xEBR Taxonomy to a Linked Open Data compliant Format
in: Maria Mora (ed.):
Proceedings of The Academic Track is part of the Eurofilling XBRL week, Frankfurt, Germany, CEURS, XBRL, 6/2017
 
Jörg Lehmann, Moritz Mittelbach, Sven Schmeier
Quantifizierung von Emotionswörtern in Texten
in: Mirjam Blümm, Thomas Kollatz, Stefan Schmunk, Christof Schöch (eds.):
6/2017
 
Eva Martínez Garcia, Carles Creus, Cristina España i Bonet, Lluís Màrquez
Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation
The Prague Bulletin of Mathematical Linguistics volume 108, Pages 85-96, DE GRUYTER OPEN, Warsaw, Poland, 6/2017
 
Thierry Declerck, Lisa Schäfer
Porting past Classification Schemes for Narratives to a Linked Data Framework
in: Apostolos Antonacopoulos, Marco Büchler (eds.):
Proceedings of DATeCH2017, Göttingen, Germany, ACM, 6/2017
 
Bernd Kiefer, Abraham Gebru Tesfay, Dietrich Klakow
Terrain Classification for Ground Robots Based on Acoustic Features
International Journal of Electrical, Computer, Energetic, Electronic and Communication Engineering volume 11 number 6, Pages 544-548, World Academy of Science, Engineering and Technology, 6/2017
 
Ankit Srivastava, Georg Rehm, Felix Sasaki
Improving Machine Translation through Linked Data
in: Ondřej Bojar, Alexander M. Fraser, Lucia Specia, Mikel L. Forcada (eds.):
The Prague Bulletin of Mathematical Linguistics volume 108, Pages 355-366, Charles University (Prague, Czech Republic), 6/2017
 
Vivien Macketanz, Eleftherios Avramidis, Aljoscha Burchardt, Jindrich Helcl, Ankit Srivastava
Machine Translation: Phrase-Based, Rule-Based and Neural Approaches with Linguistic Evaluation
Cybernetics and Information Technologies volume 17 number 2, Pages 28-43, De Gruyter, 6/2017
 
Eleftherios Avramidis
Comparative Quality Estimation for Machine Translation: Observations on machine learning and features
The Prague Bulletin of Mathematical Linguistics volume 108, Pages 307-318, Prague, Czech Republic, Charles University, Prague, Czech Republic, 5/2017
 
Johannes Kirschnick, Philippe Thomas
SIA: Scalable Interoperable Annotation Server
Proceedings of the BioCreative V.5 Challenge Evaluation Workshop, Pages 138-145, Barcelona, Spain, -, 4/2017
 
Hans Uszkoreit, Aleksandra Gabryszak, Leonhard Hennig, Jörg Steffen, Renlong Ai, Stephan Busemann, Jonathan Dehdari, Josef van Genabith, Georg Heigold, Nils Rethmeier, Raphael Rubino, Sven Schmeier, Philippe Thomas, He Wang, Feiyu Xu
Common Round: Application of Language Technologies to Large-Scale Web Debates
Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Pages 5-8, Valencia, Spain, Association for Computational Linguistics, 4/2017
 
Georg Heigold, Günter Neumann, Josef van Genabith
An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics 15th Conference of the European Chapter of the Association for Computational Linguistics- Proceedings of Conference, Volume 1: Long Papers volume 1, Long Papers, Pages 505-5013, Valencia, Spain, Association for Computational Linguistics (ACL), 4/2017
 
Ingmar Steiner
A DevOps Manifesto for Speech Corpus Management
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 160-166, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Ingmar Steiner, Sébastien Le Maguer, Judith Manzoni, Peter Gilles, Jürgen Trouvain
Developing new language tools for MaryTTS: the case of Luxembourgish
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 186-192, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Eran Raveh, Iona Gessinger, Sébastien Le Maguer, Ingmar Steiner, Bernd Möbius
Investigating Phonetic Convergence in a Shadowing Experiment with Synthetic Stimuli
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 254-261, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Arif Khan, Ingmar Steiner
Qualitative Evaluation and Error Analysis of Phonetic Segmentation
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 138-144, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Benjamin Weitz, Ingmar Steiner, Peter Birkholz
Gesture-Based Articulatory Text-to-Speech Synthesis
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 324-331, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Sébastien Le Maguer, Ingmar Steiner
Uprooting MaryTTS: Agile Processing and Voicebuilding
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 152-159, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Eric Malmi, Daniele Pighin, Sebastian Krause, Mikhail Kozhevnikov
Automatic Prediction of Discourse Connectives
Computing Research Repository eprint Journal volume abs/1702.00992, Pages 1-9, arXiv, 2/2017
 
Thierry Declerck, Sandra Kübler (eds.)
Proceedings of the Workshop on Corpora in the Digital Humanities
volume 1786, Pages 85, Bloomington, Indiana, USA, CEURS, 1/2017
 
Thierry Declerck
A Set of Annotations for supporting a TTS application for Folktales
in: Thierry Declerck, Sandra Kübler (eds.):
Proceedings of the Workshop on Corpora in the Digital Humanities volume 1786, Bloomington, Indiana, USA, CEURS, 1/2017
 
Jindrich Helcl, Jindřich Libovický
Neural Monkey: An Open-source Tool for Sequence Learning
Prague Bulletin of Mathematical Linguistics volume 107, Pages 1-11, Charles University, 2017
 
Sebastian Krause, Mikhail Kozhevnikov, Eric Malmi, Daniele Pighin
Redundancy Localization for the Conversationalization of Unstructured Responses
Proceedings of the 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Saarbrücken, Germany, Association for Computational Linguistics, 2017
 
Sébastien Le Maguer, Ingmar Steiner, Alexander Hewer
An HMM/DNN comparison for synchronized text-to-speech and tongue motion synthesis
Interspeech 2017, Pages 239-243, Stockholm, Sweden, ISCA, 2017
 
Peter Bourgonje, Julian Moreno Schneider, Georg Rehm
From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles
in: Octavian Popescu, Carlo Strapparava (eds.):
Proceedings of Natural Language Processing meets Journalism, Copenhagen, Denmark, Association for Computational Linguistics, 2017
 
Julian Moreno Schneider, Peter Bourgonje, Georg Rehm
Towards User Interfaces for Semantic Storytelling
19th International Conference on Human-Computer Interaction -- HCI International 2017, Vancouver, Canada, In print, 2017
 
Georg Wiese, Dirk Weißenborn, Mariana Neves
Neural Domain Adaptation for Biomedical Question Answering
ACL, ACL, 2017
 
Georg Rehm, Julian Moreno Schneider, Peter Bourgonje, Ankit Srivastava, Jan Nehring, Armin Berger, Luca König, Sören Räuchle, Jens Gerth
Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters.
in: Tommaso Caselli, Ben Miller, Tommaso Caselli, Ben Miller, Marieke van Erp, Piek Vossen, Martha Palmer, Eduard Hovy, Teruko Mitamura (eds.):
Proceedings of the Events and Stories in the News Workshop, Pages 42-51, Vancouver, BC, Canada, Association for Computational Linguistics, 2017
 
Eran Raveh, Ingmar Steiner, Bernd Möbius
A Computational Model for Phonetically Responsive Spoken Dialogue Systems
Interspeech 2017, Pages 884-888, Stockholm, Sweden, ISCA, 2017
 
Kimberley Harris, Aljoscha Burchardt
Improving Machine Translation: The Gap Between Research Approaches and Industry Needs
in: Jörg Porsiel (ed.):
Machine Translation, BDÜ Weiterbildungs- und Fachverlagsgesellschaft mbH, Berlin, 2017
 
Julian Moreno Schneider, Ankit Srivastava, Peter Bourgonje, David Wabnitz, Georg Rehm
Semantic Storytelling, Cross-lingual Event Detection and other Semantic Services for a Newsroom Content Curation Dashboard
in: Octavian Popescu, Carlo Strapparava (eds.):
Proceedings of Natural Language Processing meets Journalism, Copenhagen, Denmark, Association for Computational Linguistics, 2017
 
Peter Bourgonje, Julian Moreno Schneider, Georg Rehm
Digital Curation Technologies for Forensic Linguistics
13th Biennial Conference of the International Association of Forensic Linguists, Porto, Portugal, In print, 2017
 
Aljoscha Burchardt, Marco Pennacchiotti
FATE: Annotating a Textual Entailment Corpus with FrameNet
Handbook of Linguistic Annotation, Pages 1101-1118, Springer Netherlands, Dordrecht, 2017
 
Georg Rehm, Thierry Declerck (eds.)
Language Technologies for the Challenges of the Digital Age: Proceedings of the GSCL Conference 2017
Berlin, Germany, Springer, Berlin, 2017
 
Iona Gessinger, Eran Raveh, Sébastien Le Maguer, Bernd Möbius, Ingmar Steiner
Shadowing Synthesized Speech -- Segmental Analysis of Phonetic Convergence
Interspeech 2017, Pages 3797-3801, Stockholm, Sweden, ISCA, 2017
 
Arle Lommel, Aljoscha Burchardt
Quality Management for Translation
in: Jörg Porsiel (ed.):
Machine Translation, BDÜ Weiterbildungs- und Fachverlagsgesellschaft mbH, Berlin, 2017
 
Daniel Zeman, Martin Popel, Milan Straka, Jan Hajic, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gokirmak, Anna Nedoluzhko, Silvie Cinkova, Jan Hajic jr., Jaroslava Hlavacova, Václava Kettnerová, Zdenka Uresova, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher D. Manning, Sebastian Schuster, Dima Taji Siva Reddy, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria dePaiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonca, Tatiana Lando, Rattima Nitisaroj, Josie Li
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Pages 1-19, Vancouver, BC, Canada, Association for Computational Linguistics, 2017
 
Boyuan Deng, Denis Jouvet, Yves Laprie, Ingmar Steiner, Aghilas Sini
Towards Confidence Measures on Fundamental Frequency Estimations
42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Pages 5605-5609, New Orleans, Louisiana, USA, IEEE, IEEE, 2017
 
Georg Rehm
An Infrastructure for Empowering Internet Users to handle Fake News and other Online Media Phenomena
in: Georg Rehm, Thierry Declerck (eds.):
Language Technologies for the Challenges of the Digital Age: Proceedings of the GSCL Conference 2017, Berlin, Springer, Berlin, 2017
 
Anne Beyer, Vivien Macketanz, Aljoscha Burchardt, Philip Williams
Can Out-of-the-box NMT Beat a Domain-trained Moses on Technical Data?
The 20th Annual Conference of the European Association for Machine Translation, Pages 41-46, Prague, Czech Republic, Charles University, Faculty of Mathematics and Physics, Charles University, Malostranské náměstí 25, 11800 Prague 1, Czech Republic, 2017
 
Sébastien Le Maguer, Ingmar Steiner
The ``Uprooted'' MaryTTS Entry for the Blizzard Challenge 2017
Blizzard Challenge 2017, Stockholm, Sweden, ISCA, 2017
 
Ankit Srivastava, Sabine Weber, Peter Bourgonje, Georg Rehm
Different German and English Coreference Resolution Models for Multi-Domain Content Curation Scenarios
in: Georg Rehm, Thierry Declerck (eds.):
Language Technologies for the Challenges of the Digital Age: Proceedings of the GSCL Conference 2017, Berlin, Germany, Springer, Berlin, 2017
 
Peter Bourgonje, Julian Moreno Schneider, Georg Rehm
Semantically Annotating heterogeneous Document Collections - Curation Technologies for Digital Humanities and Text Analytics.
CUTE Workshop 2017 -- CRETA Unshared Task zu Entitätenreferenzen. Workshop bei DHd2017, Bern, Switzerland, In print, Digital Humanities im deutschsprachigen Raum e. V., Universität Trier FB II/ Trier Center for Digital Humanities D-54286 Trier Deutschland, 2017
 
Aljoscha Burchardt, Vivien Macketanz, Jonathan Dehdari, Georg Heigold, Jan-Thorsten Peter, Philip Williams
A Linguistic Evaluation of Rule-Based, Phrase-Based, and Neural MT Engines
The Prague Bulletin of Mathematical Linguistics volume 108 number 1, Pages 159-170, De Gruyter Open, 2017
 
Laura Frädrich, Fabrizio Nunnari, Maria Staudte, Alexis Heloir
Simulating Listener Gaze and Evaluating Its Effect on Human Speakers
in: Jonas Beskow, Christopher Peters, Ginevra Castellano, Carol O'Sullivan, Iolanda Leite, Stefan Kopp (eds.):
Intelligent Virtual Agents: 17th International Conference, IVA 2017, Proceedings, Pages 156-159, Stockholm, Sweden, Springer International Publishing, 2017
 
Philip Hake, Peter Fettke, Günter Neumann, Peter Loos
Extracting Business Objects and Activities from Labels of German Process Models
in: Alexander Maedche, Jan vom Brocke, Alan Hevner (eds.):
Designing the Digital Transformation volume 10243,
Lecture Notes in Computer Science, Pages 21-38, Karlsruhe, Germany, Springer International Publishing, 2017
 
Kathrin Eichler, Feiyu Xu, Hans Uszkoreit, Sebastian Krause
Generating Pattern-Based Entailment Graphs for Relation Extraction
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017), Vancouver, BC, Canada, Association for Computational Linguistics, 2017
 
Eran Raveh, Ingmar Steiner
A Phonetic Adaptation Module for Spoken Dialogue Systems
in: Volha Petukhova, Ye Tian (eds.):
21st Workshop on the Semantics and Pragmatics of Dialogue (SemDial), Pages 162-163, Saarbrücken, Germany, ACL, 2017
 
Jurica Seva, Madeleine Kittner, Roland Roller, Ulf Leser
Multi-lingual ICD-10 Coding using a Hybrid rule-based and Supervised Classification Approach at CLEF eHealth 2017
Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum volume 1866,
CEUR Workshop Proceedings, Dublin, Ireland, CEUR-WS.org, 2017
 
Peter Bourgonje, Julian Moreno Schneider, Georg Rehm
Automatic Classification of Abusive Language and Personal Attacks in Various Forms of Online Communication
in: Georg Rehm, Thierry Declerck (eds.):
Language Technologies for the Challenges of the Digital Age: Proceedings of the GSCL Conference 2017, Berlin, Germany, Springer, Berlin, 2017