Publikation
Hybrid Machine Translation Architectures within and beyond the EuroMatrix project
Andreas Eisele; Christian Federmann; Hans Uszkoreit; Hervé Saint-Amand; Martin Kay; Michael Jellinghaus; Sabine Hunsicker; Teresa Herrmann; Yu Chen
In: J. Hutchins; W.V. Hahn (Hrsg.). Hybrid MT Methods in Practice: Their Use in Multilingual Extraction, Cross-Language Information Retrieval, Multilingual Summarization, and Applications in Hand-Held Devices.Proceedings of the European Machine Translation Conference. European Machine Translation Conference, September 22-23, Hamburg, Germany, Pages 27-34, HITEC e.V. 2008.
Zusammenfassung
This paper presents two hybrid architectures combining rule-based
and statistical machine translation (RBMT and SMT) technology. In
the first case, several existing MT engines are combined in a
multi-engine setup, and a decoder for SMT is used to select and
combine the best expressions proposed by different engines. The
other architecture uses lexical entries found using a combination
of SMT technology with shallow linguistic processing, which are then
included in a rule-based system.
The first architecture has been implemented in the framework of the
EuroMatrix project, where results from the recent evaluation
campaign give important insights into the strengths and weaknesses
of the approach relative to other participating RBMT and SMT
systems. The second architecture has been developed in
collaboration with the European Patent Office who uses the enhanced
RBMT system in the framework of their European Machine Translation
Project.