Advances in Information Extraction
From Text and Image to Knowledge

Technical University of Kaiserslautern, Computer Science, Course: INF-71-60-V-6
Spring Semester 2014
Instructor: Dr. Daniel Sonntag, DFKI
Lecture time: Fridays, 14:30 - 16:00 and 16:30 - 18:00
(Not in room 42-110 (and 46-280))
Recitation time: Thursdays, 12:00 (noon), DFKI Kaiserslautern, room 2.04 BAYES
Next recitations: July 17 (sheet 4), July 24 (sheet 5)
Tutors: Takumi Toyama, Tigran Mkrtchyan
Credit points: 4 ECTS-CP (2C+1R)
Office hours: By appointment 

Description of the Course
Increased accessibility of textual information has led to a corresponding interest in technology for processing text automatically and extracting task-relevant information. Information extraction (IE) is the task of automatically extracting structured information from unstructured and/or semi-structured documents. This course gives an overview of several IE tasks, reviews existing approaches, and focuses on the state of the art in (1) processing human language texts by means of natural language processing (NLP) and (2) recent activities towards multimedia document processing (content extraction from texts, images, and videos), so that we can achieve content-based semantic access: bridging the Semantic Gap!
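As a toy illustration of what "structured information from unstructured text" means, the sketch below pulls time slots out of a free-text line and returns them as typed records. It is only a minimal rule-based example and not a method taught in the course; the names `LectureSlot`, `SLOT_PATTERN`, and `extract_slots` are hypothetical and introduced here for illustration.

```python
import re
from typing import List, NamedTuple


class LectureSlot(NamedTuple):
    """Structured record produced by the extractor (hypothetical schema)."""
    weekday: str
    start: str
    end: str


# Assumed pattern: two clock times separated by a hyphen, with optional spaces.
SLOT_PATTERN = re.compile(
    r"(?P<start>\d{1,2}:\d{2})\s*-\s*(?P<end>\d{1,2}:\d{2})"
)


def extract_slots(text: str, weekday: str) -> List[LectureSlot]:
    """Return one LectureSlot per time range found in the text."""
    return [
        LectureSlot(weekday, match.group("start"), match.group("end"))
        for match in SLOT_PATTERN.finditer(text)
    ]


# Unstructured input (free text) becomes a list of structured records.
slots = extract_slots("Lecture time: Fr., 14:30 - 16:00, 16:30 -18:00", "Friday")
for slot in slots:
    print(slot.weekday, slot.start, slot.end)
```

Hand-written patterns like this break as soon as the wording varies, which is one reason the course turns to linguistic preprocessing and machine learning.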

This is important because we have become overwhelmed by the sheer amount of multimedia material (big data) available to us. Content-based information access has changed: we now tackle information encoded in several media, that is, in multimedia. The major remaining challenges, and the recent advances in information extraction, most notably concern text extraction in combination with visual media, working towards a global knowledge graph!
 
Prerequisites
Methods and models of artificial intelligence. It helps to have taken one artificial intelligence (AI) class or another class in the master's programme (M.Sc.) curriculum "Intelligente Systeme", but this is not required.

Grading

Grading will be based on the assignments, the recitations (required prerequisite for final examination), and the written examination.

Major Topics
Related Computer Science Courses (University of Kaiserslautern, Fall 2013-2014)
Introduction to Artificial Intelligence, Prof. Lukowicz
Semantic Technologies, Prof. Dengel 
Natural Language Processing and Applications, Prof. Breuel

Related International Artificial Intelligence & NLP Courses
Literature
Course Schedules and Syllabus   
Approximate Schedule (subject to change)

Dates Topics Sources Exercise Sheets
1. (April 25) Introduction to (Multimedia) IE from an AI perspective [slides1]
2. (April 25) Overview of several IE tasks, including named entity recognition, co-reference resolution, relation extraction, image extraction, and video extraction [slides1]
3. (May 2) Reading List, see How-to-Read-an-IE-Research-Paper [Reading List] [Exercise Sheet 1] (due May 15)
4. (May 16) Linguistic (Pre)Processing (NLP) [slides2]
5. (May 16) Information Extraction from Biomedical Texts and Images; Invited Lecture 1, UIMA @ SIEMENS, Abstract [Exercise Sheet 2] (due May 29)
6. (May 23) Constituency and Dependency Parsing (NLP) [slides3]
7. (May 23) Multimedia Knowledge Capture in Ontologies and the Semantic Web, Multimedia Annotation and Authoring, Bridging the Semantic Gap [Special Exercise] (critiques due June 12)
8. (June 6) Automated Question Answering, Open Information Extraction at Web Scale [slides4]
9. (June 6) Recitation / Examination: Form for Critiques [Form for Critiques] [Exercise Sheet 3] (due June 19)
10. (June 20) Machine Learning in IE: integrating multimedia clustering and classification, precision/recall, ROC, ANOVA [ML Chapter] [Eval Chapter] [slides5] [slides5a]
11. (June 20) Multimedia Image and Affect/Activity Information Extraction, Special content-based techniques [slides5b] [Exercise Sheet 4] (due July 10)
12. (July 4) Social Multimedia Analysis and Opinion Mining, Crowdsourcing; Invited Lecture 2, IBM Watson [slides6]
13. (July 4) Applications and Projects, e.g., real-time information extraction (1) in cyber-physical environments (CPE) and smart factories, and (2) while reading a multimedia text using eye tracking [Exercise Sheet 5] (due July 17)
14. (August 1) Final written examination, 13:00 - 14:30 in 42-110

     
Last change: 07/07/2014