Lecture Overview |
Number and Type: |
181.130 VU WS 2012/13 |
Lecturer: |
Robert Baumgartner (exercises together with tutor Alexander Fischl) |
Selected Keywords: |
Information extraction, approaches, tools and methods for wrapper generation, web querying, data integration, XML |
Preliminary Meeting: |
Friday 5th of October, 16:00 (s.t.), EI 2 Pichlmayer HS |
Registration: |
Until 4th of October via TISS (limited participant number). Please de-register in TISS in case you decide not to take the course. ECML students who can not yet register please write me a message to reserve a place for you. |
Language: |
Slides in English, lecture language depending whether
non-german speaking students join |
Timetable: |
Selected Fridays 16:00-19:00 (see below for details; two exercise slots) |
Procedure: |
Lecture coupled with exercises and group work |
Topics: |
- Information Extraction: Setting, History, IE vs. IR
- Structured Data Extraction and Wrapping
- XML Transformation and Query Languages, DOM
- Web Wrapper Languages
- Wrapper Generation Tools
- Wrappers for Mashups, SOA and BI
- Inductive Wrapper Generation
- Automatic Data Extraction / Web Data Mining
- Supervised Wrapper Generation
- Deep Web Navigation Approaches
- Data Extraction from PDF documents
- Mediation and Integration Approaches
- Web Data Cleaning
- Lixto Visual Wrapper and Transformation Server
|
Fields of Study: |
This VU is a component of the curriculum of several master studies and is part of the European Master Programs Computational Logic. |