Task 8: Automata and query languages for semistructured data
Objectives
Main goal: A new approach to data on the web based on a new notion of automata, similar to tree automata, will be developed. On the practical side, automata concepts will be combined with database methods for analysing and improving existing XML query languages and automatic information extractors. We will also use verification techniques based on automata and logics for validating and comparing Document Type Definitions (DTDs). aspect and the practical impact of the network.
Background literature
[1] |
G. Gottlob and C. Koch, Monadic datalog and the expressive power of languages for Web
Information Extraction, in Proc. of PODS, 2002. [ BibTeX ] |
[2] |
R. Baumgartner, S. Flesca, and G. Gottlob, Visual web information extraction with
Lixto, in Proc. of VLDB, 2001. [ BibTeX ] |
[3] |
G. Gottlob and C. Koch, Monadic queries over tree-structured
data, in Proc. of LICS, 2002. [ BibTeX ] |
[4] |
G. Gottlob, N. Leone, and F. Scarcello, Hypertree decompositions: A
survey, in Proc. of MFCS, 2001. [ BibTeX ] |