Master class, part of the BCN Summer School, July 1-12, 1996
lectures by (among others)
GIORGIO SATTA
(University of Padua)
ATRO VOUTILAINEN
(University of Helsinki)
For any kind of mechanical processing of input, in whatever form, some kind of finite-state process is involved. Much theory has already been developed during the early days of computer science, much of it however very abstract, or at least not readily applicable to processing of natural language. The last few years have seen a surge of interesting publications that close the gap between the theory of finite-state techniques and the practice of computational linguistics.
During the course of our master class, students will be made familiar with these new developments. Three prominent researchers will discuss a wide range of topics, including some ideas just emerging in this field. Apart from the lectures by the three invited speakers, some ongoing research at the Humanities Computing Department (Alfa Informatica) of the University of Groningen will be discussed, by Gertjan van Noord and Mark-Jan Nederhof. To make the course accessible to students without any prior knowledge of finite-state techniques, we will start with some introductory lectures on formal language theory, finite-state automata and transducers, regular languages, rational transductions, etc.
A ``reader'' with papers from the speakers will become available during the event, price: DFL 5.
The master class will be held in the Educational Center of the University Hospital (AZG); room 15 (use the entrance at the Oostersingel and follow the signs to number 59). It comprises 5 sessions, each from 9:00 to 12:00. The schedule is:
The registration fee for the master class also covers the other events of the summer school.
For more information concerning this master class, contact the coordinator
Mark-Jan NederhofFor registration and for more information concerning the summer school of which this master class is part, click here or contact
University of Groningen
Faculty of Arts
P.O. Box 716
NL-9700 AS Groningen
The Netherlands
markjan@let.rug.nl
Tel. +31-50-3635970
Fax. +31-50-3636855
Office of Graduate School BCN
Nijenburgh 4
NL-9747 AG Groningen
The Netherlands
bureau@bcn.rug.nl
Tel. +31-50-3634734
Fax. +31-50-3634740
We present the paradigm of transformation-based parsing (Brill, 1993) and develop efficient parsing algorithms based on finite state tree automata.
This lecture will cover the following topics. Top-down tree automata, bottom-up tree automata. Tree regular expressions. Tree automata and tree pattern matching algorithms. Precomputation of tree transformations into tree automata. Transformation-based parsing algorithms. Overlapping redexes.
For the slides in postscript, see here.
We present the notion of constraint ranking as developed by recent phonological theories. Under the assumption that constraints are represented through regular expressions, we develop finite state transducer implementations of these theories.
This lecture will cover the following topics. Optimality Theory (Prince and Smolensky 1993). Constraint ranking and constraint violability. Optimality systems. Conditional intersection of regular languages. Computation of constraint violability through finite state transducers.
For the slides in postscript, see here.
These lectures (3 hours) outline recent work on FS parsing in Helsinki (Koskenniemi, Tapanainen, Voutilainen). Most of the attention is given to linguistic rather than algorithmic issues.
For the slides in postscript, see here.
I will illustrate the FSA Utilities toolbox. The FSA Utilities toolbox is a Prolog implementation of a number of utilities to manipulate finite-state automata and finite-state transducers. Manipulations include determinization (both for finite-state recognizers and finite-state transducers), minimalization, composition, complementation, intersection, Kleene closure, etc. Furthermore various visualization tools are available to browse finite-state automata. The toolbox is implemented in SICStus Prolog.
The FSA Utilities toolbox is available under GNU General Public License from anonymous ftp. Alternatively you can obtain the toolbox via the World-wide Web. Click here or see ftp://ftp.let.rug.nl/pub/prolog-app/FSA/.