Direkt zum Inhalt | Direkt zur Navigation
Bitte beziehen Sie sich beim Zitieren dieses Dokumentes immer auf folgende
URN: urn:nbn:de:bsz:25-opus-280
URL: http://www.freidok.uni-freiburg.de/volltexte/28/
|
Technical interpretation is usually done on documents having a certain kind of structure. In the overall
layout of such documents components belonging together are grouped and hence geometrically
separated from other parts. The rules used for arranging components on
documents depend on the document's domain, the author's cultural background and many other things.
In order to extract the meaning of such document parts, the creation rules have to be known.
Therefore, a graph representation of structured documents as well as domains of such documents
has been developed, whereby the nodes stand for the components and the edges represent weighted
qualitative spatial relations among nodes. These relations are derived from Allen's qualitative
relations among time intervals. The representation of domains can be derived automatically from
sets of labeled documents.
From a learned models, consistent scenarios can be extracted and then be visualized. This makes it
more easy for the user to examine learned models. Models of domains can then be used to assign
labels to unseen documents by means of a heuristic search for inexact subgraph isomorphisms
between a model graph and a document graph.
| SWD-Schlagwörter: | Maschinelles Lernen , Qualitative Methode , Dokument Interpretation | |
| Freie Schlagwörter (englisch): | machine learning , qualitative spatial representation , document layout interpretation | |
| CCS Klassifikation | I.5 | |
| Institut: | Institut für Informatik | |
| Fakultät: | Fakultät für Angewandte Wissenschaften (bis Sept. 2002) | |
| DDC-Sachgruppe: | Informatik | |
| Dokumentart: | Dissertation | |
| Erstgutachter: | Prof. Dr. Bernhard Nebel | |
| Sprache: | Deutsch | |
| Tag der mündlichen Prüfung: | 22.12.1999 | |
| Erstellungsjahr: | 1999 | |
| Publikationsdatum: | 23.02.2000 |