UCL Discovery

A logical framework for template creation and information extraction

Corney, D. and Byrne, E. and Buxton, B. and Jones, D. (2008) A logical framework for template creation and information extraction. In: Lin, T.Y. and Xie, Y. and Wasilewska, A. and Liau, C.-J. (eds.) Data Mining: Foundations and Practice. (pp. 79-108). Springer Verlag: Berlin/Heidelberg, Germany.

An open access version is available from UCL Discovery

Abstract

Information extraction is the process of automatically identifying facts of interest from pieces of text, and so transforming free text into a structured database. Past work has often been successful but ad hoc, and in this paper we propose a more formal basis from which to discuss information extraction. We introduce a framework which will allow researchers to compare their methods as well as their results, and will help to reveal new insights into information extraction and text mining practices. One problem in many information extraction applications is the creation of templates, which are textual patterns used to identify information of interest. Our framework describes formally what a template is and covers other typical information extraction tasks. We show how common search algorithms can be used to create and optimise templates automatically, using sequences of overlapping templates, and we develop heuristics that make this search feasible. Finally, we demonstrate a successful implementation of the framework and apply it to a typical biological information extraction task.

Type: Book chapter
Title: A logical framework for template creation and information extraction
ISBN-13: 9783540784876
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-540-78488-3_5
Publisher version: http://dx.doi.org/10.1007/978-3-540-78488-3_5
Language: English
Additional information: The original publication is available at www.springerlink.com
UCL classification: UCL > School of Life and Medical Sciences > Faculty of Brain Sciences > Institute of Ophthalmology
