UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

A logical framework for template creation and information extraction

Corney, D.; Byrne, E.; Buxton, B.; Jones, D.; (2008) A logical framework for template creation and information extraction. In: Lin, T.Y. and Xie, Y. and Wasilewska, A. and Liau, C.-J., (eds.) Data Mining: Foundations and Practice. (pp. 79-108). Springer Verlag: Berlin/ Heidelberg, Germany. Green open access

[thumbnail of 14371.pdf]
Preview
PDF
14371.pdf

Download (409kB)

Abstract

Summary: Information extraction is the process of automatically identifying facts of interest from pieces of text, and so transforming free text into a structured database. Past work has often been successful but ad hoc, and in this paper we propose a more formal basis from which to discuss information extraction. We introduce a framework which will allow researchers to compare their methods as well as their results, and will help to reveal new insights into information extraction and text mining practices. One problem in many information extraction applications is the creation of templates, which are textual patterns used to identify information of interest. Our framework describes formally what a template is and covers other typical information extraction tasks. We show how common search algorithms can be used to create and optimise templates automatically, using sequences of overlapping templates, and we develop heuristics that make this search feasible. Finally we demonstrate a successful implementation of the framework and apply it to a typical biological information extraction task.

Type: Book chapter
Title: A logical framework for template creation and information extraction
ISBN-13: 9783540784876
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-540-78488-3_5
Publisher version: http://dx.doi.org/10.1007/978-3-540-78488-3_5
Language: English
Additional information: The original publication is available at www.springerlink.com
UCL classification: UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Institute of Ophthalmology
URI: https://discovery.ucl.ac.uk/id/eprint/14371
Downloads since deposit
339Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item