Corney, D.; Byrne, E.; Buxton, B.; Jones, D. (2008) A logical framework for template creation and information extraction. In: Lin, T.Y. and Xie, Y. and Wasilewska, A. and Liau, C.-J. (eds.) Data Mining: Foundations and Practice (pp. 79-108). Springer Verlag: Berlin/Heidelberg, Germany.
Abstract
Summary: Information extraction is the process of automatically identifying facts of interest in pieces of text, thereby transforming free text into a structured database. Past work has often been successful but ad hoc, and in this paper we propose a more formal basis from which to discuss information extraction. We introduce a framework that will allow researchers to compare their methods as well as their results, and will help to reveal new insights into information extraction and text mining practices. One problem in many information extraction applications is the creation of templates, which are textual patterns used to identify information of interest. Our framework describes formally what a template is and covers other typical information extraction tasks. We show how common search algorithms can be used to create and optimise templates automatically, using sequences of overlapping templates, and we develop heuristics that make this search feasible. Finally, we demonstrate a successful implementation of the framework and apply it to a typical biological information extraction task.
Type: | Book chapter |
---|---|
Title: | A logical framework for template creation and information extraction |
ISBN-13: | 9783540784876 |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1007/978-3-540-78488-3_5 |
Publisher version: | http://dx.doi.org/10.1007/978-3-540-78488-3_5 |
Language: | English |
Additional information: | The original publication is available at www.springerlink.com |
UCL classification: | UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Institute of Ophthalmology |
URI: | https://discovery.ucl.ac.uk/id/eprint/14371 |



