Evolving Lucene search queries for text classification.
Proceedings of GECCO 2007: Genetic and Evolutionary Computation Conference.
(pp. 1604 - 1611).
We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks. Copyright 2007 ACM.
|Title:||Evolving Lucene search queries for text classification|
|UCL classification:||UCL > School of BEAMS > Faculty of Engineering Science > Computer Science|
Archive Staff Only