keywords = "genetic algorithms, genetic programming, apache
lucene, text classification",
abstract = "We describe a method for generating accurate, compact,
human-understandable text classifiers. Text datasets
are indexed using Apache Lucene, and genetic programs
are used to construct Lucene search queries. Genetic
programs acquire fitness by producing queries that are
effective binary classifiers for a particular category
when evaluated against a set of training documents. We
describe a set of functions and terminals and provide
results from classification tasks.",
notes = "GECCO-2007 A joint meeting of the sixteenth
international conference on genetic algorithms
(ICGA-2007) and the twelfth annual genetic programming
conference (GP-2007).