Interfacing knowledge discovery algorithms to large database management systems
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @Article{Lavington:1999:IST,
-
author = "S. Lavington and N. Dewhurst and E. Wilkins and
A. Freitas",
-
title = "Interfacing knowledge discovery algorithms to large
database management systems",
-
journal = "Information and Software Technology",
-
volume = "41",
-
pages = "605--617",
-
year = "1999",
-
number = "9",
-
month = "25 " # jun,
-
note = "special issue on data mining",
-
keywords = "genetic algorithms, genetic programming, data mining,
KDD primitives, decision trees, client-server",
-
ISSN = "0950-5849",
-
URL = "http://www.sciencedirect.com/science/article/B6V0B-3WN7DYN-8/1/cdabdda09c085c6a4536aa5e116366ee",
-
DOI = "doi:10.1016/S0950-5849(99)00024-5",
-
size = "13 pages",
-
abstract = "The efficient mining of large, commercially credible,
databases requires a solution to at least two problems:
(a) better integration between existing Knowledge
Discovery algorithms and popular DBMS; (b) ability to
exploit opportunities for computational speedup such as
data parallelism. Both problems need to be addressed in
a generic manner, since the stated requirements of
end-users cover a range of data mining paradigms, DBMS,
and (parallel) platforms. In this paper we present a
family of generic, set-based, primitive operations for
Knowledge Discovery in Databases (KDD). We show how a
number of well-known KDD classification metrics, drawn
from paradigms such as Bayesian classifiers,
Rule-Induction/Decision Tree algorithms, Instance-Based
Learning methods, and Genetic Programming, can all be
computed via our generic primitives. We then show how
these primitives may be mapped into SQL and, where
appropriate, optimised for good performance in respect
of practical factors such as client-server
communication overheads. We demonstrate how our
primitives can support C4.5, a widely-used rule
induction system. Performance evaluation figures are
presented for commercially available parallel
platforms, such as the IBM SP/2.",
- }
Genetic Programming entries for
S Lavington
N Dewhurst
E Wilkins
Alex Alves Freitas
Citations