Related Information
Publication List
|
Data Mining
An increase in the speed of data mining algorithms can be achieved by improving
the efficiency of the underlying technologies. Query engines are key components
in many knowledge discovery systems and the appropriate use of query engines can
impact the performance of data mining algorithms. By taking advantage of
hypothesis generation patterns, queries, generated from the hypotheses, can be
evaluated more efficiently. Caching query results and using the cached results
to evaluate new queries with similar constraints reduces the complexity of query
evaluation and improves the performance of data mining algorithms. In a
multi-processor environment, distributing the query result caches can improve
the performance of parallel query evaluations. This idea has been used in the
ParDRI system and has resulted in significant improvements in the execution
times of ParDRI. |