Data Mining: A Database Perspective.
Free (open access)
M.S. Sousa, M.L.Q. Mattoso & N.F.F. Ebecken
Data mining on large databases has been a major concern in research com- munity, due to the difficulty of analyzing huge volumes of data using only traditional OLAP tools. This sort of process implies a lot of computa- tional power, memory and disk I/O, which can only be provided by parallel computers. We present a discussion of how database technology can be integrated to data mining techniques. Finally, we also point out several ad- vantages of addressing data consuming activities through a tight integration of a parallel database server and data mining techniques. 1 Introduction Data mining techniques have increasingly been studied^'^, espe- cially in their application in real-world databases. One typical prob- lem is that d