WIT Press

Extraction of association rules using big data technologies

Price

Free (open access)

Volume

Volume 11 (2016), Issue 3

Pages

7

Page Range

178 - 185

Paper DOI

10.2495/DNE-V11-N3-178-185

Copyright

WIT Press

Author(s)

CARLOS FERNANDEZ-BASSO, M. DOLORES RUIZ & MARIA J. MARTIN-BAUTISTA

Abstract

The large amount of information stored by companies and the rise of social networks and the Internet of Things are producing exponential growth in the amount of data being produced. Data analysis techniques must therefore be improved to enable all this information to be processed. One of the most commonly used techniques for extracting information in the data mining field is that of association rules, which accurately represent the frequent co-occurrence of items in a dataset. Although several methods have been proposed for mining association rules, these methods do not perform well in very large databases due to high computational costs and lack of memory problems.

In this article, we address these problems by studying the current technologies for processing Big Data to propose a parallelization of the association rule mining process using Big Data technologies which implements an efficient algorithm that can handle massive amounts of data. This new algorithm is then compared with traditional association rule mining algorithms.

Keywords

Apriori, association rules, big data algorithms, data mining.