WIT Press


A Data Mining Toolset For Distributed High-performance Platforms

Price

Free (open access)

Paper DOI

10.2495/DATA020051

Volume

28

Pages

Published

2002

Size

716 kb

Author(s)

M Cannataro, A Congiusta, D Talia & P Trunfio

Abstract

Today a large number of scientific and commercial applications often require to analyse large data sets maintained over geographically distributed sites by using the computational power of distributed high-performance environments. Advances in networking technology and computational infrastructure made it possible to construct large-scale distributed computing platforms, called computational grids, that provide dependable, consistent, and pervasive access to high-end computational resources. Grids can play a significant role in providing an effective computational support for distributed data mining applications. Currently we are developing a software system for geographically distributed knowledge discovery applications called KNOWLEDGE GRID, which is designed on top of computational grid mechanisms, provided by grid environments such as Glob us. In this paper we present an integrated toolset named VEGA (Visual Environment for Grid Applications), which allows a Knowledge Grid user to develop and execute distributed data mining computations in a simple and effective way. 1 Introduction In many industrial, scientific and commercial applications, it is often necessary to mine large distributed data sets by using the computational power of distributed high-performance computers. Advances in networking technology and computational infrastructure made it possible to design computational grids as large-scale distributed computing platforms that provide dependable, consistent, and pervasive access to high-end computational resources. The term

Keywords