Citation
Abstract
Nowadays the growth of the datasets size causes some difficulties to extract useful information and knowledge especially in specific domains. However, new methods in data mining need to be developed in both sides of supervised and unsupervised approaches. Nevertheless, data stream clustering can be taken into account as an effective strategy to apply for huge data as an unsupervised fashion. In this research we not only propose a framework for data stream clustering but also evaluate different aspects of existing obstacles in this arena. The main problem in data stream clustering is visiting data once therefore new methods should be applied. On the other hand, concept drift must be recognized in real-time. In this paper, we try to clarify: first, the different aspects of problem with regard to data stream clustering generally and how several prominent solutions tackle different problems; second, the varying assumptions, heuristics, and intuitions forming the basis of approaches and finally a new framework for data stream clustering is proposed with regard to the specific difficulties encountered in this field of research.
Download File
Full text not available from this repository.
|
Additional Metadata
Item Type: | Book Section |
---|---|
Divisions: | Faculty of Computer Science and Information Technology |
DOI Number: | https://doi.org/10.1007/978-1-4614-3535-8_97 |
Publisher: | Springer |
Keywords: | Concept drifts; Data sets; Data stream clustering; Unsupervised approaches. |
Depositing User: | Suzila Mohamad Kasim |
Date Deposited: | 23 Jul 2014 01:44 |
Last Modified: | 23 Jul 2014 01:44 |
URI: | http://psasir.upm.edu.my/id/eprint/31331 |
Statistic Details: | View Download Statistic |
Actions (login required)
View Item |