UPM Institutional Repository

K-means clustering to improve the accuracy of decision tree response classification.


Citation

Ali, S. A. and Sulaiman , N. and Mustapha, Aida and Mustapha, Norwati (2009) K-means clustering to improve the accuracy of decision tree response classification. Information Technology Journal, 8 (8). pp. 1256-1262. ISSN 1812-5638

Abstract

The use of deep generation with statistical-based surface generation merits from response utterances readily available from corpus. Representation and quality of the instance data are the foremost factors that affect classification accuracy of the statistical-based method. Thus, in classification task, any irrelevant or unreliable tagging of response classes represented will result in low accuracy. This study focused on improving dialogue act classification of a user utterance into a response class by clustering the semantic and pragmatic features extracted from each user utterance. A Decision tree approach is used to classify 64 mixed-initiative, transaction dialogue corpus in theater domain. The experiment shows that by using clustering technique in pre-processing stage for re-tagging response classes, the Decision tree is able to achieve 97.5% recognition accuracy in classification, better than the 81.95% recognition accuracy when using Decision tree alone.


Download File

[img]
Preview
PDF (Abstract)
K.pdf

Download (85kB) | Preview

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.3923/itj.2009.1256.1262
Publisher: Asian Network for Scientific Information (ANSINET)
Keywords: Classification; Clustering; K-means; Decision tree; Natural language generation; Dialogue systems.
Depositing User: Ms. Nida Hidayati Ghazali
Date Deposited: 22 Jul 2013 00:55
Last Modified: 24 Nov 2015 04:25
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.3923/itj.2009.1256.1262
URI: http://psasir.upm.edu.my/id/eprint/15392
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item