UPM Institutional Repository

Malay documents clustering algorithm based on singular value decomposition.


Ab Samat, Nordianah and Azmi Murad, Masrah Azrifah and Abdullah, Muhamad Taufik and Atan, Rodziah (2009) Malay documents clustering algorithm based on singular value decomposition. Journal of Theoretical and Applied Information Technology, 8 (2). pp. 180-186. ISSN 1992-8645


Document categorization is a widely researched area of information retrieval. A research on Malay natural language processing has been done up to the level of retrieving documents but not to the extent of automatic semantic categorization. Thus, an approach for the clustering of Malay documents based on semantic relations between words is proposed in this paper. The method described in this paper uses Singular Value Decomposition (SVD) technique for the vector representation of each document where familiar clustering techniques can be applied in this space. The experimental results we obtained taking into account the semantics of the document that performed good document clustering by obtaining relevant subjects appearing in a cluster.

Download File

PDF (Abstract)
Malay documents clustering algorithm based on singular value decomposition.pdf

Download (84kB) | Preview

Additional Metadata

Item Type: Article
Subject: Natural language processing (Computer science).
Subject: Malay language - Data processing.
Subject: Computer - Information theory.
Divisions: Faculty of Computer Science and Information Technology
Publisher: Asian Research Publishing Network (ARPN)
Keywords: Singular Value Decomposition (SVD); Latent Semantic Indexing (LSI); Document clustering; Malay natural language processing.
Depositing User: Nida Hidayati Ghazali
Date Deposited: 25 Jun 2013 03:53
Last Modified: 24 Nov 2015 06:40
URI: http://psasir.upm.edu.my/id/eprint/15515
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item