UPM Institutional Repository

Temporal integration based factorization to improve prediction accuracy of collaborative filtering


Citation

Al-Qasem, Al-Hadi Ismail Ahmed (2016) Temporal integration based factorization to improve prediction accuracy of collaborative filtering. Doctoral thesis, Universiti Putra Malaysia.

Abstract

A recommender system provides users with personalized suggestions for items based on the user’s behaviour history. These systems often use the collaborative filtering (CF) for analysing the users’ preferences for items in the rating matrix. The rating matrix typically contains a high percentage of unknown rating scores which is called the data sparsity problem. The data sparsity problem has been solved by several approaches such as Bayesian probabilistic, machine learning, genetic algorithm, particle swarm optimization and matrix factorization. The matrix factorization approach through temporal approaches has the accurate performance in addressing the data sparsity problem but still with low accuracy. The existing temporal-based factorization approaches used the long-term preferences and the short-term preferences. The difference between long-term preferences is that it utilizes the whole recorded preferences while the short-term preferences utilizes the recorded preferences within a session (e.g. week, month, season, etc.). However, there are four issues when a factorization approach is adopted which are latent feedback learning, score overfitting, user’s interest drifting and item’s popularity decay over time. This study proposes three approaches which are (i) the Ensemble Divide and Conquer (EDC) which achieved accurate latent feedback learning, (ii) two personalized matrix factorization (MF) based temporal approaches, namely the LongTemporalMF and ShortTemporalMF to solve overfitting during the optimization process, user’s interest drifting and item’s popularity decays over time and (iii) TemporalMF++ approach which solved all the issues. The TemporalMF++ approach relies on the k-means algorithm and the bacterial foraging optimization algorithm. The Root Mean Squared Error metric is used to evaluate the prediction accuracy. The factorization approaches such as the Singular Value Decomposition, Baseline, Matrix Factorization and Neighbours based Baseline are used to be compared against the proposed approaches. In addition, the Temporal Dynamics, Short-Term based Latent, Short-Term based Baseline, Long-Term, and Temporal Interaction approaches are used to benchmark the proposed approaches. The MovieLens, Epinions, and Netflix Prize are real-world datasets which are used in the experimental settings. The experimental results show the TemporalMF++ approach is higher prediction accuracy compared to the approaches of EDC, LongTemporalMF, and ShortTemporalMF. In addition, the TemporalMF++ approach has a prediction accuracy higher than the benchmark approaches of factorization and temporal. In summary, the TemporalMF++ approach has a superior effectiveness in improving the accuracy prediction of the CF by learning the temporal behaviour.


Download File

[img]
Preview
Text
FSKTM 2016 40 IR.pdf

Download (1MB) | Preview

Additional Metadata

Item Type: Thesis (Doctoral)
Subject: Recommender systems (Information filtering)
Call Number: FSKTM 2016 40
Chairman Supervisor: Nurfadhlina Mohd Sharef, PhD
Divisions: Faculty of Computer Science and Information Technology
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 10 Jul 2019 01:08
Last Modified: 10 Jul 2019 01:08
URI: http://psasir.upm.edu.my/id/eprint/69372
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item