UPM Institutional Repository

Document ranking using information quality criteria in weblog search engine


Citation

Azimzadeh, Fatemeh (2013) Document ranking using information quality criteria in weblog search engine. PhD thesis, Universiti Putra Malaysia.

Abstract

Social media has revolutionized the Web industry. The weblog is, fundamentally, an innovation in personal publishing, and it has engendered a new form of social interaction on the Web. Because much firsthand information is recorded in blog posts, more and more people search blog sites for the information they want. A major problem is that a weblog includes features not found in traditional Web pages, such as posts, links, tags, and comments. Traditional ranking algorithms used in general search engines, such as PageRank and HITS, are therefore not appropriate for evaluating weblog posts, because they do not consider these blog-specific features. At the same time, information quality criteria are important to users. Weblogs contain unfiltered information that has not undergone expert peer review, so users expect search engines to deliver quality information for their queries. Few frameworks consider information quality criteria in weblog search engines. This thesis establishes an integrated framework that incorporates information quality criteria into the ranking function of a search engine for Persian weblogs. The framework ranks weblogs and posts based on selected information quality criteria, and the resulting ranking scores are then merged with relevancy in the search engine. A ranking method is developed for the weblog search engine in which the post is treated as the retrieved document. The thesis proposes two ranking functions that combine relevancy with the information quality criteria and compares them with a PageRank-based ranking function. The results reveal that combining quality criteria with relevancy without a suitable weight for each does not lead to user satisfaction. Instead, applying proper weights to both the information quality factors and relevancy noticeably improves the search engine's results and consequently leads to user satisfaction.
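
The abstract does not state the form of the ranking functions, so the following is only a minimal sketch of the general idea of merging a quality score with relevancy under explicit weights. It assumes a weighted linear combination and scores normalized to [0, 1]; the names `Post`, `combined_score`, and `rank_posts`, the sample values, and the weights are all hypothetical and are not taken from the thesis.

```python
from dataclasses import dataclass


@dataclass
class Post:
    post_id: str
    relevancy: float  # query relevancy, assumed normalized to [0, 1]
    quality: float    # aggregated information quality score, assumed in [0, 1]


def combined_score(post: Post, w_relevancy: float, w_quality: float) -> float:
    """Weighted linear combination of relevancy and quality (illustrative assumption)."""
    total = w_relevancy + w_quality
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Dividing by the weight sum keeps the result in [0, 1].
    return (w_relevancy * post.relevancy + w_quality * post.quality) / total


def rank_posts(posts, w_relevancy=0.7, w_quality=0.3):
    """Return posts sorted by combined score, highest first."""
    return sorted(
        posts,
        key=lambda p: combined_score(p, w_relevancy, w_quality),
        reverse=True,
    )


# Hypothetical sample data: one highly relevant but low-quality post,
# and one less relevant but high-quality post.
posts = [
    Post("a", relevancy=0.9, quality=0.2),
    Post("b", relevancy=0.7, quality=0.8),
]

print([p.post_id for p in rank_posts(posts, w_relevancy=0.9, w_quality=0.1)])
print([p.post_id for p in rank_posts(posts, w_relevancy=0.5, w_quality=0.5)])
```

With these sample values the order of the two posts changes between the two weight settings, which illustrates why the choice of weights matters for the final ranking.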



Additional Metadata

Item Type: Thesis (PhD)
Subject: Information services - Quality control
Subject: Search engines
Call Number: FK 2013 4
Chairman Supervisor: Associate Professor Abd Rahman Ramli, PhD
Divisions: Faculty of Engineering
Depositing User: Haridan Mohd Jais
Date Deposited: 18 Jan 2016 08:39
Last Modified: 18 Jan 2016 08:50
URI: http://psasir.upm.edu.my/id/eprint/38937