UPM Institutional Repository

A method to enrich domain ontology using synonym and probability theory


Citation

Mohd Rafei Heng, Nur Fatin Nabila (2016) A method to enrich domain ontology using synonym and probability theory. Doctoral thesis, Universiti Putra Malaysia.

Abstract

Ontology has become a popular topic of research for numerous areas of computer science, such as question answering, information retrieval, and use of the semantic web. Considerable efforts have been made in constructing ontologies due to the complexity and time-consuming nature of the task. Concept, taxonomy, and non-taxonomic relations are three important components in the development of ontology. These three components are used to represent the knowledge of the domain texts. Most of the existing techniques focus on extracting the concept, the taxonomic relations, and non-taxonomic relationships within a single sentence. These techniques neglect a sentence when either the subject or object of a sentence is missing or not clear. Thus, the knowledge of domain texts is not properly represented as some relations cannot be identified. This thesis proposes a solution for the enrichment of the knowledge of domain text by finding possible relations. The proposed method suggests the appropriate or the most likely term for an uncertain subject or object of a sentence using the probability theory. In addition, the method can extract the relations between concepts (i.e. subject and object) that appear not only in a single sentence, but also in different sentences by using a synonym of the predicates. The proposed method has been tested and evaluated with three collections of domain texts that describe computers, tourism, and science. Precision, recall, and f-score metrics have been used to evaluate the results of the experiments. The experiment results were compared with the results that were completed manually by the domain experts. For the computer dataset, an F-score value of 62.33% has been achieved using the proposed solution. Additionally, the science dataset achieved an F-score of 78.98%, whereas the tourism dataset achieved an F-score of 81.58%. The result shows that the proposed method has increased and enriched the relationships of domain texts thus providing better results compared to several existing methods. The method is shown to be useful to assist ontology engineer in conceptualization process of ontology engineering.


Download File

[img]
Preview
Text
FSKTM 2016 15 IR.pdf

Download (1MB) | Preview

Additional Metadata

Item Type: Thesis (Doctoral)
Subject: Ontologies (Information retrieval)
Subject: Computer Science
Call Number: FSKTM 2016 15
Chairman Supervisor: Assc. Prof. Ali bin Mamat, PhD
Divisions: Faculty of Computer Science and Information Technology
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 10 Jul 2019 03:54
Last Modified: 10 Jul 2019 03:54
URI: http://psasir.upm.edu.my/id/eprint/69348
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item