UPM Institutional Repository

Effective predicate identification algorithm for XML retrieval


Roko, Abubakar and C. Doraisamy, Shyamala and Nakone, Bello (2018) Effective predicate identification algorithm for XML retrieval. In: 2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP'18), 26-28 Mar. 2018, Le Méridien Kota Kinabalu, Sabah, Malaysia. (pp. 158-163).


Query structuring systems are keyword search systems recently used for effective retrieval of XML documents. Existing systems fail to put keyword query ambiguity problems into consideration during query preprocessing. Thus, the systems return irrelevant user search intentions. A search intention consists of entity nodes and predicate nodes of XML data. In this paper, an entity based query segmentation (EBQS) method which interprets a user query as a list of keywords and/or named entities to resolve ambiguity. Then, segment terms proximity scorer (STPS) that assigns relevance scores to XML fragments that contains query keywords is proposed. Fragments containing the keywords as interpreted by EBQS are assigned higher scores. Finally, an effective predicate identification algorithm (EPIA) which uses EBQS and STPS to return relevant predicates is introduced. The effectiveness of the algorithm is demonstrated through experimental performance study on some real world XML documents.

Download File

[img] Text (Abstract)
Effective predicate identification algorithm for XML retrieval.pdf

Download (5kB)

Additional Metadata

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1109/INFRKM.2018.8464696
Publisher: IEEE
Keywords: Query structuring; XML retrieval; Predicate node; Keyword query
Depositing User: Nabilah Mustapa
Date Deposited: 16 Jun 2020 06:38
Last Modified: 16 Jun 2020 06:38
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1109/INFRKM.2018.8464696
URI: http://psasir.upm.edu.my/id/eprint/36982
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item