UPM Institutional Repository

A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources


Citation

Shaker, Mahmoud and Ibrahim, Hamidah and Mustapha, Aida and Abdullah, Lili Nurliyana (2010) A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. Journal of Next Generation Information Technology, 1 (3). pp. 106-114. ISSN 2092-8637

Abstract

Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study.


Download File

Full text not available from this repository.

Additional Metadata

Item Type: Article
Subject: Information storage and retrieval systems.
Subject: Natural language processing (Computer science)
Subject: Interactive computer systems.
Divisions: Faculty of Computer Science and Information Technology
Keywords: Information Extraction, Semi-Structured, Web Data Sources
Depositing User: Umikalthom Abdullah
Date Deposited: 24 Nov 2011 04:50
Last Modified: 24 Nov 2011 04:50
URI: http://psasir.upm.edu.my/id/eprint/12693
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item