Shaker, Mahmoud and Ibrahim, Hamidah and Mustapha, Aida and Abdullah, Lili Nurliyana (2010) A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. Journal of Next Generation Information Technology, 1 (3). pp. 106-114. ISSN 2092-8637
Full text not available from this repository.
Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study.
|Keyword:||Information Extraction, Semi-Structured, Web Data Sources|
|Subject:||Information storage and retrieval systems.|
|Subject:||Natural language processing (Computer science)|
|Subject:||Interactive computer systems.|
|Faculty or Institute:||Faculty of Computer Science and Information Technology|
|Deposited By:||Umikalthom Abdullah|
|Deposited On:||24 Nov 2011 12:50|
|Last Modified:||24 Nov 2011 12:50|
Repository Staff Only: item control page