Citation
Abstract
Morphological analysis is used to study the internal structure words by reducing the number of vocabularies used while retaining the semantic meaning of the knowledge in NLP system. Most of the existing algorithms are focusing on stemmatization instead of lemmatization process. Even with technology advancement, yet none of the available lemmatization algorithms able to produce 100 % accurate result. The base words produced by the current algorithm might be unusable as it alters the overall meaning it tried to represent, which will directly affect the outcome of NLP systems. This paper proposed a new method to handle lemmatization process during the morphological analysis. The method consists three layers of lemmatization process, which incorporate the used of Stanford parser API, WordNet database and adaptive learning technique. The lemmatized words yields from the proposed method are more accurate, thus it will improve the semantic knowledge represented and stored in the knowledge base.
Download File
Full text not available from this repository.
|
Additional Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Divisions: | Faculty of Computer Science and Information Technology |
DOI Number: | https://doi.org/10.1007/978-3-319-17530-0_24 |
Publisher: | Springer International Publishing |
Keywords: | Lemmatization; Morphology analysis; Natural language processing; Adaptive learning; Semi-supervised learning |
Depositing User: | Nursyafinaz Mohd Noh |
Date Deposited: | 03 Sep 2015 03:27 |
Last Modified: | 03 Sep 2015 03:27 |
Altmetrics: | http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1007/978-3-319-17530-0_24 |
URI: | http://psasir.upm.edu.my/id/eprint/40308 |
Statistic Details: | View Download Statistic |
Actions (login required)
View Item |