Context-Aware Stemming algorithm for semantically related root words

dc.contributor.authorAgbele, Kehinde K.
dc.contributor.authorAdesina, Ademola Olusola
dc.contributor.authorAzeez, Nureni A.
dc.contributor.authorAbidoye, Ademola P.
dc.date.accessioned2014-03-23T19:23:07Z
dc.date.available2014-03-23T19:23:07Z
dc.date.issued2012
dc.description.abstractThere is a growing interest in the use of context-awareness as a technique for developing pervasive computing applications that are flexible and adaptable for users. In this context, however, information retrieval (IR) is often defined in terms of location and delivery of documents to a user to satisfy their information need. In most cases, morphological variants of words have similar semantic interpretations and can be considered as equivalent for the purpose of IR applications. Consequently, document indexing will also be more meaningful if semantically related root words are used instead of stems. The popular Porter’s stemmer was studied with the aim to produce intelligible stems. In this paper, we propose Context-Aware Stemming (CAS) algorithm, which is a modified version of the extensively used Porter’s stemmer. Considering only generated meaningful stemming words as the stemmer output, the results show that the modified algorithm significantly reduces the error rate of Porter’s algorithm from 76.7% to 6.7% without compromising the efficacy of Porter’s algorithm.en_US
dc.identifier.citationAgbele, K.K. (2012). Context-Aware Stemming algorithm for semantically related root words. African Journal of Computing & ICT, 5(4): 33-42en_US
dc.identifier.issn2006-1781
dc.identifier.urihttp://hdl.handle.net/10566/1063
dc.language.isoenen_US
dc.privacy.showsubmitterFALSE
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE) Inc.en_US
dc.rightsCopyright © 2012 Agbele, et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
dc.status.ispeerreviewedTRUE
dc.subjectContext awarenessen_US
dc.subjectInformation retrievalen_US
dc.subjectStemmingen_US
dc.subjectPrecisionen_US
dc.subjectRecallen_US
dc.titleContext-Aware Stemming algorithm for semantically related root wordsen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AgbeleStemmingAlgorithms2012.pdf
Size:
291 KB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.55 KB
Format:
Item-specific license agreed upon to submission
Description: