What is it about?

Whether or not word sense disambiguation (WSD) can improve information retrieval (IR) results represents a topic that has been intensely debated over the years, with many inconclusive or contradictory conclusions. The most rarely used type of WSD for this task is the unsupervised one, although it has been proven to be beneficial at a large scale. Our study builds on existing research and tries to improve the most recent unsupervised method which is based on spectral clustering. It investigates the possible benefits of "helping'' spectral clustering through feature selection when it performs sense discrimination for IR. Results obtained so far, involving large data collections, encourage us to point out the importance of feature selection even in the case of this advanced, state of the art clustering technique that is known for performing its own feature weighting. By suggesting an improvement of what we consider the most promising approach to usage of WSD in IR, and by commenting on its possible extensions, we state that WSD still holds a promise for IR and hope to stimulate continuation of this line of research, perhaps at an even more successful level.

Featured Image

Why is it important?

As a result of conducting the described study, we believe that appropriate feature selection for spectral clustering, within an appropriately generated context, could further improve retrieval results, thus making the discussed method even more valuable in real life IR applications.

Perspectives

Writing this article meant pushing a little further the limits of my research during my PhD thesis. It was a great pleasure to work on this article with my co-author (and a mentor to me) with whom I have a long standing collaboration.

Adrian-Gabriel CHIFU
Aix-Marseille Universite

Read the Original

This page is a summary of: Feature selection for spectral clustering: to help or not to help spectral clustering when performing sense discrimination for IR?, Open Computer Science, December 2018, De Gruyter,
DOI: 10.1515/comp-2018-0021.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page