Evolved term-weighting schemes in information retrieval: An analysis of the solution space

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

14 Citations (Scopus)

Abstract

Evolutionary computation techniques are increasingly being applied to problems within Information Retrieval (IR). Genetic programming (GP) has previously been used with some success to evolve term-weighting schemes in IR. However, one fundamental problem with the solutions generated by this stochastic, non-deterministic process, is that they are often difficult to analyse. In this paper, we introduce two different distance measures between the phenotypes (ranked lists) of the solutions (term-weighting schemes) returned by a GP process. Using these distance measures, we develop trees which show how different solutions are clustered in the solution space.We show, using this framework, that our evolved solutions lie in a different part of the solution space than two of the best benchmark term-weighting schemes available.

Original languageEnglish
Pages (from-to)35-47
Number of pages13
JournalArtificial Intelligence Review
Volume26
Issue number1-2
DOIs
Publication statusPublished - Oct 2006

Keywords

  • Genetic programming
  • Information Retrieval
  • Term-weighting schemes

Fingerprint

Dive into the research topics of 'Evolved term-weighting schemes in information retrieval: An analysis of the solution space'. Together they form a unique fingerprint.

Cite this