French stemmer

Categories

Component ID

102208

Component name

French stemmer

Component type

module

Maintenance status

Development status

Component security advisory coverage

covered

Downloads

2016

Component created

Component changed

Component body

This module improves results of the search module for the french language.

It does so by using the Paice Husk stemmer algorithm and some extra process :

  • accents are removed,
  • 'au' 'eau' 'o' are interpreted as 'o',
  • double letters are reduced to one letter, 'tt', 'mm', 'nn' etc. are interpreted as 't', 'm', 'n' etc.,
  • depending on its following letters, 'c' is converted to 'k' or 's',
  • ...

Keys :

  • activated periodically by search and cron,
  • does not add new tables to your database.

This module has been developped for the french city of Rouen website.