Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

What is stemming?

XTM supports stemming for terminology wherein multiple forms of the same word are recognized. In other words, this is the process of reducing inflected (or sometimes derived) forms of a given term to their stem, base or root - generally a written word form. The current stemming implementation in XTM is based on Hunspell's dictionaries (http://hunspell.github.io).

Please, see the example below:


Guidelines

In order to activate the option in the XTM UI, you need to have a user with the Administrator role, and then got to: Configuration → Settings → Translation → Terminology → Terminology options → Highlight all term variants.

Once activated, if there is an occurrence of an inflected/derived form of a given term in the source file in Workbench and provided that its root form already exists in the terminology base, it will be highlighted:

A separate noteworthy case concerns compound words which are hyphenated, for example, color-coded. Let’s consider the following example:

We have got an English (USA) term color which we want to translate into an English (UK) variant: colour. For this purpose we created a relevant entry in the XTM UI terminology module:

  • Term: color English (USA);

  • Translation: colour English (UK).

  1. When a project with the term color-coded is created, and the option Highlight all term variants (stemming) is disabled, the word color will not be highlighted in blue (signifying an available translation) since XTM treats hyphenated compounds as one “word”, so in this case it would be a whole different word.

  2. When a project with the term color-coded is created, and the option Highlight all term variants (stemming) is enabled, the word color will be highlighted in blue (signifying an available translation) since XTM will properly recognize a common stemmed form for both words, which is color.

Please, see a couple of exemplary sentences in Workbench containing the word color in various combinations when stemming is enabled.

  • No labels