...
Page Properties | ||
---|---|---|
| ||
Introduction
You might sometimes wonder why there is a discrepancy between the number of words in the source file and XTM Cloud Metrics after project creation. This is because each CAT Tool or any other application operates according to its own set of rules. Wordcount is not exempted from such rules and tends to differ across various software. For example, whereas one application might count an email address as one word, another one can count it as three words. Wordcount in XTM Cloud is therefore calculated “on its own terms".
...
Note |
---|
IMPORTANT! Keep in mind that the vast majority of elements you want to be extracted for translation can, in fact, be set up either by you or our XTM International Support team. To find out more about how XTM Cloud extracts content from Office files for translation, we recommend that you read this article: How are segments in MS Office files (Word, Excel, PowerPoint) extracted? In XTM Cloud, you can create your own filter templates. These are used to identify translatable text in a document with a specific Office format. Familiarize yourself with the guideline: How to create a filter template in the XTM Cloud UI and apply it to a project. If there are any more complex filters, do not hesitate to contact our XTM International Support team and provide the details. When requesting our help, ensure that you follow the guidelines in this article: How to request a new configuration or change to an existing one? Additionally, learn about possible configuration levels from this article: How can a source file be processed and what are configuration levels? |
XTM wordcount vs. other CAT tools
...
Here you can set how many characters will be converted into words for metrics calculation. For each Asian language, enter the number of characters that is equivalent to one word. When it comes to a mixed script (e.g. Japanese + English source), the non-Asian language will be counted on a normal basis and the Asian alphabet will be counted according to factors set in the above-mentioned section.
...
Difference in wordcount for the same source file between different XTM Cloud projects
Sometimes you might encounter what at first might appear as an issue that different XTM Cloud projects contain different wordcount for the same source file.
In the vast majority of cases, the difference with extracted words is caused by a particular filter template being used in some XTM Cloud projects, while not in others.
For example, project A containing 1,060 words has been created with the use of the filter template called Word Exclude Hyperlink Address, the purpose of which is to exclude hyperlink addresses that are present in the document. On the other hand, project B containing 1,080 words has been created without using any filter template.