...
In addition to the standard options for identifying translatable text in a document, the Filter Builder embedded in the XTM UI offers some additional text-related options.
Segmentation
You can configure segmentation rules in a new template at:
...
Note |
---|
IMPORTANT! It is not possible to add a custom template at the Global level. You can only add segmentation configurations to the existing global filter template. |
...
In the next and final section (Create your configuration), you first need to specify the file type (File type) and the source language (+Add language) for which you want to configure segmentation rules. Once you have selected a language, a list of the corresponding segmentation rules (regular expressions) is displayed.
In the screenshot below, four segmentation rules that handle four English honorifics followed by a full stop are marked:
(^|\p{Zs}|\p{Ps}|\p{Po}|(<[^>]*>))Mr\. → Mr.
(^|\p{Zs}|\p{Ps}|\p{Po}|(<[^>]*>))Mrs\. → Mrs.
(^|\p{Zs}|\p{Ps}|\p{Po}|(<[^>]*>))Ms\. → Ms.
(^|\p{Zs}|\p{Ps}|\p{Po}|(<[^>]*>))Dr\. → Dr.
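To make the shared prefix of these rules easier to read, here is a minimal Python sketch of the matching logic only. It is not XTM's segmentation engine, and the names `HONORIFICS`, `prefix_ok`, and `find_rule_matches` are hypothetical. Python's built-in `re` module does not support `\p{...}` classes, so the sketch checks the Unicode general category of the preceding character with `unicodedata` instead, and it assumes that `^` means the start of the text.

```python
import re
import unicodedata

# The four honorifics covered by the rules listed above.
HONORIFICS = ("Mr.", "Mrs.", "Ms.", "Dr.")

# Unicode general categories named in the rule prefix:
# Zs = space separator, Ps = opening punctuation, Po = other punctuation.
PREFIX_CATEGORIES = {"Zs", "Ps", "Po"}

_HONORIFIC_RE = re.compile("|".join(re.escape(h) for h in HONORIFICS))
_TAG_AT_END_RE = re.compile(r"<[^>]*>$")


def prefix_ok(text, start):
    # Emulates the prefix group of the rules: start of text, one character
    # of category Zs/Ps/Po, or an inline tag such as <b> directly before.
    if start == 0:
        return True
    prev = text[start - 1]
    if unicodedata.category(prev) in PREFIX_CATEGORIES:
        return True
    if prev == ">":
        return _TAG_AT_END_RE.search(text[:start]) is not None
    return False


def find_rule_matches(text):
    # Returns (honorific, start_index) for every occurrence whose
    # preceding context satisfies the rule prefix.
    return [
        (m.group(0), m.start())
        for m in _HONORIFIC_RE.finditer(text)
        if prefix_ok(text, m.start())
    ]


sample = "Mr. Smith met (Dr. Jones) and <b>Ms. Lee</b>, not XMr. Nobody."
for honorific, index in find_rule_matches(sample):
    print(index, honorific)
```

In this sample the sketch reports Mr., Dr., and Ms., and skips the "Mr." inside "XMr." because the preceding character is a letter, which none of the prefix alternatives allow.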
...
After you apply this newly created configuration to a project, the segmentation in XTM Workbench looks as shown below. As you can see, sentences are broken into separate segments after every occurrence of one of these abbreviations.
...
Font mapping allows you to define which font is used in the target file. You can make the use of a particular target font conditional on the presence of a certain font in the corresponding source file, on a specific target language, or on a specific source font category. Font mapping in the XTM UI is based on configuration files for the MS Word format.
The screenshot below shows three example font mapping rules created within a single filter template:
If the MS Word source file’s text is in Calibri → the target file will be in Times New Roman Bold.
If the target language of an MS Word source file’s text is Polish → the target file will be in Arial Black.
If the source language’s writing system belongs to the so-called complex scripts (in which the shape or positioning of a grapheme depends on its relation to other graphemes, for instance, the Arabic alphabet), and the target language is Japanese → the target file will be in Comic Sans MS.
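The three rules above can be read as condition-to-font mappings. The following Python sketch models them that way purely for illustration. It does not reflect XTM's actual configuration format: `FontRule`, `resolve_target_font`, and the category label `"complex script"` are hypothetical, and both the "first matching rule wins" order and the fallback to the source font are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FontRule:
    target_font: str
    source_font: Optional[str] = None      # e.g. "Calibri"
    target_language: Optional[str] = None  # e.g. "Polish"
    source_category: Optional[str] = None  # e.g. "complex script" (hypothetical label)

    def matches(self, source_font, target_language, source_category):
        # A condition left as None is treated as "any value".
        return (
            (self.source_font is None or self.source_font == source_font)
            and (self.target_language is None or self.target_language == target_language)
            and (self.source_category is None or self.source_category == source_category)
        )


# The three example rules from the list above.
RULES = [
    FontRule("Times New Roman Bold", source_font="Calibri"),
    FontRule("Arial Black", target_language="Polish"),
    FontRule("Comic Sans MS", target_language="Japanese",
             source_category="complex script"),
]


def resolve_target_font(source_font, target_language, source_category, rules=RULES):
    # Assumption: rules are checked in order and the first match wins.
    for rule in rules:
        if rule.matches(source_font, target_language, source_category):
            return rule.target_font
    # Assumption: if no rule applies, the source font is kept.
    return source_font


print(resolve_target_font("Calibri", "German", "latin"))
print(resolve_target_font("Arial", "Polish", "latin"))
```

Leaving a condition unset makes a rule apply regardless of that property, which is how the first two rules can each depend on a single condition while the third combines two.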
...
Note |
---|
IMPORTANT! You can set a particular font manually by typing its name in the field, and it will work, but the font will not be added to the list you see in the dropdown. The font list is hard-coded and is based on the Windows default font list. |
...