Extraction of Word-embedded Content from MS Word files
Introduction
You might sometimes receive MS Word files for translation in XTM Cloud, which contain another Word-embedded content that are not being extracted for translation in XTM Workbench.
Take a look at the sample file:
How can this content be extracted?
The table that you are seeing in the MS Word file above was pasted into the file as a Picture (Enhanced Metafile) rather than as an embedded Excel file or a proper table. This behavior is specific to Microsoft Office products.
If you actually double-click on the embedding, a separate Word file will open up with the content in question.
When an object is pasted as an enhanced metafile, it retains some editable features that allow for partial modification, such as adjusting text or shapes. This is why, when you double-click on the object, Word allows you to make limited edits, although not all the data may be editable or correctly replicated.
However, despite allowing some degree of editing, the object is still technically a picture, and because of that, XTM Cloud is unable to extract its content for translation. This editable picture behavior is a feature of Word, but it does not make the content fully accessible for processes like extraction for translation, as would be the case with a properly embedded Excel table or sheet.
The only solution in this case is to reinsert the content as a properly structured Excel table or an embedded document, which would allow for easier extraction and translation.