Bilingual TMX file - tag handling/preprocessing needs for engine training

Conchita Laguardia 1 Reputation point
2021-03-11T14:52:24.373+00:00

Hi,
We are using Custom Translator to customise engines with our bilingual data. When it comes to inline tags, it is not clear to us whether we can submit TMX files directly downloaded from our TMS, or whether we need to clean any of the tags. My understanding is any tags should be left alone. For example:

<tuv xml:lang="en-US">
<seg>
<bpt i="0"><g id="i4"></bpt> Android Enterprise: <ept i="0"></g></ept></seg>
</tuv>
<tuv xml:lang="ja-JP">
<seg>
<bpt i="0"><g id="i4"></bpt>Android Enterprise:<ept i="0"></g></ept></seg>
</tuv>

We appreciate your help to confirm or clarify how to proceed, thank you.

Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
393 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Sagar Sarkar 1 Reputation point
    2021-03-18T12:51:23.033+00:00
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.