Custom Translator Phrase Dictionary only working with exact case

Eric McStravick 0 Reputation points
2024-06-24T17:27:57.5866667+00:00

We have an issue that our Fully trained Custom Translator engine is not using the right phrases.

It's trained with about 18,000 segments + 700 terms in phrase dictionaries.

If we match the case exactly, it seems to work, but if we don't it's hit or miss what we will get.

I can't give specific examples since the data is PIP; but I have multiple entries I tested with. For example, if the dictionary entry is "The ABC Corp" and the sentence contains "we at the ABC corp believe....," it doesn't work, but "we at The ABC corp.." might work and "we at The ABC Corp" will work.

The other thing we found is astrerisk completely rules out a match, "The ABC* Corp" will never match "The ABC Corp" regardless of case variation. This one does makes more sense, but I would expected that some punctation would be ignored.

Is there something I can do to make the Phrase dictionary less observant of capitalization/punctuation?

Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
361 questions
{count} votes