In, e.g., 0001.sentence.json, quotation marks present in the original sentence are dropped, if that quotation mark occurs at the beginning or end of the detected sentence. Is this expected behavior?

Question

This is mostly in the title.

Initially, I suspected this was a bug in the JSON serialization since JSON also uses " to delimit its fields, and these also have to be escaped in SSML.

Upon further investigation, however, i found it also affects other quotation marks, e.g., in a recent file, the original sentence was:

«Je suis votre guide touristique, » dit-elle avec un sourire amical.

But in my 0001.sentence.json, I see this:

  {
    "Text": "Je suis votre guide touristique, » dit-elle avec un sourire amical.",
    "AudioOffset": 45587,
    "Duration": 4487
  },

Is this expected behavior?

Answer

@Verbari LLC Which API or feature are you referring to in the above case? I tried using this sentence with TTS service with audio content creation tool from speech studio and the exported file provided the input text as-is when I exported the content. If you are referring to STT then could you add more details?

Example, this is the SSML I used with ACC tool to generate audio.

"Je suis votre guide touristique, » dit-elle avec un sourire amical."

Share via

In, e.g., 0001.sentence.json, quotation marks present in the original sentence are dropped, if that quotation mark occurs at the beginning or end of the detected sentence. Is this expected behavior?

1 answer