In, e.g., 0001.sentence.json, quotation marks present in the original sentence are dropped, if that quotation mark occurs at the beginning or end of the detected sentence. Is this expected behavior?

Verbari LLC 20 Reputation points
2024-06-01T05:49:55.17+00:00

This is mostly in the title.

Initially, I suspected this was a bug in the JSON serialization since JSON also uses " to delimit its fields, and these also have to be escaped in SSML.

Upon further investigation, however, i found it also affects other quotation marks, e.g., in a recent file, the original sentence was:

«Je suis votre guide touristique, » dit-elle avec un sourire amical.

But in my 0001.sentence.json, I see this:

  {
    "Text": "Je suis votre guide touristique, » dit-elle avec un sourire amical.",
    "AudioOffset": 45587,
    "Duration": 4487
  },

Is this expected behavior?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,516 questions
{count} votes

1 answer

Sort by: Most helpful
  1. romungi-MSFT 43,656 Reputation points Microsoft Employee
    2024-06-03T07:42:50.3666667+00:00

    @Verbari LLC Which API or feature are you referring to in the above case? I tried using this sentence with TTS service with audio content creation tool from speech studio and the exported file provided the input text as-is when I exported the content. If you are referring to STT then could you add more details?

    Example, this is the SSML I used with ACC tool to generate audio.

    <speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="fr-FR"><voice name="fr-FR-DeniseNeural">"Je suis votre guide touristique, » dit-elle avec un sourire amical."</voice></speak>
    
    
    0 comments No comments