@Verbari LLC Which API or feature are you referring to in the above case? I tried using this sentence with TTS service with audio content creation tool from speech studio and the exported file provided the input text as-is when I exported the content. If you are referring to STT then could you add more details?
Example, this is the SSML I used with ACC tool to generate audio.
<speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="fr-FR"><voice name="fr-FR-DeniseNeural">"Je suis votre guide touristique, » dit-elle avec un sourire amical."</voice></speak>