Question About Azure Custom Neural Voice Training Data

Question

Hello.
I have a question while using Azure's Custom Neural Voice.

I would like to know whether the silence at the beginning and end of the audio files used as training data for Custom Neural Voice affects the silence at the beginning and end of the audio that the trained voice produces through TTS.

Accepted Answer

@현우 오 No, the training data doesn't affect the leading or trailing silence of the synthesized audio. That silence is determined by the training engine. However, customers can use SSML to adjust breaks and silence after the model is trained; see Speech Synthesis Markup Language (SSML) - Speech service - Azure Cognitive Services | Microsoft Learn.

Have you already created the CNV model, and do you want the silence to be shorter or longer?
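
As a rough illustration of the SSML approach, here is a minimal Python sketch that only builds the SSML string; it does not call the Speech service. The helper name build_ssml, the voice name MyCustomVoiceNeural, and the chosen durations are placeholders, not values from this thread. It assumes the mstts:silence element with the Leading-exact and Tailing-exact types as described in the SSML documentation linked above, so please check that page for the exact namespace URI and the supported types before relying on it.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

# Namespace URIs as they appear in SSML samples (assumption: verify against the docs).
SSML_NS = "http://www.w3.org/2001/10/synthesis"
MSTTS_NS = "http://www.w3.org/2001/mstts"


def build_ssml(text, voice_name, leading="0ms", tailing="0ms", lang="en-US"):
    """Return an SSML string that requests explicit leading/trailing silence.

    build_ssml is a hypothetical helper, not part of any SDK.
    """
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" '
        f'xmlns:mstts="{MSTTS_NS}" xml:lang={quoteattr(lang)}>'
        f"<voice name={quoteattr(voice_name)}>"
        # "-exact" types replace the natural silence instead of adding to it.
        f'<mstts:silence type="Leading-exact" value={quoteattr(leading)}/>'
        f'<mstts:silence type="Tailing-exact" value={quoteattr(tailing)}/>'
        f"{escape(text)}"
        "</voice></speak>"
    )


# Placeholder voice name; use the name of your own deployed CNV voice.
ssml = build_ssml("Hello & welcome.", "MyCustomVoiceNeural",
                  leading="0ms", tailing="200ms")

# Parsing the result is a cheap check that the SSML is well-formed XML.
root = ET.fromstring(ssml)
print(ssml)
```

The resulting string can then be passed to whatever you use for synthesis, for example the Speech SDK's speak_ssml_async method. To shorten or lengthen the silence, change only the leading and tailing values; the model does not need to be retrained.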
