Question About Azure Custom Neural Voice Training Data

현우 오 181 Reputation points
2021-10-12T08:26:25.743+00:00

Hello.
I have a question about Azure's Custom Neural Voice.

Does the length of the silence at the beginning and end of the audio files used to train a Custom Neural Voice model affect the length of the silence at the beginning and end of the audio that the trained voice produces through TTS?

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

Accepted answer
  1. Ramr-msft 17,741 Reputation points
    2021-10-14T10:56:56.72+00:00

    @현우 오 No, the silence in the training data doesn't affect the leading or trailing silence of the synthesized audio. That silence is determined by the training engine. However, customers can use SSML to adjust breaks and silence after the model is trained. See: Speech Synthesis Markup Language (SSML) - Speech service - Azure Cognitive Services | Microsoft Learn.
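
The SSML adjustment mentioned above could look roughly like the following. This is a minimal sketch, not an official sample: the helper `build_ssml` and the voice name `MyCustomNeuralVoice` are hypothetical, and it assumes the `mstts:silence` element with the `Leading` and `Tailing` types as described in the SSML documentation. As far as I understand the docs, these two types add extra silence on top of the natural silence rather than replacing it, so check the documentation if you need to shorten the silence instead.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, voice_name, leading_ms=0, tailing_ms=0, lang="en-US"):
    """Build an SSML string that requests extra leading/tailing silence.

    `voice_name` is a placeholder for the name of your deployed custom voice.
    """
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="http://www.w3.org/2001/mstts" '
        f'xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice_name)}>'
        # Silence before the first word of the text.
        f'<mstts:silence type="Leading" value="{int(leading_ms)}ms"/>'
        # Silence after the last word of the text.
        f'<mstts:silence type="Tailing" value="{int(tailing_ms)}ms"/>'
        # Escape &, <, > so the text cannot break the XML.
        f'{escape(text)}'
        '</voice></speak>'
    )


ssml = build_ssml(
    "Hello, this is my custom voice.",
    "MyCustomNeuralVoice",  # hypothetical voice name
    leading_ms=100,
    tailing_ms=200,
)
print(ssml)
```

The resulting string can then be passed to the Speech SDK's SSML synthesis call (for example `speak_ssml_async` in the Python SDK), with the speech configuration pointing at your custom voice deployment. Pauses inside the text can be added with the standard `<break time="..."/>` element.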

    Have you already created the CNV model, and do you want the silence to be shorter or longer?


    1 person found this answer helpful.