In Azure AI Speech the training of a custom Speech-To-Text model with audio and transcript keeps throwing "Internal Error"

Benedikt Schmitt 20 Reputation points
2024-08-26T06:46:34.8966667+00:00

I am trying to fine-tune a baseline-20231107 model for my specific use-case. I have recorded two audios in a .wav-format with all the requirements mentioned on this page:

https://video2.skills-academy.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-test-and-train

I have also provided transcripts that fit the requirements from the documentation:

https://video2.skills-academy.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-human-labeled-transcriptions

Every time I try to train the model it throws an "Internal Error".

-I previously had my audio as .m4a and converted it to .wav. Now the audio was recorded as .wav

-I have waited several days with the training to see if it was a temporary error but it still doesn't work

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,675 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,782 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Benedikt Schmitt 20 Reputation points
    2024-08-27T09:35:43.03+00:00

    I checked my input data again ad it turns out my audio was too long. The different files were slightly longer than 40 seconds.

    It was user error in the end but it would still be a good idea to implement more descriptive error messages. It could have saved me a few days of waiting.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.