20 second Video limit for Pronunciation Assessment

Junbum Kwon 1

Hello,

Today, we tried to analyze Video speech using the "Pronunciation Assessment". But, at one time, we were allowed to analyze only 20 seconds' videos.

How we can analyze the entire video (about 3 minutes)?

YutongTie-MSFT 50,856 Reputation points

2021-07-18T21:49:06.037+00:00

Hello,

Is there any code sample you are referring to for your project? We have a video to explain the sample code here https://www.youtube.com/watch?v=zFlwm7N4Awc .

Hope this helps.

Regards,
Yutong

2 answers

YutongTie-MSFT 50,856 Reputation points

2021-08-04T23:56:48.01+00:00

Hello,

I have tried again on my side and it works for me well. As below screenshot, it's successfully with my 30sec audio. This is the sample code repo I am using, all of them are very convenient. https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/PronunciationAssessment/CSharp/Console

Please let me any block for this and share the code sample you are using.

Regards,
Yutong
Please sign in to rate this answer.

0 comments No comments
Sign in to comment

Use comments to ask for clarification, additional information, or improvements to the question.
Yinhe Wei 1 Reputation point Microsoft Employee

2021-09-07T05:56:29.89+00:00

Hi, @Junbum Kwon

To handle long speech for pronunciation assessment, you can refer to below sample code:
https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py#L643

It is based on continuous recognition and therefore it doesn't have limitation on length.

Regards,
Yinhe
Please sign in to rate this answer.

0 comments No comments
Sign in to comment

Use comments to ask for clarification, additional information, or improvements to the question.

Your answer