Is it possible to recognize short words ("Yes", "No", "Ok") in speech sdk consistently

Faris Lemes 50 Reputation points
2024-06-03T04:53:50.27+00:00

hello, I was experimenting with SPEECH SDK and I was using https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/cpp/linux/from-microphone/helloworld.cpp

I've adjusted code a bit, I was using the following timeouts:

config->SetProperty(PropertyId::SpeechServiceConnection_InitialSilenceTimeoutMs, "1500");
config->SetProperty(PropertyId::Speech_SegmentationSilenceTimeoutMs, "5000");
config->SetProperty(PropertyId::SpeechServiceConnection_EndSilenceTimeoutMs, "1000");
auto audioProcessingOptions = AudioProcessingOptions::Create(AUDIO_INPUT_PROCESSING_ENABLE_DEFAULT);
auto recognizer = SpeechRecognizer::FromConfig(config, audioInput);

One thing that I noticed is that when I say only some short words i.e. "yes", "no", "ok" I would get weird results from the picture image.png. I was not saying anything else except "Yes", "No" and "OK".

Thanks

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,516 questions
{count} votes

Accepted answer
  1. Gowtham CP 3,730 Reputation points
    2024-06-03T05:21:18.5666667+00:00

    Hello Faris Lemes ,

    Thanks for reaching out in the Microsoft Q&A!

    Here are some tips to improve recognition:

    1. Increase Timeout: Try setting the "Speech_SegmentationSilenceTimeoutMs" to 3 seconds to give the system more time to process short words.
    2. Clear Speech: Ensure you're in a quiet environment and enunciate clearly to avoid confusion.
    3. Update SDK: Make sure you’re using the latest version of the Speech SDK to benefit from recent fixes.
    4. Custom Model: If these words are super important, consider training a custom speech model with your specific vocabulary. It'll learn your voice and these keywords even better.

    Please let me know if you have further questions.

    If the response helped, do "Accept Answer" and up-vote it to close the thread! Thanks


0 additional answers

Sort by: Most helpful