Real time diarization, for true!

Marco Cocco 5 Reputation points
2024-07-06T21:04:52.2466667+00:00

Hi,

i've decided to join Azure AI program due to this demo:

https://speech.microsoft.com/portal/speechtotexttool

In this demo, I can activate microphone, flagging the Diarization to True, and that's it!

Now, when I've discovered by documentation that I couldn't replicate that exact feature:

https://video2.skills-academy.com/en-us/azure/ai-services/speech-service/get-started-stt-diarization?tabs=windows&pivots=programming-language-python

I felt VERY disappointed.

Now I'm struggling to find a way to use a good silence detector (like the one to STT Azure AI of the quickstart) but that would be at least able to save the speech until silence to a wav file to pass to the diarizer.

I'm using Python for Ai, but If needed I could change to Java or C#.

I'm really really sad in this moment.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,713 questions
{count} votes

1 answer

Sort by: Most helpful
  1. romungi-MSFT 45,731 Reputation points Microsoft Employee
    2024-07-08T08:11:22.96+00:00

    @Marco Cocco Could you please add more detail about what failure you have seen while using the quickstart for python? If you are looking to just use the microphone instead of the file as mentioned in the sample, set the use_default_microphone to True instead of filename parameter in AudioConfig class. See the reference of the class here for available options.

    audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True)

    Also, even if you do not provide AudioConfig the default will be set to mic for python. See the confirmation in this issue from SDK team.

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.