Visualize real-time voice data with stt

seoyeon lee 1

I hope that the user's real-time voice will be visualized when automatically moving on to the next page. Web audio api knows that user gestures are necessary. Can STT get real-time voice without user gestures?

If there's any other way to get user voice, please give me your feedback. and Tell me if what I want is not feasible.

thank you.

GiftA-MSFT 11,171 Reputation points

2020-10-30T16:21:36.8+00:00

Hi, thanks for reaching out. Azure speech-to-text enables real-time transcription of audio streams into text. Can you describe what visualization you're referring to?
seoyeon lee 1 Reputation point

2020-11-02T00:04:06.163+00:00

I want to use the sst to get the user's real-time voice.
I'd like to create a waveform with the imported user voice.
The goal is to waveform.

1 answer

GiftA-MSFT 11,171 Reputation points

2020-11-03T00:07:42.657+00:00

Thanks for clarifying your use case. Currently Speech SDK does not provide APIs to capture the microphone audio used for speech transcription, however, it's on our roadmap. A similar question was asked on this thread. Hope it helps.
Please sign in to rate this answer.

0 comments No comments
Sign in to comment

Use comments to ask for clarification, additional information, or improvements to the question.

Share via

Visualize real-time voice data with stt

1 answer

Your answer