Thanks for clarifying your use case. The Speech SDK does not currently provide an API for accessing the microphone audio it captures for speech transcription, although this is on our roadmap. A similar question was asked in this thread. Hope it helps.
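In the meantime, one possible workaround is to open the microphone yourself, separately from the Speech SDK, and feed it to a Web Audio `AnalyserNode` for the visualization. The following is only a minimal TypeScript sketch under that assumption, not an SDK feature: `rmsLevel` and `startMicLevelMeter` are hypothetical helper names, and the browser calls are the standard `getUserMedia`, `AudioContext`, and `AnalyserNode` APIs. Whether it runs without a user gesture depends on the browser: microphone permission must already be granted, and the `AudioContext` may start in the `suspended` state until `resume()` is called from a click or key handler.

```typescript
// Pure helper: convert one frame of AnalyserNode time-domain bytes
// (0..255, where 128 means silence) into an RMS level between 0 and 1.
function rmsLevel(samples: Uint8Array): number {
  if (samples.length === 0) return 0;
  let sumOfSquares = 0;
  for (let i = 0; i < samples.length; i++) {
    const centered = (samples[i] - 128) / 128; // map to roughly -1..1
    sumOfSquares += centered * centered;
  }
  return Math.sqrt(sumOfSquares / samples.length);
}

// Handle returned to the caller (hypothetical shape, for illustration only).
interface MicLevelMeter {
  resume: () => Promise<void>; // call from a click/key handler if the context is suspended
  stop: () => void;            // stop reporting levels and release the microphone
}

// Browser-only wiring: opens the microphone independently of the Speech SDK
// and reports a level on every animation frame.
function startMicLevelMeter(onLevel: (level: number) => void): Promise<MicLevelMeter> {
  return navigator.mediaDevices.getUserMedia({ audio: true }).then((stream) => {
    const context = new AudioContext();
    const analyser = context.createAnalyser();
    analyser.fftSize = 2048;
    context.createMediaStreamSource(stream).connect(analyser);

    const buffer = new Uint8Array(analyser.fftSize);
    let frameId = 0;
    const tick = () => {
      analyser.getByteTimeDomainData(buffer);
      onLevel(rmsLevel(buffer));
      frameId = requestAnimationFrame(tick);
    };
    tick();

    return {
      resume: () => context.resume(),
      stop: () => {
        cancelAnimationFrame(frameId);
        stream.getTracks().forEach((track) => track.stop());
        void context.close();
      },
    };
  });
}
```

On the page that should show the visualization, `startMicLevelMeter(level => { /* draw a bar or waveform from level */ })` could be called on load. If no level changes appear, the context is probably suspended, and calling the returned `resume()` from any earlier user interaction (for example, the click that started the flow on the first page) may be enough.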
Visualize real-time voice data with STT
seoyeon lee
I would like the user's voice to be visualized in real time as soon as the app automatically moves on to the next page. I understand that the Web Audio API requires a user gesture. Can STT capture real-time voice without a user gesture?
If there is any other way to get the user's voice, please let me know, and tell me if what I want is not feasible.
Thank you.