Azure AI Speech

0 answers

Saving audio stream generated by azure speech service ...

Hi, I am having hard time saving AudioDataStream generated by Azure TTS to a wave file using the method AudioDataStream.SaveToWaveFile(filename) for my Xamarin.Forms App. I am not sure what scheme to use for the filename parameter of the…

asked

Alaa Serry 21

commented

GiftA-MSFT 11,166

3 answers

We would like to know noises effect to speech to text performance

Hello. We are using Azure Japanese speech to text. We want to evaluate its performance. What parameters affect the result, noises or microphones or intonations or etc....?

asked

Kohei Watanabe 41

answered

Kohei Watanabe 41

4 answers

problem of running speechSynthetiser on aspx page

Hello, I've got a problem of running System.Speech.Synthesis.SpeechSynthetiser on aspx page. on a computer with windows 10 installation the control is operationnal but on a server with windows vista the same program with the same aspx and aspx.cs pages,…

asked

marc-antoine yonga 81

answered

Ivan 1

1 answer

Controlling speech pace dynamically

Hi all, I'm trying to control my speech pace (making it faster or slower) dynamically in my program. Is there any way to do it without using an SSML file? I'm using Java for Android. Thanks in advance.

asked

tges 26

accepted

tges 26

0 answers

Loading all intents from LUIS intent recognizer

I am using LUIS with speech input according to: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/python/intent-recognition/quickstart.py But ran into two problems, would be great if anyone has any advice: * Is there…

asked

holyone2 1

commented

holyone2 1

1 answer

Can Azure Japanese Speech to Text provide the hiragana (phonetic) transcription?

I would like to use voice to enter the names of people in hiragana, by default the highest confidence prediction seems to select the most plausible Kanji for that name, but it would be more useful if it gave me the hiragana (phonetic sound) of the name.…

asked

holyone2 1

commented

holyone2 1

1 answer

Speech to Text Error

Why am i getting this error, when I am trying to create a Resource for Speech to Text AzureSpeechTest. Error: Error: MissingSubscriptionRegistration. The subscription is not registered to use namespace 'Microsoft.CognitiveServices'. See…

asked

Mahadevan P 1

commented

romungi-MSFT 44,771 Microsoft Employee

1 answer

Rejection for potential voice biometric vulnerabilities

I could not find any documentation regarding the following two cases for voice recognition in MS Azure: 1- if a recording of a previous enrollment is used for verification, is it rejected? 2- if a single recording of a previous verification is…

asked

Christine Hanson 1

commented

romungi-MSFT 44,771 Microsoft Employee

1 answer

list of all phonemes used by pronounciation assessment service

Hello, I am looking for a list of all phonemes used by the pronunciation assessment service. Also, is there any word-phoneme mapping dictionary that the pronunciation service is using for checking the correctness? Best regards Rob

asked

Rob P 21

answered

YutongTie-MSFT 50,856

1 answer

cognitive pronounciation assessment send file directly from S3 bucket

When sending audio data to pronunciation assessment service, is it possible to provide a link to a file stored in e.g. S3 bucket?

asked

Rob P 21

accepted

Rob P 21

1 answer

[Japanese] Does the model of azure speech-to-text is the same as that of teams' live captioning?

Hello. We are using teams and azure speech-to-text in Japanese. We think the performance of azure speech-to-text is better than that of teams' live captioning, because the former one could recognize more proper nouns such as kaggle. Does the model…

asked

Kohei Watanabe 41

accepted

Kohei Watanabe 41

1 answer

Questions About CNV(Custom Neural Voice) Quotas and Limits.

Hi I have few questions while reading CNV(Custom Neural Voice) quotas and limits. (link below) …

asked

현우 오 181

accepted

현우 오 181

1 answer

Question About Azure Custom Neural Voice Training Data

Hello. I have a question while using Azure's Custom Neural Voice. I wonder if the silence time in the front and back of the voice file of the training data used when training Custom Neural Voice affects the silence time in the front and back of the…

asked

현우 오 181

accepted

현우 오 181

1 answer

Azure Custome Voice & Viseme Event

Hello, I'd like to train my own audio resource by Azure Custom Voice. Before I get start it, I have a question about Is Custom Neural Voice support a Visemes too?

asked

현우 오 181

accepted

현우 오 181

1 answer

Azure Speech Service TTS FromEndpoint C#

Hello Now I'm using Azure Speech Service TTS by C#. And I also use Viseme Event. While I using it, I was curious about change configuration method SpeechConfig.FromSubscription("<my subscription key>","<my region>")…

asked

현우 오 181

accepted

현우 오 181

0 answers

speech to text (realtime)

I need help. I want to recognize real-time speech and see a list of predicted words. So, I want to apply a function called NBest to Python, but it doesn't work properly. I would appreciate it if someone could tell me the problem with the simple code…

asked

sanghun jeon 41

commented

romungi-MSFT 44,771 Microsoft Employee

1 answer

Intent Rrecognition through JS won't record and recognize the intent

Hi, I am working on the Intent Recognition Quickstart from the following link https://video2.skills-academy.com/en-us/azure/cognitive-services/speech-service/get-started-intent-recognition?pivots=programming-language-javascript It was working fine on…

asked

Jang, Woong Jin 116

accepted

Jang, Woong Jin 116

2 answers

Speech Studio's Custom Keyword for Mandarin Advanced Model seems to not work as of August 27, 2021

Hi, I've been messing around with Speech Studio's Custom Keywords - Mandarin as the language for a few months. Advanced models for 2 words in Mandarin from my testing seems to not work at all. I have also tried a couple of 4 word phrases. I create…

asked

Jacob 96

accepted

Jacob 96

1 answer

Get facial pose events

Hello, When I get viseme events I can get the Viseme ID and Audio offset. Audio offset is the start time of each viseme. How do I know when the end time of each viseme.? thx.

asked

Jeonghoon 1

answered

romungi-MSFT 44,771 Microsoft Employee

1 answer

Questions about Azure Speech APIs function.

I have used voice APIs with various functions and feel that Microsoft's Azure Speech API is the best. However, one question is, can you show several recognized results like the Google Speech API function? For example, I wonder if other words…

asked

sanghun jeon 41

accepted

sanghun jeon 41

Filter

Content

1,675 questions with Azure AI Speech tags

Saving audio stream generated by azure speech service ...

We would like to know noises effect to speech to text performance

problem of running speechSynthetiser on aspx page

Controlling speech pace dynamically

Loading all intents from LUIS intent recognizer

Can Azure Japanese Speech to Text provide the hiragana (phonetic) transcription?

Speech to Text Error

Rejection for potential voice biometric vulnerabilities

list of all phonemes used by pronounciation assessment service

cognitive pronounciation assessment send file directly from S3 bucket

[Japanese] Does the model of azure speech-to-text is the same as that of teams' live captioning?

Questions About CNV(Custom Neural Voice) Quotas and Limits.

Question About Azure Custom Neural Voice Training Data

Azure Custome Voice & Viseme Event

Azure Speech Service TTS FromEndpoint C#

speech to text (realtime)

Intent Rrecognition through JS won't record and recognize the intent

Speech Studio's Custom Keyword for Mandarin Advanced Model seems to not work as of August 27, 2021

Get facial pose events

Questions about Azure Speech APIs function.