Question About CNV (Custom Neural Voice) Limit
Hi, I have a question about the CNV concurrent-request limit. (https://video2.skills-academy.com/ko-kr/azure/cognitive-services/speech-service/speech-services-quotas-and-limits) Is the CNV concurrent-request limit (10) per Speech resource or…
Two times with a pause for each sentence
I am an ESL learner. I have a list of 400+ English sentences. I want to make them audible, but twice for each sentence: once at 70% speed and once at 100% speed, with a certain pause between one and the next. As for the creation of…
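A pattern like the one this question describes can be expressed in SSML using `<prosody rate="…">` for the two speeds and `<break time="…"/>` for the pause. The sketch below only builds the SSML string; the voice name `en-US-JennyNeural` and the 1500 ms pause are example assumptions, and the result would still need to be sent to a synthesizer.

```python
# Sketch: build SSML that reads each sentence twice — first at 70% rate,
# then at 100% — with a pause between repetitions.
# Assumed values: voice name and pause length are illustrative only.
SSML_TEMPLATE = (
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
    'xml:lang="en-US"><voice name="{voice}">{body}</voice></speak>'
)

def two_speed_ssml(sentences, voice="en-US-JennyNeural", pause_ms=1500):
    """Return one SSML document reading each sentence at 70% then 100% rate."""
    parts = []
    for sentence in sentences:
        parts.append(f'<prosody rate="70%">{sentence}</prosody>')
        parts.append(f'<break time="{pause_ms}ms"/>')
        parts.append(f'<prosody rate="100%">{sentence}</prosody>')
        parts.append(f'<break time="{pause_ms}ms"/>')
    return SSML_TEMPLATE.format(voice=voice, body="".join(parts))

ssml = two_speed_ssml(["The cat sat on the mat."])
print(ssml)
```

The generated string could then be passed to a TTS synthesis call that accepts SSML input (e.g. the Speech SDK's SSML-based synthesis method) rather than plain text.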
cognitive-services-speech-sdk-go/samples crashed
Installed the Azure Speech Go SDK with the following commands:
yum -y update
yum -y groupinstall "Development tools"
yum -y install alsa-lib openssl wget
export SPEECHSDK_ROOT="$HOME/speechsdk"
mkdir -p…
Custom Voice Missing From Custom Command
In Speech Studio a month or two ago there was a section in settings to define the custom voice it would respond with when using the custom command interface. This has now disappeared in the UI. Is this still supported? …
Is this working right now?
Hello, I just created a Custom Commands project, trained it, tested it, and nothing works. No response at all. I also get a 401 error using the Windows Voice Assistant sample exe. Is it down right now?
Custom Speech API 3.0 does not support Upload of new StructuredText Dataset
The API manual does not indicate what "kind" should be used when uploading structured text (in Markdown format), which is introduced in…
Realtime Speech To Text Performance
We are using basicRecognizer.StartContinuousRecognitionAsync() and a PushAudioInputStream to push an audio input stream from a mic and then output the results to a text editor. I am looking to see if there is a setting to break the results into 2-4…
Using the Speech-to-Text SDK with a Bluetooth headset
1. During a meeting, I use Teams with a Bluetooth headset. 2. I open my application, which uses the Speech-to-Text SDK. 3. What I expect is that both the voice I speak and the voice I hear can be translated into text. But the actual effect is that only my voice is…
Phoneme Support
Does Speech to Text provide phoneme support? @phoneme @Anonymous E.g. 1. What is the first sound in CAT? (Answer: /k/) 2. What is the last sound in "bottom"? (Answer: /tum/) If yes, please share the resource so I can check it as well. …
Azure Speechsdk on Raspberry Pi
I've been working on a project for the better part of a month now, and I keep hitting roadblocks. I am trying to build an automatic translator on my Raspberry Pi 4 that uses Bluetooth headphones as the audio input and output, preferably in…
Audio content creation: access denied to existing speech resource
I want to create a model at speech.microsoft.com, but it fails with a Speech resource to which I have Contributor access only (no access to the hosting RG of that resource). I can only work with a resource I created in the Speech portal myself. In the browser I am…
Azure TTS in webapp works in debug mode but not publish
I have created a basic web app in which a TTS voice plays when a button is clicked. The app works fine in debug mode but not in the browser after publishing. I know this is quite a common issue, but I have never found a way to solve it, or another way to implement the…
Can we use audio content created in Speech Studio for public videos without restrictions?
Hello, We would like to create audio content with Speech Studio and embed it in our videos, which we distribute via social media and our homepage, for example. Can we do this without restrictions or do we have to purchase separate licences for this? …
How do I use the Nbest function?
I want to use Python to see the speech recognizer's predicted words. However, only one word is being returned repeatedly, and no other predicted word can be observed at the moment. When I looked up the function, I found NBest and wanted to…
The function enquiry of voice recognizer Nbest.
Hello, I'm a student interested in speech recognition. The "NBest" function is documented, but I'm asking because I haven't seen a concrete example of it. In Google's case, I know it returns various predicted terms. Is there,…
Azure Speech SDK: How to specify the NBest length?
Hello, I'm using the Speech-to-Text SDK to recognize audio as text. The current JSON result seems to return 5 NBest candidates, but how can I specify how many NBest candidates to receive? Or get the full NBest list? Thank you.
speech to text (realtime)
I need help. I want to recognize speech in real time and see a list of predicted words. I want to use the NBest output in Python, but it doesn't work properly. I would appreciate it if someone could tell me the problem with the simple code…
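The NBest questions above share one underlying point: NBest candidates only appear in the service's detailed JSON result (in the Python SDK, by setting the speech config's output format to detailed and reading the result's JSON). The sketch below works purely on a sample of that JSON, so it runs without the SDK; the exact JSON shape shown is an assumption for illustration, and I am not aware of a documented parameter that changes how many candidates the service returns, so the `limit` below only trims the list client-side.

```python
import json

# Assumed example of the detailed-format JSON a recognition result carries
# when the detailed output format is enabled; field names are illustrative.
sample_result_json = """
{
  "RecognitionStatus": "Success",
  "DisplayText": "Hello world.",
  "NBest": [
    {"Confidence": 0.97, "Lexical": "hello world", "Display": "Hello world."},
    {"Confidence": 0.41, "Lexical": "hello word",  "Display": "Hello word."}
  ]
}
"""

def top_candidates(result_json, limit=5):
    """Return up to `limit` (confidence, display-text) pairs from NBest."""
    parsed = json.loads(result_json)
    nbest = parsed.get("NBest", [])  # empty when the simple format was used
    return [(c["Confidence"], c["Display"]) for c in nbest[:limit]]

print(top_candidates(sample_result_json))
```

In a real recognizer session, the string passed to `top_candidates` would come from the recognized result's JSON payload; if the simple (default) output format is in use, no `NBest` array is present, which matches the "only one word" symptom described above.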
Improving TTS latency
Hi, I'm using Azure TTS in my Android application. My implementation uses two objects, SpeechConfig and SpeechSynthesizer. My app is mainly based on dynamic TTS (e.g. reading input back to the user, which happens very often), but the…
Speech Studio and service unavailable
On October 29th, 2021, the portal speech.microsoft.com (Custom Commands) became unavailable, as did the service. I'm wondering what the cause was; this has happened before, and it could be a problem once my project goes online. In the…
Speech recognition engine has difficulties recognizing separate letters (nl_NL)
I have noticed that when separate letters are pronounced in a test set, the engine has a lot of difficulty recognizing them (Dutch language). Of course this is not a big surprise, since there is no meaningful context and the phoneme clusters are…