1,506 questions with Azure AI Speech tags

Sort by: Updated
1 answer

Azure Text To Speech docker container throws an exception with viseme

I'm using the Azure Text to Speech docker image (mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.3.0-amd64-en-us-jennyneural). I'm passing it SSML through the dotnet SDK. When asking for viseme (via <mstts:viseme…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-30T01:37:32.95+00:00
Jon Peterson 26 Reputation points
answered 2024-07-03T07:20:10.6333333+00:00
dupammi 7,955 Reputation points Microsoft Vendor
0 answers

Is there a way to make speech service transcription faster (diarization with speakers differentiated)?

Currently the speed seems to be half the time for wav and 1:1 ratio for mp4 with gstreamer. From this post, it seems half the time for wav file is the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-07-02T05:30:12.4666667+00:00
kk 0 Reputation points
commented 2024-07-03T07:14:34.3666667+00:00
santoshkc 6,380 Reputation points Microsoft Vendor
0 answers

how to assign operation permissions a resources

Hello, I am new to Azure and I want to use it to convert text to speech. when I creat the resources -> enter the speech studio and try to start the service, the system raised an error and say "You don't have operation permissions to [New],…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-29T02:40:15.9166667+00:00
Jingxiong Wang 0 Reputation points
commented 2024-07-03T04:49:59.52+00:00
santoshkc 6,380 Reputation points Microsoft Vendor
1 answer

Random Words Detected by Azure Speech Recognizer in Silence

Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…

Azure AI Bot Service
Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
777 questions
Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-06-24T07:44:54.3933333+00:00
Abdul Subhan 5 Reputation points
edited the question 2024-07-03T04:23:28.4666667+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
0 answers

Speech-to-Text batch transcribe API in germanycentralwest doesn't work

Last Friday (May 31 2024) we started getting the following errors on all transcripts sent to the batch transcription API on our speech resource in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-06-02T20:47:26.57+00:00
Matej the Mete 20 Reputation points
edited the question 2024-07-03T04:21:20.95+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
0 answers

azure prononciation assessment time limit

i am using azure prononciation assessment to assess an audio , but the problem the assessment happens only for the 1 min of the speech and it doesnt assess the rest of the audio this is my code const sdk =…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-05-17T11:36:28.55+00:00
Iheb Jandoubi 5 Reputation points
edited the question 2024-07-03T04:19:47.89+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
1 answer

azure prononciation assessment input video

can i give to azure prononciation assessment a video input ?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-05-09T16:12:24.94+00:00
Iheb Jandoubi 5 Reputation points
edited the question 2024-07-03T04:18:36.8366667+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
1 answer

azure prononciation assessment async assessment

i'am using azure speech recognizer sdk , to do the prononciation assessment of an audio file. the problem when the speech is in french the results are always low , and no expressive const language = await detectSingleSpeechLanguage(text) …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-05-07T17:04:34.3266667+00:00
Iheb Jandoubi 5 Reputation points
edited the question 2024-07-03T04:18:15.1066667+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
0 answers

Error while trying to train a 202240228 Whisper Large v2 baseline model

When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-05-03T08:53:22.2033333+00:00
Engineering 0 Reputation points
edited the question 2024-07-03T04:17:49.5766667+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
0 answers

How to create a dataset for Azure custom speech using spx (speechCLI)

I am using the following command for creating a custom speech dataset in my Azure Speech service: spx csr dataset create --api-version v3.1 --kind "Acoustic" --name "My Custom Speech" --description "My Acoustic Dataset…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-28T09:12:20.18+00:00
Mikel Broström Zalba 20 Reputation points
commented 2024-07-02T21:42:19.08+00:00
VasaviLankipalle-MSFT 15,836 Reputation points
1 answer

Azure Cognitive Services Speech: Unable to get Custom Translator model results from speech translation code

In test C# code that I created based on the speech translation code in the following sample (“Using custom translation in speech translation”), I’m having trouble getting Custom Translator model translation results. The code just returns a cancellation…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
360 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,570 questions
asked 2024-06-28T21:35:09.9366667+00:00
Hirai, Tetu 0 Reputation points
edited a comment 2024-07-02T13:58:23.02+00:00
Hirai, Tetu 0 Reputation points
1 answer

How to synchronize real world events happening while speech recognition is happening with individual spoken words

I am trying to synchronize real world events that are occuring during live streaming of speech to Azure speech recognition services (e.g., eye gaze shifts, hardware device interactions, etc.). I note the time when I start speech recognition and record…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-07-01T11:45:38.4166667+00:00
Mark Miller (DevExpress) 0 Reputation points
commented 2024-07-02T13:30:45.2+00:00
Mark Miller (DevExpress) 0 Reputation points
0 answers

Can my web app use a GPU for AI capabilities or will I need to use an Azure VM?

I am running a web app which I deployed through docker. The web app works perfectly besides one important detail, the whisperx ai model I have takes forever to run a transcription (think hours). I run the same ai function on a "T4 GPU" using…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,460 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,570 questions
Azure Static Web Apps
Azure Static Web Apps
An Azure service that provides streamlined full-stack web app development.
830 questions
asked 2024-07-01T15:09:37.23+00:00
Henrik Vlijter 0 Reputation points
commented 2024-07-02T11:19:28.9266667+00:00
dupammi 7,955 Reputation points Microsoft Vendor
1 answer

Is each voice in the voice gallery based on a clone of one specific natural person or is it synthetic?

I would like to understand whether: Each voice in the voice gallery is based on a clone of one specific natural person? Voices are synthetic (similar to those from 11Labs Voice Design) that cannot be traced back to an individual person? Thank you!

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-07-02T07:18:37.07+00:00
mpsb 0 Reputation points
answered 2024-07-02T10:35:31.66+00:00
santoshkc 6,380 Reputation points Microsoft Vendor
0 answers

Microsoft: fix captioning by Speech Studio

The captioning functionality in the Speech Studio is an utter failure. This is typical output: I encourage Microsoft to implement the functionality that allows the user to specify the number of lines of text (typically one or two), and the maximum…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-07-02T00:44:05.4533333+00:00
Roy Jensen 20 Reputation points
commented 2024-07-02T06:12:39.7266667+00:00
navba-MSFT 19,495 Reputation points Microsoft Employee
1 answer

SpeakSsmlAsync is cancelled, but SpeakTextAsync is successfull

I am trying out the Azure AI service to convert text to speech from a C# WPF application. My calls through SpeakTextAsync are successfull, but my calls through SpeakSsmlAsync are returned with the Reason = Cancelled. I am on the free tier for South…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-28T11:57:49.3633333+00:00
One More Henry 0 Reputation points
commented 2024-07-02T02:58:20.34+00:00
navba-MSFT 19,495 Reputation points Microsoft Employee
0 answers

What are the HW or sound limitations for the echo cancellation algorithm in SpeechSDK

hi, I'm having some issues with the echo cancellation on my device, and I'm trying to use speech SDK, when I was analyzing the sounds that I record with microphone it seems that there are present higher harmonics which are 24dB less then primary…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-28T07:30:28.45+00:00
Faris Lemes 50 Reputation points
commented 2024-07-01T09:11:11.38+00:00
navba-MSFT 19,495 Reputation points Microsoft Employee
1 answer

create a basic voice-interactive dashboard

Hello Team, I need to create a basic voice-interactive dashboard using Azure Cognitive services like, Speech service, CLU(Conversational Language Understanding) & PowerBI.Also suggest if any other way to achieve this. It would be really helpful.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure AI Language
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
382 questions
asked 2024-06-27T17:16:48.97+00:00
Vijayakumar Elumalai 105 Reputation points
commented 2024-07-01T06:28:05.21+00:00
Vijayakumar Elumalai 105 Reputation points
1 answer

As a student how can I use Azure Speech resource

I have a student subscription and want to create an Azure Speech resource, but there's a problem. Is it because of the student subscription limitation or what I can do to use Azure speech service?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
asked 2024-06-30T18:34:40.5666667+00:00
Aleksei Zhukov 0 Reputation points
edited an answer 2024-06-30T21:42:43.56+00:00
YutongTie-MSFT 47,991 Reputation points
0 answers

how can I set the permission to the resources

Hello, I want to upload a text file to Speech Studio, but the system raised an error Does anyone help how I can fix this and assign a proper role for myself? I already set my role as a Cognitive Services User.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,506 questions
Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,056 questions
asked 2024-06-29T02:47:13.3533333+00:00
Jingxiong Wang 0 Reputation points
commented 2024-06-30T00:55:14.18+00:00
YutongTie-MSFT 47,991 Reputation points