1,516 questions with Azure AI Speech tags

1 answer

How to get sentence and word timestamp results for real-time speech recognition?

I am using the Go SDK. This is my Go code: func (m *microsoft) Do(ctx context.Context, path string) (string, error) { defer os.Remove(path) accessKeyConfig := AccessKeyList[rand.Intn(len(AccessKeyList))] subscription := accessKeyConfig.Key region…

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,516 questions
asked 2024-07-05T06:25:10.16+00:00
莓 草 0 Reputation points
commented 2024-07-05T13:23:28.6533333+00:00
navba-MSFT 19,655 Reputation points Microsoft Employee
1 answer

No voice when I click the "play" button to create speech from text

There is no voice when I click the "play" button to create speech from text, even though my laptop sound is already turned on.

Azure AI Speech
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,577 questions
asked 2024-07-05T07:22:27.8133333+00:00
Grace Xiong 0 Reputation points Microsoft Employee
answered 2024-07-05T08:03:43.17+00:00
santoshkc 6,310 Reputation points Microsoft Vendor
1 answer

Azure AI Speech content filter

Hey everyone, I am using the Azure AI Speech API for real-time transcription of conversations. The problem I am facing is that the content filter recognizes words such as the German 'dick' as offensive. This might be true in English, however in German…

Azure AI Speech
asked 2024-07-04T18:16:35.31+00:00
Julian 0 Reputation points
commented 2024-07-05T05:54:41.2666667+00:00
navba-MSFT 19,655 Reputation points Microsoft Employee
1 answer. One of the answers was accepted by the question author.

Ingesting a webpage URL for the OpenAI web app in Azure

Hi there. In Azure OpenAI Studio, there is an option for defining a webpage URL when you add data for the app, but based on the requirements on the Microsoft website, it can only extract text from up to 20 sublinks, and I can only put one URL in it. …

Azure AI Speech
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,538 questions
asked 2024-02-26T19:15:09.8766667+00:00
Jalali, Hadi 40 Reputation points
commented 2024-07-05T04:55:11.28+00:00
Mansi Gusain 0 Reputation points
1 answer

Create a basic voice-interactive dashboard

Hello Team, I need to create a basic voice-interactive dashboard using Azure Cognitive Services such as the Speech service, CLU (Conversational Language Understanding), and Power BI. Please also suggest any other way to achieve this. It would be really helpful.

Azure AI Speech
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
383 questions
asked 2024-06-27T17:16:48.97+00:00
Vijayakumar Elumalai 105 Reputation points
commented 2024-07-05T02:35:38.43+00:00
YutongTie-MSFT 48,001 Reputation points
0 answers

How can I use Whisper on Azure AI Speech?

Hi, I recently switched from using the Whisper model via Azure OpenAI to using Azure AI Speech. However, I noticed that the quality of some transcriptions is worse on Azure AI Speech. On the page below, it says that it is possible to use the Whisper model…

Azure AI Speech
asked 2024-07-04T18:25:53.3633333+00:00
Julian 0 Reputation points
commented 2024-07-05T01:35:20.07+00:00
dupammi 7,955 Reputation points Microsoft Vendor
0 answers

Failed to get HTTP platform singleton instance. Error: 27

Hello! I'm working with the Azure Speech Services SDK via Python. The code worked well until I started getting blank responses. Basically, my request got cancelled; when I checked the reason, I got this: #…

Azure AI Speech
asked 2024-07-04T14:37:35.1233333+00:00
Vitalii Brydinskyi 0 Reputation points
commented 2024-07-05T01:16:34.1666667+00:00
VasaviLankipalle-MSFT 15,836 Reputation points
0 answers

I use speech to text and want to transcribe the corresponding text, but it keeps timing out without successful recognition. Why is this happening?

This is my file; you can download it here: https://feedback.meitudata.com/public/file/yASWSTNPh2RE3Ncv.wav

Azure AI Speech
asked 2024-07-04T11:47:58.2133333+00:00
莓 草 0 Reputation points
commented 2024-07-04T19:39:11.02+00:00
VasaviLankipalle-MSFT 15,836 Reputation points
1 answer

Is each voice in the voice gallery based on a clone of one specific natural person or is it synthetic?

I would like to understand whether: (1) each voice in the voice gallery is based on a clone of one specific natural person, or (2) the voices are synthetic (similar to those from 11Labs Voice Design) and cannot be traced back to an individual person. Thank you!

Azure AI Speech
asked 2024-07-02T07:18:37.07+00:00
mpsb 0 Reputation points
commented 2024-07-04T10:52:53.4666667+00:00
santoshkc 6,310 Reputation points Microsoft Vendor
0 answers

SpeechSynthesizer sometimes plays speech depending on SpeechSynthesisOutputFormat

In a C# WPF application, I call this function to convert text to speech: SpeechSynthesisResult speechSynthesisResult = await speechSynthesizer.SpeakSsmlAsync(strSsml); The audio data is returned OK, but the function also sometimes plays the speech as…

Azure AI Speech
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,060 questions
C#
An object-oriented and type-safe programming language that has its roots in the C family of languages and includes support for component-oriented programming.
10,573 questions
asked 2024-07-03T14:10:30.4733333+00:00
One More Henry 20 Reputation points
commented 2024-07-04T08:41:08.3966667+00:00
One More Henry 20 Reputation points
1 answer

Azure Speech speaker differentiation

Hi, I would like to use Azure Speech to transcribe a meeting, but I want it to differentiate between anonymous speakers, e.g. speaker A and speaker B. Is it possible to do that? Are there any samples and tutorials out there that I can just take and use?…

Azure AI Speech
Azure AI services
asked 2024-07-04T08:05:43.6733333+00:00
jchoo 0 Reputation points
edited the question 2024-07-04T08:17:20.3633333+00:00
AmaranS 3,455 Reputation points Microsoft Vendor
0 answers

Is there a way to make speech service transcription faster (diarization with speakers differentiated)?

Currently the speed seems to be half the time for WAV and a 1:1 ratio for MP4 with GStreamer. From this post, it seems half the time for a WAV file is the…

Azure AI Speech
asked 2024-07-02T05:30:12.4666667+00:00
kk 0 Reputation points
commented 2024-07-04T06:45:28.3466667+00:00
santoshkc 6,310 Reputation points Microsoft Vendor
0 answers

Microsoft: fix captioning by Speech Studio

The captioning functionality in the Speech Studio is an utter failure. This is typical output: I encourage Microsoft to implement the functionality that allows the user to specify the number of lines of text (typically one or two), and the maximum…

Azure AI Speech
asked 2024-07-02T00:44:05.4533333+00:00
Roy Jensen 20 Reputation points
commented 2024-07-04T05:58:44.2433333+00:00
navba-MSFT 19,655 Reputation points Microsoft Employee
0 answers

No audio when using SpeechSDK in a PCF control (canvas app)

I made a PCF control that uses the Speech SDK to synthesize text to speech. This works when I run "npm start watch" to test it. When I publish this PCF control and use it in a canvas Power App, I cannot hear the synthesized text. What can…

Azure AI Speech
asked 2024-07-03T13:38:13.5566667+00:00
Joël Simons 0 Reputation points
commented 2024-07-04T03:53:29.93+00:00
YutongTie-MSFT 48,001 Reputation points
1 answer. One of the answers was accepted by the question author.

Azure Cognitive Services Speech: Unable to get Custom Translator model results from speech translation code

In test C# code that I created based on the speech translation code in the following sample (“Using custom translation in speech translation”), I’m having trouble getting Custom Translator model translation results. The code just returns a cancellation…

Azure AI Speech
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
361 questions
Azure AI services
asked 2024-06-28T21:35:09.9366667+00:00
Hirai, Tetu 20 Reputation points
accepted 2024-07-04T03:37:10.81+00:00
Hirai, Tetu 20 Reputation points
1 answer

Request to Increase Whisper Model Quota Limit

Hi Azure Community, I hope everyone is doing well. I am currently working on a project that requires a higher capacity of the Whisper model than my current Azure quota allows. I am seeking guidance on how to increase my Whisper model quota…

Azure AI Speech
Azure AI services
asked 2024-07-03T19:56:23.8133333+00:00
narayanam Srinivasulu 0 Reputation points
edited an answer 2024-07-04T00:03:21.8633333+00:00
VasaviLankipalle-MSFT 15,836 Reputation points
1 answer. One of the answers was accepted by the question author.

SpeakSsmlAsync is cancelled, but SpeakTextAsync is successful

I am trying out the Azure AI service to convert text to speech from a C# WPF application. My calls through SpeakTextAsync are successful, but my calls through SpeakSsmlAsync are returned with Reason = Cancelled. I am on the free tier for South…

Azure AI Speech
asked 2024-06-28T11:57:49.3633333+00:00
One More Henry 20 Reputation points
accepted 2024-07-03T13:51:52.1266667+00:00
One More Henry 20 Reputation points
1 answer

Azure Text to Speech Docker container throws an exception with viseme

I'm using the Azure Text to Speech Docker image (mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.3.0-amd64-en-us-jennyneural). I'm passing it SSML through the .NET SDK. When asking for viseme (via <mstts:viseme…

Azure AI Speech
asked 2024-06-30T01:37:32.95+00:00
Jon Peterson 26 Reputation points
edited a comment 2024-07-03T12:36:51.7933333+00:00
Jon Peterson 26 Reputation points
0 answers

How to assign operation permissions to a resource

Hello, I am new to Azure and I want to use it to convert text to speech. When I create the resource, enter Speech Studio, and try to start the service, the system raises an error saying "You don't have operation permissions to [New],…

Azure AI Speech
asked 2024-06-29T02:40:15.9166667+00:00
Jingxiong Wang 0 Reputation points
commented 2024-07-03T04:49:59.52+00:00
santoshkc 6,310 Reputation points Microsoft Vendor
1 answer

Random Words Detected by Azure Speech Recognizer in Silence

Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…

Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
779 questions
Azure AI Speech
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters. Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-06-24T07:44:54.3933333+00:00
Abdul Subhan 5 Reputation points
edited the question 2024-07-03T04:23:28.4666667+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee