1,657 questions with Azure AI Speech tags

Sort by: Updated
0 answers

More details about Whisper model via Azure AI Speech

Hello, I'm trying to integrate the whisper model via Azure AI Speech. Here are somethings I already know: https://video2.skills-academy.com/en-us/azure/ai-services/speech-service/whisper-overview Whisper model via Azure OpenAI Service is available in the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,861 questions
asked 2024-07-16T03:52:55.4833333+00:00
YJ Kim 0 Reputation points
commented 2024-07-16T06:38:30.31+00:00
dupammi 8,460 Reputation points Microsoft Vendor
0 answers

How to deploy Live chat avatar based on the sample code in azure?

Hi, I follow this guide on how to setup Live Chat Avatar https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/yinhew/avatar/samples/js/browser/avatar and managed to setup it locally in my machine. Right how exactly do I deploy it into my…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,861 questions
asked 2024-07-15T09:16:54.35+00:00
Amir Basha 40 Reputation points
commented 2024-07-16T04:20:42.1633333+00:00
Amir Basha 40 Reputation points
0 answers

Speech-to-Text batch transcribe API in germanycentralwest doesn't work

Last Friday (May 31 2024) we started getting the following errors on all transcripts sent to the batch transcription API on our speech resource in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
247 questions
asked 2024-06-02T20:47:26.57+00:00
Matej the Mete 20 Reputation points
commented 2024-07-15T17:08:38.0966667+00:00
Matej the Mete 20 Reputation points
2 answers

How to collect user voice in real-time from the browser and then send it to Azure Speech-to-Text via WebSocket?

I'm almost driven crazy by this problem. The audio stream I capture with MediaRecorder on Chrome only supports the webm format, while the Azure API only supports wav and ogg formats. And there is no complete example telling me how to create a support for…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2023-12-15T05:06:24.9066667+00:00
CodeKidz 35 Reputation points
answered 2024-07-13T07:52:58.36+00:00
Kenneth Díaz González 0 Reputation points
0 answers

Linux App service running fastAPI Application using Azure Speech SDK doesnt produce recognition and translation results

I Built a Fast API application on a local development environment on Windows, python and Azure Speech SDK and Azure Translation Service. the application will transcribe videos and translate text to another language as desired. it is working fine, and the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
384 questions
Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,148 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,616 questions
asked 2023-11-18T10:52:45.2966667+00:00
AliAzizeh 10 Reputation points
edited a comment 2024-07-13T07:36:40.1033333+00:00
Kenneth Díaz González 0 Reputation points
1 answer

Azure Cognitive Speech to Text Duplicate Sentences returned on Channel 0 and Channel 1

We are developing a solution using Azure Cognitive Speech to Text service and have an issue with duplicate sentences being returned.  We have some cases with dual channel audio which appear to transcribe correctly with speaker channels. We have stereo…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure AI Language
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
402 questions
asked 2023-06-06T05:35:35.0933333+00:00
James Nicolson 5 Reputation points
edited an answer 2024-07-12T12:46:38.7466667+00:00
Radhika Jagtap 20 Reputation points
0 answers

How to use both pronunciation file and a structured file in custom speech to text and speech studio?

I am using Microsoft's speech studio's Custom speech to text service. I have created a project and when uploading the data files there are different format in which I can upload. For my project I am using multiple formats of data, I want to use both…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-12T06:42:27.0433333+00:00
TR Ganesh 0 Reputation points
commented 2024-07-12T07:58:49.71+00:00
dupammi 8,460 Reputation points Microsoft Vendor
0 answers

Pause and Resume Azure Ai Continuous Speech to Text Recognition

Hi, I'm trying figurin' out how to pause the speech recognition api, while is in its continuous mode. Pretty much same situation described…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-10T22:07:16.5033333+00:00
Marco Cocco 5 Reputation points
commented 2024-07-11T12:37:01.7266667+00:00
Marco Cocco 5 Reputation points
1 answer One of the answers was accepted by the question author.

How to prepare plain text data for speech service custom model training

Hi, I'm trying to train my custom speech-to-text model to improve its accuracy in recognizing industry-specific jargon(computer science). Q1: For example, some domain specific terminologies like 'LinkedList', 'HashMap', is it better to format as it is or…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-10T08:39:02.13+00:00
hexarrior 40 Reputation points
commented 2024-07-11T07:54:07.0266667+00:00
santoshkc 7,755 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

What are the HW or sound limitations for the echo cancellation algorithm in SpeechSDK

hi, I'm having some issues with the echo cancellation on my device, and I'm trying to use speech SDK, when I was analyzing the sounds that I record with microphone it seems that there are present higher harmonics which are 24dB less then primary…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-06-28T07:30:28.45+00:00
Faris Lemes 70 Reputation points
accepted 2024-07-11T05:13:53.1033333+00:00
Faris Lemes 70 Reputation points
0 answers

How to have the control over the audio playing when text is converted to speech using Azure Speech Service?

Below is the code I am using to convert text to audio for a button click using Azure speech service, but I am unable to stop the audio that is playing, I would like to use the same button to stop the audio while it is playing. How to have the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
384 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,861 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,759 questions
asked 2024-07-08T06:17:54.4566667+00:00
Shivani V 0 Reputation points
commented 2024-07-11T04:36:52.03+00:00
dupammi 8,460 Reputation points Microsoft Vendor
2 answers

Do we have a batch transcription in microsoft Azure speech to text cognitive services using java sdk or java Rest API ?

we have a embedded speech(microphone) speech to text cognitive service support in java but I want to implement a batch transcription using microsoft Azure cognitive services using java language, do we java sdk or java Rest API support for batch…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-10T15:10:29.04+00:00
Ganesh P 40 Reputation points
answered 2024-07-10T22:07:43.7066667+00:00
VasaviLankipalle-MSFT 17,006 Reputation points
1 answer

Can't preview a sound on Speech Studio

It happens on East US, S0

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-09T16:25:54.4633333+00:00
Quill Zhou 25 Reputation points
answered 2024-07-10T20:01:28.8733333+00:00
VasaviLankipalle-MSFT 17,006 Reputation points
1 answer One of the answers was accepted by the question author.

Seeking Optimal Speech Transcription Service for Mixed Chinese and English Scenarios

Our speech recognition scenario mainly involves a mix of Chinese and English. Currently, we have chosen the Chinese language recognition type (as there is no specific type for mixed Chinese and English). Besides manually adding hotwords and conducting…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-10T10:19:21.5833333+00:00
hexarrior 40 Reputation points
accepted 2024-07-10T19:48:13.1166667+00:00
hexarrior 40 Reputation points
1 answer One of the answers was accepted by the question author.

Improving Speech to Text Accuracy for Industry-Specific Terminology with Azure AI Service

Hi all, I want to improve the accuracy of reading industry-specific terminology(in Japanese) using Azure AI service's Speech to Text. The challenge is that these terms can have different meanings in general contexts versus industry-specific contexts. How…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,759 questions
asked 2024-07-09T07:38:58.95+00:00
KT 170 Reputation points
commented 2024-07-09T09:30:50.9633333+00:00
KT 170 Reputation points
1 answer

no voice when I click "play" button to create speech from text

no voice when I click "play" button to create speech from text, my laptop voice turned on already.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,759 questions
asked 2024-07-05T07:22:27.8133333+00:00
Grace Xiong 0 Reputation points Microsoft Employee
commented 2024-07-09T09:01:55.4866667+00:00
santoshkc 7,755 Reputation points Microsoft Vendor
1 answer

How to fix an issue where my 3D Blendshapes do not align with the audio.

I'm trying to apply viseme 3D Blend Shapes to drive my 3d avatar.  When the result is returned, the audio plays before the response's FrameIndex and BlendShape. I received event.animation and used it to set the weight for each blend shape name.  However,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-08T07:21:23.1533333+00:00
Ananchai Mankhong 0 Reputation points
commented 2024-07-08T17:42:37.0233333+00:00
Ananchai Mankhong 0 Reputation points
1 answer

Can I use phonetic language to create perfect speech

Can I use International Phonetic alphabetic translation in azure text to speech to come out with a near perfect speech? If so, how?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2023-12-06T06:55:10.1666667+00:00
Geoff Surtees 0 Reputation points
edited a comment 2024-07-08T12:58:32.9966667+00:00
Stefano Michieletto 0 Reputation points
0 answers

How can I use Whisper on Azure AI Speech

Hi, I recently switched from using the whisper model via Azure OpenAI to using Azure AI Speech. However, I noticed that the quality of some transcriptions is worse on Azure AI Speech. On the below page it says that it is possible to use the whisper model…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-04T18:25:53.3633333+00:00
Julian 0 Reputation points
commented 2024-07-08T11:03:44.17+00:00
dupammi 8,460 Reputation points Microsoft Vendor
1 answer

How to get sentence word timestamp results for real-time speech recognition ?

I am using Golang's SDK this is my golang code func (m *microsoft) Do(ctx context.Context, path string) (string, error) { defer os.Remove(path) accessKeyConfig := AccessKeyList[rand.Intn(len(AccessKeyList))] subscription := accessKeyConfig.Key region…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,657 questions
asked 2024-07-05T06:25:10.16+00:00
莓 草 0 Reputation points
commented 2024-07-08T09:01:24.4366667+00:00
navba-MSFT 22,995 Reputation points Microsoft Employee