speakSsmlAsync returns invalid audio file

Question

We are using Javascript to access the API using speakSsmlAsync on the SpeechSynthesizer. We are expecting mp3 files. When we try to play these, in most software they don't play (QuickTime for example).

We are setting the audioConfig like this

const audioConfig = AudioConfig.fromAudioFileOutput(filename);

where filename is something like my-file.mp3

When I try and open in Handbrake I get errors like this:

Input #0, wav, from 'my-file.mp3':
Duration: 00:00:02.31, bitrate: 256 kb/s
Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, 1 channels, s16, 256 kb/s
[16:24:03] hb_stream_open: open my-file.mp3 failed
[16:24:03] scan: unrecognized file type
[16:24:03] libhb: scan thread found 0 valid title(s)
[16:24:03] macgui: ScanCore scan done

Which makes me think it's not encoded properly.

If I change the extension to .wav, it will play (although it still says it's invalid).

So

what are we doing wrong?
is there a way to specify the output format / rate explicitly by creating our own AudioConfig? We couldn't figure out how to do that.

Thanks!

Accepted Answer

Thanks.

These functions don't appear to exist in Javascript.

It helped though to show that it should be possible, and after some more digging around in the docs and examples, we eventually found that we could set the output format in the SpeechConfig like this

speechConfig.speechSynthesisOutputFormat = sdk.SpeechSynthesisOutputFormat.Audio16Khz32KBitRateMonoMp3;

This gave us playable mp3 files.

Thanks again.

Answer

@Ben Carter In this case you need to set the output file format in the SpeechSynthesizer() using SetOutputToWaveFile()

SpeechSynthesizer synth = new SpeechSynthesizer()  
synth.SetOutputToWaveFile(@"C:	emp	est.wav",     
          new SpeechAudioFormatInfo(32000, AudioBitsPerSample.Sixteen, AudioChannel.Mono));

If an answer is helpful, please click on or upvote which might help other community members reading this thread.

Share via

speakSsmlAsync returns invalid audio file

1 additional answer

Your answer