Hi @hexarrior,
Thank you for reaching out to Microsoft Q&A forum!
To effectively train your Azure custom speech-to-text model, especially for recognizing industry-specific jargon, please follow:
- Use the exact terminology as it is commonly written in the industry, e.g., 'LinkedList', 'HashMap'. This will help the model recognize these terms as single entities, which is crucial for technical jargon.
- Incorporate jargon within sentences to provide context. For instance, use "Can you explain the usage of Dubbo framework?" instead of just "Dubbo". This helps the model learn the contextual usage and pronunciation of the terms. Provide diverse sentences that use the jargon in different contexts. This variation improves the model's ability to generalize and accurately recognize jargon in various scenarios. Please look into this info: Plain-text data for training.
Hope this helps. Do let us know if you any further queries.
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful.