Interactive language learning with pronunciation assessment

Important

Some of the features described in this article might only be available in preview. This preview is provided without a service-level agreement, and we don't recommend it for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.

Learning a new language is an exciting journey. Interactive language learning can make your learning experience more engaging and effective. By using pronunciation assessment effectively, you get instant feedback on pronunciation accuracy, fluency, prosody, grammar, and vocabulary through your interactive language learning experience.

Note

The language learning feature currently supports only en-US. For available regions, refer to available regions for pronunciation assessment. If you turn on the Avatar button to interact with a text to speech avatar, refer to the available regions for text to speech avatar.

If you have any feedback on the language learning feature, fill out this form.

Common use cases

Here are some common scenarios where you can make use of the language learning feature to improve your language skills:

  • Assess pronunciations: Practice your pronunciation and receive scores with detailed feedback to identify areas for improvement.
  • Improve speaking skills: Engage in conversations with a native speaker (or a simulated one) to enhance your speaking skills and build confidence.
  • Learn new vocabulary: Expand your vocabulary and work on advanced pronunciation by interacting with AI-driven language models.

Getting started

In this section, you can learn how to immerse yourself in dynamic conversations with a GPT-powered voice assistant to enhance your speaking skills.

To get started with language learning through chatting, follow these steps:

  1. Go to Language learning in the Speech Studio.

  2. Decide on a scenario or context in which you'd like to interact with the voice assistant. This can be a casual conversation, a specific topic, or a language learning exercise.

    Screenshot of choosing chatting scenario to interact with the voice assistant.

    If you want to interact with an avatar, toggle the Avatar button in the upper right corner to On.

  3. Press the microphone icon to start speaking naturally, as if you were talking to a real person.

    Screenshot of selecting the microphone icon to interact with the voice assistant.

    For accurate vocabulary and grammar scores, speak at least 3 sentences before assessment.

  4. Press the stop button or Assess my response button to finish speaking. This action will trigger the assessment process.

    Screenshot of selecting the stop button to assess your response.

  5. Wait for a moment, and you can get a detailed assessment report.

    Screenshot of a detailed assessment report.

    The assessment report may include feedback on:

    • Accuracy: Accuracy indicates how closely the phonemes match a native speaker's pronunciation.
    • Fluency: Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words.
    • Prosody: Prosody indicates the nature of the given speech, including stress, intonation, speaking speed, and rhythm.
    • Grammar: Grammar considers lexical accuracy, grammatical accuracy, and diversity of sentence structures, providing a more comprehensive evaluation of language proficiency.
    • Vocabulary: Vocabulary evaluates the speaker's effective usage of words and their appropriateness within the given context to express ideas accurately, as well as the level of lexical complexity.

    When recording your speech for pronunciation assessment, ensure your recording time falls within the recommended range of 20 seconds (equivalent to more than 50 words) to 10 minutes per session. This time range is optimal for evaluating the content of your speech accurately. Whether you have a short and focused conversation or a more extended dialogue, as long as the total recorded time falls within this range, you'll receive comprehensive feedback on your pronunciation, fluency, and content.

    To get feedback on how to improve for each aspect of the assessment, select Get feedback on how to improve.

    Screenshot of selecting the button to get feedback on how to improve for each aspect of the assessment.

    When you have completed the conversation, you can also download your chat audio. You can clear the current conversation by selecting Clear chat.

Next steps