Amazon Nova Sonic

State-of-the-art speech-to-speech model for conversational AI

What is Amazon Nova Sonic?

Amazon Nova Sonic is a state-of-the-art speech-to-speech model that delivers real-time, human-like voice conversations with industry-leading price performance and low latency. Available in Amazon Bedrock via the bidirectional streaming API, the model understands streaming speech in various speaking styles and generates expressive speech responses that dynamically adapt to the prosody of input speech.

Amazon Nova Sonic supports expressive voices, including both masculine-sounding and feminine-sounding voices, in different English accents including American and British. The model can be utilized across a wide range of applications, including customer support call automation, outbound marketing, voice-enabled personal assistants and agents, and interactive education and language learning.

Key capabilities

Amazon Nova Sonic delivers industry-leading speed and price performance.

Amazon Nova Sonic enables knowledge grounding with enterprise data using Retrieval-Augmented Generation (RAG).

Amazon Nova Sonic supports functional calling, enabling seamless interaction with external services and efficient agentic task automation.

Amazon Nova Sonic is accessed via bidirectional streaming API in Amazon Bedrock. This API enables two-way streaming of content, which is critical for low latency interactive communication between a human user and the AI model.

Built-in protections including content moderation and watermarking.

See Amazon Nova Sonic in action

Language learning for non-native speaker

Voice-enabled business assistant

Customer service call automation

  • Amazon Nova Sonic

Discover real-world use cases

Getting Started with Amazon Nova Sonic

This video provides a step-by-step tutorial on how to use Amazon Nova Sonic in Amazon Bedrock to build your own voice-enabled bot.