Automationscribe.com
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us
No Result
View All Result
Automation Scribe
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us
No Result
View All Result
Automationscribe.com
No Result
View All Result

Constructing a multi-agent voice assistant with Amazon Nova Sonic and Amazon Bedrock AgentCore

admin by admin
October 29, 2025
in Artificial Intelligence
0
Constructing a multi-agent voice assistant with Amazon Nova Sonic and Amazon Bedrock AgentCore
399
SHARES
2.3k
VIEWS
Share on FacebookShare on Twitter


Amazon Nova Sonic is a basis mannequin that creates pure, human-like speech-to-speech conversations for generative AI purposes, permitting customers to work together with AI via voice in real-time, with capabilities for understanding tone, enabling pure movement, and performing actions.

Multi-agent structure presents a modular, sturdy, and scalable design sample for production-level voice assistants. This weblog put up explores Amazon Nova Sonic voice agent purposes and demonstrates how they combine with Strands Brokers framework sub-agents whereas leveraging Amazon Bedrock AgentCore to create an efficient multi-agent system.

Why multi-agent structure?

Think about growing a monetary assistant utility answerable for consumer onboarding, info assortment, id verification, account inquiries, exception dealing with, and handing off to human brokers primarily based on predefined situations. As purposeful necessities increase, the voice agent continues so as to add new inquiry varieties. The system immediate grows monumental, and the underlying logic turns into more and more complicated, illustrates a persistent problem in software program growth: monolithic designs result in techniques which can be tough to keep up and improve.

Consider multi-agent structure as constructing a workforce of specialised AI assistants quite than counting on a single do-it-all helper. Similar to corporations divide duties throughout completely different departments, this strategy breaks complicated duties into smaller, manageable items. Every AI agent turns into an skilled in a selected space—whether or not that’s fact-checking, knowledge processing, or dealing with specialised requests. For the consumer, the expertise feels seamless: there’s no delay, no change in voice, and no seen handoff. The system features behind the scenes, directing every skilled agent to step in on the proper second.

Along with modular and sturdy advantages, multi-agent techniques supply benefits just like a microservice structure, a well-liked enterprise software program design sample, offering scalability, distribution and maintainability whereas permitting organizations to reuse agentic workflows already developed for his or her massive language mannequin (LLM)-powered purposes.

Pattern utility

On this weblog, we discuss with the Amazon Nova Sonic workshop multi-agent lab code, which makes use of the banking voice assistant as a pattern to exhibit easy methods to deploy specialised brokers on Amazon Bedrock AgentCore. It makes use of Nova Sonic as the voice interface layer and acts as an orchestrator to delegate detailed inquiries to sub-agents written in Strands Brokers hosted on AgentCore Runtime. You will discover the pattern supply code on the GitHub repo.

Within the banking voice agent pattern, the dialog movement begins with a greeting and accumulating the consumer’s title, after which it handles inquiries associated to banking or mortgages. We use three secondary degree brokers hosted on AgentCore to deal with specialised logic:

  • Authenticate sub-agent: Handles consumer authentication utilizing the account ID and different info
  • Banking sub-agent: Handles account steadiness checks, statements, and different banking-related inquiries
  • Mortgage sub-agent: Handles mortgage-related inquiries, together with refinancing, charges, and compensation choices

sonic-multi-agent-diargam

Sub-agents are self-contained, dealing with their very own logic akin to enter validation. As an example, the authentication agent validates account IDs and returns errors to Nova Sonic if wanted. This simplifies the reasoning logic in Nova Sonic whereas maintaining enterprise logic encapsulated, just like the software program engineering modular design patterns.

Combine Nova Sonic with AgentCore via software use occasions

Amazon Nova Sonic depends on software use to combine with agentic workflows. In the course of the Nova Sonic occasion lifecycle, you’ll be able to present software use configurations via the promptStart occasion, which is designed to provoke when Sonic receives particular sorts of enter.

For instance, within the following Sonic software configuration pattern, software use is configured to provoke occasions primarily based on Sonic’s built-in reasoning mannequin, which classifies the inquiry for routing to the banking sub-agents.

[
    {
        "toolSpec": {
            "name": "bankAgent",
            "description": `Use this tool whenever the customer asks about their **bank account balance** or **bank statement**.  
                    It should be triggered for queries such as:  
                    - "What’s my balance?"  
                    - "How much money do I have in my account?"  
                    - "Can I see my latest bank statement?"  
                    - "Show me my account summary."`,
            "inputSchema": {
                "json": JSON.stringify({
                "type": "object",
                "properties": {
                    "accountId": {
                        "type": "string",
                        "description": "This is a user input. It is the bank account Id which is a numeric number."
                    },
                    "query": {
                        "type": "string",
                        "description": "The inquiry to the bank agent such as check account balance, get statement etc."
                    }
                },
                "required": [
                    "accountId", "query"
                ]
                })
            }
        }
    }
]

When a consumer asks Nova Sonic a query akin to ‘What’s my account steadiness?’, Sonic sends a toolUse occasion to the consumer utility with the required toolName (for instance, bankAgent) outlined within the configuration. The appliance can then invoke the sub-agent hosted on AgentCore to deal with the banking logic and return the response to Sonic, which in flip generates an audio reply for the consumer.

{
  "occasion": {
    "toolUse": {
      "completionId": "UUID",
      "content material": "{"accountId":"one two three 4 5","question":"examine account steadiness"}",
      "contentId": "UUID",
      "promptName": "UUID",
      "function": "TOOL",
      "sessionId": "UUID",
      "toolName": "bankAgent",
      "toolUseId": "UUID"
    }
  }
}

Sub-agent on AgentCore

The next pattern showcases the banking sub-agent developed utilizing the Strands Brokers framework, particularly configured for deployment on Bedrock AgentCore. It leverages Nova Lite via Amazon Bedrock as its reasoning mannequin, offering efficient cognitive capabilities with minimal latency. The agent implementation contains a system immediate that defines its banking assistant duties, complemented by two specialised instruments: one for account steadiness inquiries and one other for financial institution assertion retrieval.

from strands import Agent, software
import json
from bedrock_agentcore.runtime import BedrockAgentCoreApp
from strands.fashions import BedrockModel
import re, argparse

app = BedrockAgentCoreApp()

@software
def get_account_balance(account_id) -> str:
    """Get account steadiness for given account Id

    Args:
        account_id: Checking account Id
    """

    # The precise implementation will retrieve info from a database API or one other backend service.
    
    return {"consequence": consequence}

@software
def get_statement(account_id: str, year_and_month: str) -> str:
    """Get account assertion for a given 12 months and month
    Args:
        account_id: Checking account Id
        year_and_month: Yr and month of the financial institution assertion. For instance: 2025_08 or August 2025
    """
    # The precise implementation will retrieve info from a database API or one other backend service.
    
    return {"consequence": consequence}


# Specify Bedrock LLM for the Agent
bedrock_model = BedrockModel(
    model_id="amazon.nova-lite-v1:0",
)
# System immediate
system_prompt=""'
You're a banking agent. You'll obtain requests that embody:  
- `account_id`  
- `question` (the inquiry sort, akin to **steadiness** or **assertion**, plus any further particulars like month).  

## Directions
1. Use the supplied `account_id` and `question` to name the instruments.  
2. The software will return a JSON response.  
3. Summarize the lead to 2–3 sentences.  
   - For a **steadiness inquiry**, give the account steadiness with forex and date.  
   - For a **assertion inquiry**, present opening steadiness, closing steadiness, and variety of transactions.  
4. Don't return uncooked JSON. All the time reply in pure language.  
'''

# Create an agent with instruments, LLM, and system immediate
agent = Agent(
    instruments=[ get_account_balance, get_statement], 
    mannequin=bedrock_model,
    system_prompt=system_prompt
)

@app.entrypoint
def banking_agent(payload):
    response = agent(json.dumps(payload))
    return response.message['content'][0]['text']
    
if __name__ == "__main__":
    app.run()

Greatest practices for voice-based multi-agent techniques

Multi-agent structure offers distinctive flexibility and a modular design strategy, permitting builders to construction voice assistants effectively and doubtlessly reuse present specialised agent workflows. When implementing voice-first experiences, there are necessary greatest practices to contemplate that handle the distinctive challenges of this modality.

  • Stability flexibility and latency: Though the flexibility to invoke sub-agents utilizing Nova Sonic software use occasions creates highly effective capabilities, it might introduce further latency to voice responses. For the use instances that require a synchronized expertise, every agent handoff represents a possible delay level within the interplay movement. Due to this fact, it’s necessary to design with response time in thoughts.
  • Optimize mannequin choice for sub-agents: Beginning with smaller, extra environment friendly fashions like Nova Lite for sub-agents can considerably cut back latency whereas nonetheless dealing with specialised duties successfully. Reserve bigger, extra succesful fashions for complicated reasoning or when refined pure language understanding is important.
  • Craft voice-optimized responses: Voice assistants carry out greatest with concise, centered responses that may be adopted by further particulars when wanted. This strategy not solely improves latency but additionally creates a extra pure conversational movement that aligns with human expectations for verbal communication.

Think about stateless vs. stateful sub-agent design

Stateless sub-agents deal with every request independently, with out retaining reminiscence of previous interactions or session-level states. They’re easy to implement, simple to scale, and work nicely for simple, one-off duties. Nevertheless, they can’t present context-aware responses except exterior state administration is launched.

Stateful sub-agents, however, preserve reminiscence throughout interactions to help context-aware responses and session-level states. This allows extra personalised and cohesive consumer experiences, however comes with added complexity and useful resource necessities. They’re greatest suited to situations involving multi-turn interactions and consumer or session-level context caching.

Conclusion

Multi-agent architectures unlock flexibility, scalability, and accuracy for complicated AI-driven workflows. By combining the Nova Sonic conversational capabilities with the orchestration energy of Bedrock AgentCore, you’ll be able to construct clever, specialised brokers that work collectively seamlessly. Should you’re exploring methods to reinforce your AI purposes, multi-agent patterns with Nova Sonic and AgentCore are a strong strategy price testing.

Study extra about Amazon Nova Sonic by visiting the Consumer Information, constructing your utility with the pattern purposes, and exploring the Nova Sonic workshop to get began. You too can discuss with the technical report and mannequin card for added benchmarks.


In regards to the authors

Author - Lana Zhang Lana Zhang is a Senior Specialist Options Architect for Generative AI at AWS throughout the Worldwide Specialist Group. She makes a speciality of AI/ML, with a give attention to use instances akin to AI voice assistants and multimodal understanding. She works carefully with clients throughout various industries, together with media and leisure, gaming, sports activities, promoting, monetary companies, and healthcare, to assist them remodel their enterprise options via AI.

Tags: AgentCoreAmazonAssistantBedrockBuildingMultiAgentNovaSonicVoice
Previous Post

Deep Reinforcement Studying: 0 to 100

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular News

  • How Aviva constructed a scalable, safe, and dependable MLOps platform utilizing Amazon SageMaker

    How Aviva constructed a scalable, safe, and dependable MLOps platform utilizing Amazon SageMaker

    402 shares
    Share 161 Tweet 101
  • Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Growth Assist Program

    402 shares
    Share 161 Tweet 101
  • Diffusion Mannequin from Scratch in Pytorch | by Nicholas DiSalvo | Jul, 2024

    402 shares
    Share 161 Tweet 101
  • The Journey from Jupyter to Programmer: A Fast-Begin Information

    401 shares
    Share 160 Tweet 100
  • Speed up edge AI improvement with SiMa.ai Edgematic with a seamless AWS integration

    401 shares
    Share 160 Tweet 100

About Us

Automation Scribe is your go-to site for easy-to-understand Artificial Intelligence (AI) articles. Discover insights on AI tools, AI Scribe, and more. Stay updated with the latest advancements in AI technology. Dive into the world of automation with simplified explanations and informative content. Visit us today!

Category

  • AI Scribe
  • AI Tools
  • Artificial Intelligence

Recent Posts

  • Constructing a multi-agent voice assistant with Amazon Nova Sonic and Amazon Bedrock AgentCore
  • Deep Reinforcement Studying: 0 to 100
  • Internet hosting NVIDIA speech NIM fashions on Amazon SageMaker AI: Parakeet ASR
  • Home
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms & Conditions

© 2024 automationscribe.com. All rights reserved.

No Result
View All Result
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us

© 2024 automationscribe.com. All rights reserved.