This publish was cowritten by Rishi Srivastava and Scott Reynolds from Clarus Care.
Many healthcare practices as we speak wrestle with managing excessive volumes of affected person calls effectively. From appointment scheduling and prescription refills to billing inquiries and pressing medical issues, practices face the problem of offering well timed responses whereas sustaining high quality affected person care. Conventional telephone programs typically result in lengthy maintain instances, pissed off sufferers, and overwhelmed employees who manually course of and prioritize tons of of calls every day. These communication bottlenecks not solely impression affected person satisfaction however can even delay crucial care coordination.
On this publish, we illustrate how Clarus Care, a healthcare contact middle options supplier, labored with the AWS Generative AI Innovation Middle (GenAIIC) workforce to develop a generative AI-powered contact middle prototype. This resolution permits conversational interplay and multi-intent decision via an automatic voicebot and chat interface. It additionally incorporates a scalable service mannequin to help development, human switch capabilities–when requested or for pressing instances–and an analytics pipeline for efficiency insights.
Clarus Care is a healthcare know-how firm that helps medical practices handle affected person communication via an AI-powered name administration system. By robotically transcribing, prioritizing, and routing affected person messages, Clarus improves response instances, reduces employees workload, and minimizes maintain instances. Clarus is the quickest rising healthcare name administration firm, serving over 16,000 customers throughout 40+ specialties. The corporate handles 15 million affected person calls yearly and maintains a 99% consumer retention fee.
Use case overview
Clarus is embarking on an revolutionary journey to rework their affected person communication system from a standard menu-driven Interactive Voice Response (IVR) to a extra pure, conversational expertise. The corporate goals to revolutionize how sufferers work together with healthcare suppliers by making a generative AI-powered contact middle able to understanding and addressing a number of affected person intents in a single interplay. Beforehand, sufferers navigated via inflexible menu choices to go away messages, that are then transcribed and processed. This method, whereas purposeful, limits the system’s potential to deal with complicated affected person wants effectively. Recognizing the necessity for a extra intuitive and versatile resolution, Clarus collaborated with the GenAIIC to develop an AI-powered contact middle that may comprehend pure language dialog, handle a number of intents, and supply a seamless expertise throughout each voice and internet chat interfaces. Key success standards for the venture had been:
- A pure language voice interface able to understanding and processing a number of affected person intents reminiscent of billing questions, scheduling, and prescription refills in a single name
- <3 second latency for backend processing and response to the person
- The flexibility to transcribe, document, and analyze name info
- Sensible switch capabilities for pressing calls or when sufferers request to talk instantly with suppliers
- Help for each voice calls and internet chat interfaces to accommodate varied affected person preferences
- A scalable basis to help Clarus’s rising buyer base and increasing healthcare facility community
- Excessive availability with a 99.99% SLA requirement to facilitate dependable affected person communication
Resolution overview & structure
The GenAIIC workforce collaborated with Clarus to create a generative AI-powered contact middle utilizing Amazon Join and Amazon Lex, built-in with Amazon Nova and Anthropic’s Claude 3.5 Sonnet basis fashions via Amazon Bedrock. Join was chosen because the core system on account of its potential to take care of 99.99% availability whereas offering complete contact middle capabilities throughout voice and chat channels.
The mannequin flexibility of Bedrock is central to the system, permitting task-specific mannequin choice based mostly on accuracy and latency. Claude 3.5 Sonnet was used for its high-quality pure language understanding capabilities, and Nova fashions provided optimization for low latency and comparable pure language understanding and technology capabilities. The next diagram illustrates the answer structure for the principle contact middle resolution:

The workflow consists of the next high-level steps:
- A affected person initiates contact via both a telephone name or internet chat interface.
- Join processes the preliminary contact and routes it via a configured contact circulate.
- Lex handles transcription and maintains dialog state.
- An AWS Lambda achievement perform processes the dialog utilizing Claude 3.5 Sonnet and Nova fashions via Bedrock to:
- Classify urgency and intents
- Extract required info
- Generate pure responses
- Handle appointment scheduling when relevant
The fashions used for every particular perform are described in resolution element sections.
- Sensible transfers to employees are initiated when pressing instances are detected or when sufferers request to talk with suppliers.
- Dialog knowledge is processed via an analytics pipeline for monitoring and reporting (described later on this publish).
Some challenges the workforce tackled in the course of the growth course of included:
- Formatting the contact middle name circulate and repair mannequin in a method that’s interchangeable for various prospects, with minimal code and configuration adjustments
- Managing latency necessities for a pure dialog expertise
- Transcription and understanding of affected person names
Along with voice calls, the workforce developed an internet interface utilizing Amazon CloudFront and Amazon S3 Static Web site Internet hosting that demonstrates the system’s multichannel capabilities. This interface exhibits how sufferers can have interaction in AI-powered conversations via a chat widget, offering the identical degree of service and performance as voice calls. Whereas the online interface demo makes use of the identical contact circulate because the voice name, it may be additional custom-made for chat-specific language.

The workforce additionally constructed an analytics pipeline that processes dialog logs to offer helpful insights into system efficiency and affected person interactions. A customizable dashboard provides a user-friendly interface for visualizing this knowledge, permitting each technical and non-technical employees to realize actionable insights from affected person communications. The analytics pipeline and dashboard had been constructed utilizing a beforehand printed reusable GenAI contact middle asset.

Dialog dealing with particulars
The answer employs a classy dialog administration system that orchestrates pure affected person interactions via the multi-model capabilities of Bedrock and thoroughly designed immediate layering. On the coronary heart of this method is the power of Bedrock to offer entry to a number of basis fashions, enabling the workforce to pick the optimum mannequin for every particular activity based mostly on accuracy, value, and latency necessities. The circulate of the dialog administration system is proven within the following picture; NLU stands for pure language understanding.

The dialog circulate begins with a greeting and urgency evaluation. When a affected person calls, the system instantly evaluates whether or not the state of affairs requires pressing consideration utilizing Bedrock APIs. This primary step makes certain that emergency instances are rapidly recognized and routed appropriately. The system makes use of a targeted immediate that analyzes the affected person’s preliminary assertion towards a predefined record of pressing intent classes, returning both “pressing” or “non_urgent” to information subsequent dealing with.
Following this, the system strikes to intent detection. A key innovation right here is the system’s potential to course of a number of intents inside a single interplay. Fairly than forcing sufferers via inflexible menu timber, the system can leverage highly effective language fashions to know when a affected person mentions each a prescription refill and a billing query, queuing these intents for sequential processing whereas sustaining pure dialog circulate. Throughout this extraction, we guarantee that the intent and the quote from the person enter are each extracted. This produces two outcomes:
- Built-in mannequin reasoning to guarantee that the right intent is extracted
- Dialog historical past reference that led to intent extraction, so the identical intent will not be extracted twice until explicitly requested for
As soon as the system begins processing intents sequentially, it begins prompting the person for knowledge required to service the intent at hand. This occurs in two interdependent levels:
- Checking for lacking info fields and producing a pure language immediate to ask the person for info
- Parsing person utterances to research and extract collected fields and the fields which might be nonetheless lacking
These two steps occur in a loop till the required info is collected. The system additionally considers provider-specific providers at this stage, the place fields required per supplier is collected. The answer robotically matches supplier names talked about by sufferers to the right supplier within the system. This handles variations like “Dr. Smith” matching to “Dr. Jennifer Smith” or “Jenny Smith,” eradicating the inflexible title matching or extension necessities of conventional IVR programs. The answer additionally contains good handoff capabilities. When the system wants to find out if a affected person ought to converse with a selected supplier, it analyses the dialog context to think about urgency and routing wants for the expressed intent. This course of preserves the dialog context and picked up info, facilitating a seamless expertise when human intervention is requested. All through the dialog, the system maintains complete state monitoring via Lex session attributes whereas the pure language processing happens via Bedrock mannequin invocations. These attributes function the dialog’s reminiscence, storing all the pieces from the person’s collected info and dialog historical past to detected intents and picked up info. This state administration permits the system to take care of context throughout a number of Bedrock API calls, making a extra pure dialogue circulate.
Intent administration
The intent administration system was designed via a hierarchical service mannequin construction that displays how sufferers naturally categorical their wants. To traverse this hierarchical service mannequin, the person inputs are parsed utilizing pure language understanding, that are dealt with via Bedrock API calls.
The hierarchical service mannequin organizes intents into three main ranges:
- Urgency Degree: Separating pressing from non-urgent providers facilitates applicable dealing with and routing.
- Service Degree: Grouping associated providers like appointments, prescriptions, and billing creates logical classes.
- Supplier-Particular Degree: Additional granularity accommodates provider-specific necessities and sub-services
This construction permits the system to effectively navigate via attainable intents whereas sustaining flexibility for personalisation throughout completely different healthcare services. Every intent within the mannequin contains customized directions that may be dynamically injected into Bedrock prompts, permitting for extremely configurable habits with out code adjustments. The intent extraction course of leverages the superior language understanding capabilities of Bedrock via a immediate that instructs the mannequin to establish the intents current in a affected person’s pure language enter. The immediate contains complete directions about what constitutes a brand new intent, the whole record of attainable intents, and formatting necessities for the response. Fairly than forcing classification right into a single intent, we intend to detect a number of wants expressed concurrently. As soon as intents are recognized, they’re added to a processing queue. The system then works via every intent sequentially, making extra mannequin calls in a number of layers to gather required info via pure dialog. To optimize for each high quality and latency, the answer leverages the mannequin choice flexibility of Bedrock for varied dialog duties in a similar way:
- Intent extraction makes use of Anthropic’s Claude 3.5 Sonnet via Bedrock for detailed evaluation that may establish a number of intents from pure language, ensuring sufferers don’t have to repeat info.
- Info assortment employs a sooner mannequin, Amazon Nova Professional, via Bedrock for structured knowledge extraction whereas sustaining conversational tone.
- Response technology makes use of a smaller mannequin, Nova Lite, via Bedrock to create low-latency, pure, and empathetic responses based mostly on the dialog state.
Doing this helps in ensuring that the answer can:
- Preserve conversational tone and empathy
- Ask for less than the precise lacking info
- Acknowledge info already supplied
- Deal with particular instances like spelling out names
The whole intent administration pipeline advantages from the Bedrock unified Converse API, which gives:
- Constant interface throughout the mannequin calls, simplifying growth and upkeep
- Mannequin model management facilitating steady habits throughout deployments
- Future-proof structure permitting seamless adoption of recent fashions as they turn out to be accessible
By implementing this hierarchical intent administration system, Clarus can provide sufferers a extra pure and environment friendly communication expertise whereas sustaining the construction wanted for correct routing and knowledge assortment. The pliability of mixing the multi-model capabilities of Bedrock with a configurable service mannequin permits for simple customization per healthcare facility whereas retaining the core dialog logic constant and maintainable. As new fashions turn out to be accessible in Bedrock, the system could be up to date to leverage improved capabilities with out main architectural adjustments, facilitating long-term scalability and efficiency optimization.
Scheduling
The scheduling element of the answer is dealt with in a separate, purpose-built module. If an ‘appointment’ intent is detected in the principle handler, processing is handed to the scheduling module. The module operates as a state machine consisting of dialog states and subsequent steps. The general circulate of the scheduling system is proven beneath:
There are three principal LLM prompts used within the scheduling circulate:
- Extract time preferences (Nova Lite is used for low latency and use desire understanding)
- Decide if person is confirming or denying time (Nova Micro is used for low latency on a easy activity)
- Generate a pure response based mostly on a subsequent step (Nova Lite is used for low latency and response technology)
The attainable steps are:
- Let the person know you’ll escalate to the workplace
- Finish a dialog with reserving affirmation
System Extensions
Sooner or later, Clarus can combine the contact middle’s voicebot with Amazon Nova Sonic. Nova Sonic is a speech-to-speech LLM that delivers real-time, human-like voice conversations with main worth efficiency and low latency. Nova Sonic is now instantly built-in with Join.
Bedrock has a number of extra providers which assist with scaling the answer and deploying it to manufacturing, together with:
Conclusion
On this publish, we demonstrated how the GenAIIC workforce collaborated with Clarus Care to develop a generative AI-powered healthcare contact middle utilizing Amazon Join, Amazon Lex, and Amazon Bedrock. The answer showcases a conversational voice interface able to dealing with a number of affected person intents, managing appointment scheduling, and offering good switch capabilities. By leveraging Amazon Nova and Anthropic’s Claude 3.5 Sonnet language fashions and AWS providers, the system achieves excessive availability whereas providing a extra intuitive and environment friendly affected person communication expertise.The answer additionally incorporates an analytics pipeline for monitoring name high quality and metrics, in addition to an internet interface demonstrating multichannel help. The answer’s structure gives a scalable basis that may adapt to Clarus Care’s rising buyer base and future service choices.The transition from a standard menu-driven IVR to an AI-powered conversational interface permits Clarus to assist improve affected person expertise, enhance automation capabilities, and streamline healthcare communications. As they transfer in direction of implementation, this resolution will empower Clarus Care to fulfill the evolving wants of each sufferers and healthcare suppliers in an more and more digital healthcare panorama.
If you wish to implement an analogous resolution on your use case, take into account the weblog Deploy generative AI brokers in your contact middle for voice and chat utilizing Amazon Join, Amazon Lex, and Amazon Bedrock Data Bases for the infrastructure setup.
Concerning the authors
Rishi Srivastava is the VP of Engineering at Clarus Care. He’s a seasoned trade chief with over 20 years in enterprise software program engineering, specializing in design of multi-tenant Cloud based mostly SaaS structure and, conversational AI agentic options associated to affected person engagement. Beforehand, he labored in monetary providers and quantitative finance, constructing latent issue fashions for stylish portfolio analytics to drive data-informed funding methods.
Scott Reynolds is the VP of Product at Clarus Care, a healthcare SaaS communications and AI-powered affected person engagement platform. He’s spent over 25 years within the know-how and software program market creating safe, interoperable platforms that streamline scientific and operational workflows. He has based a number of startups and holds a U.S. patent for patient-centric communication know-how.
Brian Halperin
joined AWS in 2024 as a GenAI Strategist within the Generative AI Innovation Middle, the place he helps enterprise prospects unlock transformative enterprise worth via synthetic intelligence. With over 9 years of expertise spanning enterprise AI implementation and digital know-how transformation, he brings a confirmed monitor document of translating complicated AI capabilities into measurable enterprise outcomes. Brian beforehand served as Vice President on an working workforce at a world different funding agency, main AI initiatives throughout portfolio firms.
Brian Yost is a Principal Deep Studying Architect within the AWS Generative AI Innovation Middle. He makes a speciality of making use of agentic AI capabilities in buyer help eventualities, together with contact middle options.
Parth Patwa is a Information Scientist within the Generative AI Innovation Middle at Amazon Internet Providers. He has co-authored analysis papers at high AI/ML venues and has 1500+ citations.
Smita Bailur is a Senior Utilized Scientist on the AWS Generative AI Innovation Middle, the place she brings over 10 years of experience in conventional AI/ML, deep studying, and generative AI to assist prospects unlock transformative options. She holds a masters diploma in Electrical Engineering from the College of Pennsylvania.
Shreya Mohanty Shreya Mohanty is a Strategist within the AWS Generative AI Innovation Middle the place she makes a speciality of mannequin customization and optimization. Beforehand she was a Deep Studying Architect, targeted on constructing GenAI options for purchasers. She makes use of her cross-functional background to translate buyer objectives into tangible outcomes and measurable impression.
Yingwei Yu Yingwei Yu is an Utilized Science Supervisor on the Generative AI Innovation Middle (GenAIIC) at Amazon Internet Providers (AWS), based mostly in Houston, Texas. With expertise in utilized machine studying and generative AI, Yu leads the event of revolutionary options throughout varied industries. He has a number of patents and peer-reviewed publications in skilled conferences. Yingwei earned his Ph.D. in Pc Science from Texas A&M College – School Station.


