Automationscribe.com
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us
No Result
View All Result
Automation Scribe
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us
No Result
View All Result
Automationscribe.com
No Result
View All Result

Run Generative AI inference with Amazon Bedrock in Asia Pacific (New Zealand)

admin by admin
March 27, 2026
in Artificial Intelligence
0
Run Generative AI inference with Amazon Bedrock in Asia Pacific (New Zealand)
399
SHARES
2.3k
VIEWS
Share on FacebookShare on Twitter


Kia ora!

Prospects in New Zealand have been asking for entry to basis fashions (FMs) on Amazon Bedrock from their native AWS Area.

At present, we’re excited to announce that Amazon Bedrock is now obtainable within the Asia Pacific (New Zealand) Area (ap-southeast-6). Prospects in New Zealand can now entry Anthropic Claude fashions (Claude Opus 4.5, Opus 4.6, Sonnet 4.5, Sonnet 4.6, and Haiku 4.5) and Amazon (Nova 2 Lite) fashions instantly within the Auckland Area with cross area inference.

On this submit, we discover how cross-Area inference works from the New Zealand Area, the fashions obtainable via geographic and world routing, and easy methods to get began together with your first API name. We cowl three key areas:

  • How Amazon Bedrock in ap-southeast-6 makes use of cross-Area inference to offer you entry to FMs, with the ANZ geographic routing configuration throughout Auckland, Sydney, and Melbourne
  • Supported fashions, IAM permissions, and making your first inference name from the Auckland Area
  • Quota administration, safety issues, and selecting between geographic and world cross-Area inference to your workloads

Understanding cross-Area inference

Cross-Area inference is an Amazon Bedrock functionality that distributes inference processing throughout a number of AWS Areas that can assist you obtain larger throughput at scale.

Once you invoke a cross-Area inference profile, Amazon Bedrock routes your request from the supply Area (the place you provoke the API name) to a vacation spot Area (the place inference processing happens). All information transmitted throughout cross-Area operations stays on the AWS community and doesn’t traverse the general public web, and information is encrypted in transit between AWS Areas. All cross-Area inference requests are logged in AWS CloudTrail in your supply Area. In the event you configure mannequin invocation logging, logs are printed to Amazon CloudWatch Logs or Amazon Easy Storage Service (Amazon S3) in the identical Area.

Amazon Bedrock supplies two sorts of cross-Area inference profiles:

  • Geographic cross-Area inference – Routes requests inside a selected geographic boundary. For instance, with AU profile, and Auckland as your supply Area, requests path to Auckland, Sydney, and Melbourne. Designed for organizations with information residency necessities that want inference processing to remain inside Australia and New Zealand.
  • World cross-Area inference – Routes requests to supported business AWS Areas worldwide, offering the best obtainable throughput. Designed for organizations with out strict information residency necessities.

What’s new: New Zealand as a supply Area for cross-Area inference

With this launch, Auckland (ap-southeast-6) turns into a brand new supply Area for each AU geographic and world cross-Area inference on Amazon Bedrock. This implies which you can now make Amazon Bedrock API calls from the New Zealand Area, and cross-Area inference routes your requests to vacation spot Areas the place the FMs course of inference.

AU geographic cross-Area inference configuration

The AU cross-Area profile now spans three Areas throughout Australia and New Zealand. The next desk particulars the supply and vacation spot Area routing.

Supply Area Vacation spot Areas Description
Auckland (ap-southeast-6) ap-southeast-6, ap-southeast-2, ap-southeast-4 New – Requests from Auckland could be routed to Sydney, Melbourne, or Auckland
Sydney (ap-southeast-2) ap-southeast-2, ap-southeast-4 Requests from Sydney could be routed to Sydney or Melbourne
Melbourne (ap-southeast-4) ap-southeast-2, ap-southeast-4 Requests from Melbourne could be routed to Sydney or Melbourne

There are two essential particulars to notice:

  • The AU cross-Area inference profiles for Sydney and Melbourne proceed to route between Sydney and Melbourne solely. The addition of Auckland doesn’t change the vacation spot Areas for present Australian supply Area configurations.
  • Requests originating from Auckland could be served regionally or routed to both Australian Area, offering three vacation spot Areas for capability distribution.

World cross-Area inference from New Zealand

For organizations with out strict information residency necessities, world cross-Area inference from the Auckland Area supplies entry to inference capability throughout all supported AWS business Areas worldwide. World cross-Area inference delivers two key benefits:

  • Greater throughput — Clever routing distributes site visitors dynamically throughout all supported business Areas, lowering the probability of throttling throughout site visitors spikes
  • Constructed-in resilience — Requests are robotically routed to Areas with obtainable capability, serving to your functions preserve operational continuity as demand patterns shift

Getting began

Supported fashions and inference profile IDs

Cross-Area inference from the New Zealand Area helps basis fashions from a number of suppliers throughout each AU geographic and world cross-Area inference profiles. The next desk exhibits examples of the most recent fashions obtainable at launch.

Cross-Area inference sort Instance fashions
AU geographic cross-Area inference Anthropic Claude Opus 4.6, Claude Sonnet 4.6, Claude Sonnet 4.5, Claude Haiku 4.5
World cross-Area inference Anthropic Claude Opus 4.6, Claude Sonnet 4.6, Claude Opus 4.5, Claude Sonnet 4.5, Claude Haiku 4.5

AU geographic cross-Area inference at present helps Anthropic Claude fashions, preserving inference processing inside the ANZ geography. World cross-Area inference supplies entry to a broader set of basis fashions from a number of suppliers. To make use of a cross-Area inference profile, substitute the foundational mannequin ID with the geographic (au.) or world (world.) prefix — for instance, anthropic.claude-sonnet-4-6 turns into au.anthropic.claude-sonnet-4-6 or world.anthropic.claude-sonnet-4-6.

For the whole and up-to-date record of supported fashions and inference profile IDs, check with Supported Areas and fashions for inference profiles.

Cross-Area inference profiles work with the InvokeModel, InvokeModelWithResponseStream, Converse, and ConverseStream APIs. The Converse API supplies a constant request and response format throughout totally different basis fashions, making it simple to change between fashions with out rewriting integration code.

Configure IAM permissions

To invoke basis fashions via AU geographic cross-Area inference from the Auckland Area, your AWS Identification and Entry Administration (IAM) coverage wants two statements:

  • Granting entry to the inference profile within the supply Area
  • Granting entry to the muse mannequin in all vacation spot Areas listed within the AU cross-Area inference profile.

The next IAM coverage instance grants entry to invoke Anthropic Claude Sonnet 4.6 via AU geographic cross-Area inference from Auckland. Change together with your AWS account ID.

{ 
     "Model": "2012-10-17", 
     "Assertion": [ 
         { 
             "Sid": "AllowAuCrisInferenceProfile", 
             "Effect": "Allow", 
             "Action": [ 
                 "bedrock:InvokeModel", 
                 "bedrock:InvokeModelWithResponseStream" 
             ], 
             "Useful resource": "arn:aws:bedrock:ap-southeast-6::inference-profile/au.anthropic.claude-sonnet-4-6" 
         }, 
         { 
             "Sid": "AllowFoundationModelViaAuCris", 
             "Impact": "Enable", 
             "Motion": [ 
                 "bedrock:InvokeModel", 
                 "bedrock:InvokeModelWithResponseStream" 
             ], 
             "Useful resource": [ 
                 "arn:aws:bedrock:ap-southeast-2::foundation-model/anthropic.claude-sonnet-4-6", 
                 "arn:aws:bedrock:ap-southeast-4::foundation-model/anthropic.claude-sonnet-4-6", 
                 "arn:aws:bedrock:ap-southeast-6::foundation-model/anthropic.claude-sonnet-4-6" 
             ], 
             "Situation": { 
                 "StringLike": { 
                     "bedrock:InferenceProfileArn": "arn:aws:bedrock:ap-southeast-6::inference-profile/au.anthropic.claude-sonnet-4-6" 
                 } 
             } 
         } 
     ] 
} 

The primary assertion permits invoking the AU inference profile from the Auckland supply Area. The second assertion permits the FM to be invoked within the three vacation spot Areas, however solely when the request is routed via the AU inference profile. This follows the precept of least privilege by stopping direct mannequin invocation in these Areas.

The identical two-statement sample applies to any mannequin within the AU cross-Area inference profile—substitute the mannequin ID within the useful resource ARNs. For world cross-Area inference IAM insurance policies, service management insurance policies (SCP) configurations, and superior safety patterns, check with Securing Amazon Bedrock cross-Area inference: Geographic and world.

Safety and compliance issues

Cross-Area inference is designed with safety at its core. All requests journey solely over the AWS World Community with end-to-end encryption, and your information at relaxation stays within the supply Area.

For organizations utilizing SCPs to limit entry to particular AWS Areas, notice the next when calling from the Auckland supply Area (ap-southeast-6):

  • AU geographic cross-Area inference requires permitting ap-southeast-2, ap-southeast-4, and ap-southeast-6 for Amazon Bedrock actions in your SCPs, as a result of Auckland’s AU profile routes to all three ANZ Areas.
  • World cross-Area inference moreover requires permitting unspecified as a Area worth for Amazon Bedrock actions, as a result of vacation spot Areas are decided dynamically.

The next instance SCP restricts companies to the Auckland Area, with exceptions for Amazon Bedrock and world companies like IAM. It limits Amazon Bedrock to the three ANZ Areas, and requires that Amazon Bedrock entry in Sydney and Melbourne undergo cross-Area inference profiles moderately than direct mannequin invocation:

{ 
     "Model": "2012-10-17", 
     "Assertion": [ 
         { 
             "Sid": "DenyNonBedrockServicesOutsideAuckland", 
             "Effect": "Deny", 
             "NotAction": [ 
                 "bedrock:*", 
                 "iam:*", 
                 "organizations:*", 
                 "support:*" 
             ], 
             "Useful resource": "*", 
             "Situation": { 
                 "StringNotEquals": { 
                     "aws:RequestedRegion": ["ap-southeast-6"] 
                 } 
             } 
         }, 
         { 
             "Sid": "DenyBedrockOutsideANZRegions", 
             "Impact": "Deny", 
             "Motion": "bedrock:*", 
             "Useful resource": "*", 
             "Situation": { 
                 "StringNotEquals": { 
                     "aws:RequestedRegion": [ 
                         "ap-southeast-2", 
                         "ap-southeast-4", 
                         "ap-southeast-6" 
                     ] 
                 } 
             } 
         }, 
         { 
             "Sid": "DenyDirectBedrockInDestinationRegions", 
             "Impact": "Deny", 
             "Motion": "bedrock:*", 
             "Useful resource": "*", 
             "Situation": { 
                 "StringEquals": { 
                     "aws:RequestedRegion": [ 
                         "ap-southeast-2", 
                         "ap-southeast-4" 
                     ] 
                 }, 
                 "Null": { 
                     "bedrock:InferenceProfileArn": "true" 
                 } 
             } 
         } 
     ] 
} 

Within the earlier coverage:

  • The primary assertion restricts all companies to the Auckland Area, apart from Amazon Bedrock and world companies equivalent to IAM, AWS Organizations, and AWS Help that function independently of Area restrictions.
  • The second assertion restricts Amazon Bedrock to the three ANZ Areas, which is critical for AU cross-Area inference to route requests from Auckland to Sydney and Melbourne.
  • The third assertion makes use of the Null situation on bedrock:InferenceProfileArn to disclaim any Amazon Bedrock request in Sydney or Melbourne that’s not routed via a cross-Area inference profile. This prevents direct mannequin invocation in vacation spot Areas whereas permitting cross-Area inference to perform usually.

For detailed SCP configuration examples, world cross-Area inference IAM insurance policies, disabling particular cross-Area inference varieties, and AWS Management Tower integration steerage, check with Securing Amazon Bedrock cross-Area inference: Geographic and world.

Auditing and monitoring

AWS CloudTrail logs all cross-Area inference calls within the supply Area. The additionalEventData.inferenceRegion discipline information the place every request was processed, so you possibly can audit precisely the place inference occurred:

{ 
     "eventSource": "bedrock.amazonaws.com", 
     "eventName": "InvokeModel", 
     "awsRegion": "ap-southeast-6", 
     "requestParameters": { 
         "modelId": "au.anthropic.claude-sonnet-4-6" 
     }, 
     "additionalEventData": { 
         "inferenceRegion": "ap-southeast-2" 
     } 
} 

For real-time operational monitoring, Amazon CloudWatch supplies metrics for cross-Area inference requests in your supply Area. Key metrics embody:

  • InvocationCount — Whole variety of inference requests
  • InvocationLatency — Finish-to-end response time together with cross-Area routing
  • InvocationClientErrors — Failed requests, together with throttling (spikes point out that you just’re approaching quota limits)
  • InputTokenCount and OutputTokenCount — Token consumption for quota monitoring

Quota administration

Amazon Bedrock service quotas are managed on the supply Area degree. Quota will increase requested from the Auckland Area (ap-southeast-6) apply solely to requests originating from Auckland.

Quotas are measured in two dimensions:

  • Tokens per minute (TPM) — The utmost variety of tokens (enter + output) processed per minute
  • Requests per minute (RPM) — The utmost variety of inference requests per minute

When calculating your required quota, account for the token burndown fee. For Anthropic Claude Opus 4.6, Sonnet 4.6, and Sonnet 4.5, output tokens devour 5 instances extra quota than enter tokens (5:1 burndown fee). For Claude Haiku 4.5 and Amazon Nova fashions, the burndown fee is 1:1.

Quota consumption method:

Quota consumption = Enter tokens + Cache write tokens + (Output tokens x Burndown fee)

To request quota will increase, navigate to the AWS Service Quotas console in your supply Area, choose Amazon Bedrock, and seek for the related cross-Area inference quota to your mannequin.

Conclusion

On this submit, we launched cross-Area inference assist from the New Zealand Area on Amazon Bedrock. Prospects in New Zealand can now make API calls from Auckland and entry basis fashions via geographic and world cross-Area inference profiles.Key takeaways:

  • Auckland is now a supply Area for cross-Area inference — New Zealand clients could make Amazon Bedrock API calls from their native Area, with logs and configurations staying in Auckland.
  • AU geographic cross-Area inference retains information inside ANZ — Inference requests from Auckland route to a few locations (Auckland, Sydney, and Melbourne), offering Anthropic Claude fashions inside the ANZ geographic boundary.
  • World cross-Area inference expands mannequin entry — offering the best obtainable throughput by routing requests to supported business AWS Areas worldwide.
  • Present Australian routing is unchanged — Sydney and Melbourne supply Areas proceed to route between one another solely.

You may get began with cross-Area inference from the New Zealand Area with the next steps:

  • Register to the Amazon Bedrock console within the Auckland Area (ap-southeast-6).
  • Configure IAM and SCP permissions utilizing the coverage instance on this submit.
  • Make your first API name utilizing the au. inference profile ID.
  • Request quota will increase via the Service Quotas console primarily based in your anticipated workload.

For extra data, check with:


In regards to the authors

Zohreh Norouzi

Zohreh Norouzi is a Safety Options Architect at Amazon Net Providers. She helps clients make good safety decisions and speed up their journey to the AWS Cloud. She has been actively concerned in generative AI safety initiatives throughout APJ, utilizing her experience to assist clients construct safe generative AI options at scale.

Melanie Li

Melanie Li, PhD, is a Senior Generative AI Specialist Options Architect at AWS primarily based in Sydney, Australia, the place her focus is on working with clients to construct options utilizing state-of-the-art AI/ML instruments. She has been actively concerned in a number of generative AI initiatives throughout APJ, harnessing the ability of LLMs. Previous to becoming a member of AWS, Dr. Li held information science roles within the monetary and retail industries.

Saurabh Trikande

Saurabh Trikande is a Senior Product Supervisor for Amazon Bedrock and Amazon SageMaker Inference. He’s enthusiastic about working with clients and companions, motivated by the aim of democratizing AI. He focuses on core challenges associated to deploying advanced AI functions, inference with multi-tenant fashions, value optimizations, and making the deployment of generative AI fashions extra accessible. In his spare time, Saurabh enjoys mountaineering, studying about revolutionary applied sciences, following TechCrunch, and spending time along with his household.

James Zheng

James Zheng is a Software program Growth Supervisor at Amazon Net Providers.

William Yap

William Yap is Principal Product Supervisor for Amazon Bedrock.

Julia Bodia

Julia Bodia is Principal Product Supervisor for Amazon Bedrock.

Tags: AmazonAsiaBedrockgenerativeInferencePacificRunZealand
Previous Post

How you can Make Your AI App Quicker and Extra Interactive with Response Streaming

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular News

  • Greatest practices for Amazon SageMaker HyperPod activity governance

    Greatest practices for Amazon SageMaker HyperPod activity governance

    405 shares
    Share 162 Tweet 101
  • How Cursor Really Indexes Your Codebase

    404 shares
    Share 162 Tweet 101
  • Speed up edge AI improvement with SiMa.ai Edgematic with a seamless AWS integration

    403 shares
    Share 161 Tweet 101
  • Unlocking Japanese LLMs with AWS Trainium: Innovators Showcase from the AWS LLM Growth Assist Program

    403 shares
    Share 161 Tweet 101
  • The Good-Sufficient Fact | In direction of Knowledge Science

    403 shares
    Share 161 Tweet 101

About Us

Automation Scribe is your go-to site for easy-to-understand Artificial Intelligence (AI) articles. Discover insights on AI tools, AI Scribe, and more. Stay updated with the latest advancements in AI technology. Dive into the world of automation with simplified explanations and informative content. Visit us today!

Category

  • AI Scribe
  • AI Tools
  • Artificial Intelligence

Recent Posts

  • Run Generative AI inference with Amazon Bedrock in Asia Pacific (New Zealand)
  • How you can Make Your AI App Quicker and Extra Interactive with Response Streaming
  • Constructing age-responsive, context-aware AI with Amazon Bedrock Guardrails
  • Home
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms & Conditions

© 2024 automationscribe.com. All rights reserved.

No Result
View All Result
  • Home
  • AI Scribe
  • AI Tools
  • Artificial Intelligence
  • Contact Us

© 2024 automationscribe.com. All rights reserved.