Amazon Bedrock now affords Stability AI Picture Companies: 9 instruments that enhance how companies create and modify photos. The know-how extends Steady Diffusion and Steady Picture fashions to offer you exact management over picture creation and enhancing. Clear prompts are crucial—they supply artwork course to the AI system. Sturdy prompts management particular parts like tone, texture, lighting, and composition to create the specified visible outcomes. This functionality serves skilled wants throughout product images, idea, and advertising and marketing campaigns.
On this put up, we develop on the put up Understanding immediate engineering: Unlock the artistic potential of Stability AI fashions on AWS. We present the way to successfully use superior prompting methods to maximise picture technology high quality and precision for enterprise software utilizing Stability AI Picture Companies in Amazon Bedrock.
Answer overview
Stability AI Picture Companies can be found as APIs in Amazon Bedrock, that includes capabilities akin to, in-painting, fashion switch, recoloring, background removing, object removing, fashion information, and way more.
Within the following sections, we first talk about immediate construction for optimum management of picture technology, then we offer superior methods of prompting for stylistic steerage. Code samples might be discovered within the following GitHub repository.
Conditions
To get began with Stability AI Picture Companies in Amazon Bedrock, comply with the directions in Getting began with the API to finish the next stipulations:
- Arrange your AWS account.
- Purchase credentials to grant programmatic entry.
- Connect the Amazon Bedrock permission to an AWS Id and Entry Administration (IAM) consumer or position.
- Request entry to the Amazon Bedrock fashions.
Construction prompts that maximize management
To maximise the granular capabilities of Stability AI Picture Companies in Amazon Bedrock, you will need to assemble prompts that allow fine-grained management.
This part outlines greatest practices for constructing efficient prompts that produce the specified output. We reveal how immediate construction impacts outcomes and why extra structured prompts sometimes yield extra constant and controllable outcomes.
Select the best immediate sort on your use case
Deciding on the best immediate format helps the mannequin higher perceive your intent. Three major immediate codecs ship totally different ranges of management and readability:
- Pure language maximizes readability and is greatest for common utilization
- Tag-based codecs allow exact structural management and are perfect for technical software
- Hybrid codecs mix pure language and the structural parts of tags to supply much more management
The next desk supplies examples of those three widespread methods to phrase your prompts. Every immediate format has its strengths relying in your purpose or the interface you’re utilizing.
Immediate sort | Immediate instance | Generated picture utilizing Steady Picture Extremely in Amazon Bedrock | Description and use case |
Primary Immediate (Pure Language) | “A clear product picture of a fragrance bottle on a marble countertop” | ![]() |
That is readable and intuitive. Nice for exploration, conversational instruments, and a few mannequin sorts. Steady Diffusion 3.5 responds greatest to this fashion. |
Tag-Primarily based Immediate | “fragrance bottle, marble floor, mushy mild, prime quality, product picture” | ![]() |
Utilized in many technology UIs or with fashions educated on datasets like LAION or Danbooru. Compact and good for stacking particulars. |
Hybrid Immediate | “fragrance bottle on marble counter, mushy studio lighting, sharp focus, f/2.8lens” | ![]() |
Better of each worlds. Add emphasis with weighting syntax to affect the mannequin’s priorities. |
Construct modular prompts
Modular prompting enhances AI picture technology effectiveness. This method divides prompts into distinct parts, every specifying what to attract and the way it ought to seem. Modular constructions present a number of advantages: they assist stop conflicting or complicated directions, enable for exact output management, and simplify immediate debugging. By isolating particular person parts, you’ll be able to shortly determine and modify efficient or ineffective elements of your prompts. This technique in the end results in extra refined and focused AI-generated photos.
The next desk supplies examples of modular immediate modules. Experiment with totally different immediate sequences on your desired consequence; for instance, inserting the fashion earlier than the topic will give it a extra visible weight.
Module | Instance | Description |
Prefix | “style editorial portrait of” | Units the tone and intent for a high-fashion styled portrait |
Topic | “a girl with medium-brown pores and skin and brief coiled hair” | Offers the mannequin’s look and floor element to assist information facial options |
Modifiers | “sporting an asymmetrical black mesh prime, metallic jewellery” | Provides stylized clothes and niknaks for visible curiosity |
Motion | “seated along with her shoulders angled, eyes locked on digital camera, one arm lifted” | Describes physique language and pose to offer dynamic composition |
Setting | “bathed in intersecting beams of laborious directional mild via window slats” | Provides context for dramatic mild play and environment |
Type | “high-contrast chiaroscuro lighting, sculptural and summary” | Informs the aesthetic and temper (shadow-driven, moody, architectural) |
Digital camera/Lighting | “shot on 85mm, studio setup, layered shadows and lightweight falling throughout face and physique” | Provides technical precision and helps management realism and constancy |
The next instance illustrates the way to use a modular immediate to generate the specified output.
Modular Immediate | Generated Picture Utilizing Steady Picture Extremely in Amazon Bedrock |
“style editorial portrait of a girl with medium-brown pores and skin and brief coiled hair, sporting an asymmetrical black mesh prime and metallic jewellery, seated with shoulders angled and one arm lifted, eyes locked on digital camera, bathed in intersecting beams of laborious directional mild via window slats, layered shadows and highlights sculpting her face and physique, high-contrast chiaroscuro lighting, summary and daring, shot on 85mm in studio” | ![]() |
Use unfavorable prompts for polished output
Detrimental prompts enhance AI output high quality by eradicating particular visible parts. Explicitly defining what to not embody within the immediate guides the mannequin’s output, sometimes resulting in skilled outputs. Detrimental prompts act like a retoucher’s guidelines used to deal with points of a picture to reinforce high quality and enchantment. For instance, “No bizarre fingers. No blurry corners. No cartoon filters. Undoubtedly no watermarks.” Detrimental prompts lead to clear, assured, compositions, freed from distracting factor and distortions.
The next desk supplies examples of extra tokens that can be utilized in unfavorable prompts.
Artifact Kind | Tokens to Use |
Low high quality or noise | blurry, lowres, jpeg artifacts, noisy |
Anatomy or mannequin points | deformed, additional limbs, unhealthy fingers, lacking fingers |
Type clashes | cartoon, illustration, anime, portray |
Technical errors | watermark, textual content, signature, overexposed |
Common cleanup | ugly, poorly drawn, distortion, worst high quality |
The next instance illustrates how a well-structured unfavorable immediate can improve photorealism.
With out Detrimental Immediate |
Immediate “(medium full shot) of (charming workplace cubicle) manufactured from glass materials, a number of colours, fashionable fashion, space-saving, upholstered seat, patina, gold trim, positioned in a contemporary backyard, with smooth furnishings, trendy decor, vivid lighting, comfy seating, Masterpiece, highest quality, uncooked picture, life like, very aesthetic, darkish “ |
![]() |
With Detrimental Immediate |
Immediate “(medium full shot) of (charming workplace cubicle) manufactured from glass materials, a number of colours, fashionable fashion, space-saving, upholstered seat, patina, gold trim, positioned in a contemporary backyard, with smooth furnishings, trendy decor, vivid lighting, comfy seating, Masterpiece, highest quality, uncooked picture, life like, very aesthetic, darkish” Detrimental Immediate “cartoon, 3d render, cgi, oversaturated, easy plastic textures, unreal lighting, synthetic, matte floor, painterly, dreamy, shiny end, digital artwork, low element background” |
![]() |
Emphasize or suppress parts with immediate weighting
Immediate weighting controls the affect of particular person parts in AI picture technology. These numerical weights prioritize particular immediate parts over others. For instance, to emphasise the character over the background, you’ll be able to apply a 1.8 weight to “character” (character: 1.8) and 1.1 to “background” (background: 1.1), which makes positive the mannequin prioritizes character element whereas sustaining environmental context. This focused emphasis produces extra exact outputs by minimizing competitors between immediate parts and clarifying the mannequin’s priorities.
The syntax for immediate weights is (
- (time period:1.2): Emphasize
- (time period:0.8): Deemphasize
- ((time period)): Shorthand for (time period:1.2)
- (((((((((time period)))))))): Shorthand for (time period:1.8)
The next instance reveals how immediate weights contribute to the generated output.
Immediate with weights “editorial product picture of (a translucent gel moisturizer jar:1.4) positioned on a (frosted glass pedestal:1.2), surrounded by (dewy pink flower petals:1.1), with mushy (subtle lighting:1.3), refined water droplets, shallow depth of subject” |
![]() |
Immediate with out weights “editorial product picture of a translucent gel moisturizer jar positioned on a frosted glass pedestal, surrounded by dewy pink flower petals, with mushy, refined water droplets, shallow depth of subject” |
![]() |
It’s also possible to use weights in unfavorable prompts to cut back how strongly the mannequin avoids one thing. For instance, “(textual content:0.5), (blurry:0.2), (lowres:0.1).” This tells the mannequin to be particularly positive to keep away from producing blurry textual content or low-resolution content material.
Giving particular stylistic steerage
Efficient immediate writing when utilizing Stability AI Picture Companies akin to Type Switch and Type Information requires an excellent understanding of favor matching and reference-driven prompting. These methods assist present clear stylistic course for each text-to-image and image-to-image creation.
Picture-to-image fashion switch extracts stylistic parts from an enter picture (management picture) and makes use of it to information the creation of an output picture based mostly on the immediate. Method writing the immediate as in case you’re directing knowledgeable photographer or stylist. Concentrate on supplies, lighting high quality, and creative intention—not simply objects. For instance, a well-structured immediate may learn: “Shut-up editorial picture of a translucent inexperienced lip gloss tube on crushed iridescent plastic, subtle coloured lighting, shallow DOF, excessive style product styling.”
Type tag layering: Recognized aesthetic labels that align with model identification
The artwork of crafting efficient prompts usually depends on incorporating established fashion tags that resonate with acquainted visible languages and datasets. By strategically mixing phrases from acknowledged aesthetic classes (starting from editorial images and analog movie to anime, cyberpunk cityscapes, and brutalist constructions), creators can information the AI towards particular visible outcomes that align with their model identification. These fashion descriptors function highly effective anchors within the immediate engineering course of. The flexibility of those tags extends additional via their capability to be mixed and weighted, permitting for nuanced management over the ultimate aesthetic. As an example, a skincare model may mix the clear strains of product images with dreamy, surreal parts, whereas a tech firm may merge brutalist construction with cyberpunk parts for a particular visible identification. This method to fashion mixing helps creators enhance their outputs whereas sustaining clear ties to recognizable visible genres that resonate with their target market. The secret is understanding how these fashion tags work together and utilizing their mixtures to create distinctive, but culturally related, visible expressions that serve particular artistic or industrial goals. The next desk supplies examples of prompts for a desired aesthetic.
Desired aesthetic | Immediate phrases | Instance use case |
Retro / Y2K | 2000s nostalgia, flash images, sweet tones, harsh lighting | Metallic textures, skinny fonts, early digital really feel. |
Clear fashionable | impartial tones, mushy gradients, minimalist styling, editorial format | Nice for wellness or skincare merchandise. |
Daring streetwear | city background, outsized match, sturdy pose, noon shadow | Style images and way of life adverts. Prioritize outfit construction and placement cues. |
Hyperreal surrealism | dreamcore lighting, shiny textures, cinematic DOF, surreal shadows | Performs effectively in music, style, or alt-culture campaigns. |
Invoke a named fashion as a reference
Some immediate constructions profit from invoking a named visible signature from a particular artist, particularly when mixed with your personal stylistic phrasing or workflows, as proven within the following instance.
Immediate “editorial studio portrait of a girl with glowing pores and skin in minimalist glam make-up, high-contrast lighting, clear background, (depiction of Van Gogh fashion:1.3)” |
![]() |
The next is a extra conceptual instance.
Immediate “product shot of a silver hair oil bottle with mushy reflections on curved chrome, (depiction of Wes Anderson fashion:1.2), underneath chilly studio lighting” |
![]() |
These phrases perform like calling on a style; they suggest selections round supplies, lighting, format, and colour tonality.
Use reference photos to information fashion
One other helpful approach is utilizing a reference picture to information the pose, colour, or composition of the output. To be used instances like matching a pose from a lookbook picture, transferring a colour palette from a marketing campaign nonetheless, or copying shadowplay from a photograph shoot, you’ll be able to extract and apply construction or fashion from reference photos.
Stability AI Picture Companies help a wide range of image-to-image workflows the place you need to use a reference picture (management picture) to information the output, akin to Construction, Sketch, and Type. Instruments like ControlNet (a neural community structure developed by Stability AI that enhances management), IP-Adapter (a picture immediate adapter), or clip-based captioning additionally allow additional management when paired with Stability AI fashions.
We are going to talk about ControlNet, IP-Adapter, and clip-based captioning in a subsequent put up.
The next is an instance of an image-to-image workflow:
- Discover a high-quality editorial reference.
- Use it with a depth, canny, or seg ControlNet to lock a pose.
- Type with a immediate.
Immediate “style editorial of a mannequin in layered knitwear, dramatic coloured lighting, sturdy shadows, excessive ISO texture” |
![]() |
Create the best temper with lighting management
In a immediate, lighting units tone, provides dimensionality, and mimics the language of images. It shouldn’t simply be “vivid vs. darkish.” Lighting is usually the fashion itself, particularly for audiences like Gen Z, as an example TikTok, early-aughts flash, harsh backlight, and colour gels. The next desk supplies some helpful lighting fashion immediate phrases.
Lighting fashion | Immediate phrases | Instance use case |
Excessive-contrast studio | laborious directional mild, deep shadows, managed highlights | Magnificence, tech, style with punchy visuals |
Gentle editorial | subtle mild, mushy shadows, ambient glow, overcast | Skincare, style, wellness |
Coloured gel lighting | blue and pink gel lighting, dramatic colour shadows, rim lighting | Nightlife, music-adjacent style, youth-forward styling |
Pure bounce | golden hour, mushy pure mild, solar flare, heat tones | Outdoor, way of life, brand-friendly minimalism |
Construct intent with posing and framing phrases
Good posing helps merchandise really feel aspirational and digital fashions extra dynamic. With AI, you should be intentional. Framing and pose cues assist keep away from stiffness, anatomical errors, and randomness. The next desk supplies some helpful posing and framing immediate phrases.
Immediate cue | Description | Tip |
trying off digital camera | Creates candid or editorial vitality | Helpful for lookbooks or advert pages |
fingers in movement | Provides realism and fluidity | Avoids awkward, static physique posture |
seated with physique turned | Provides depth and twist to the torso | Reduces symmetry, feels pure |
shot from low angle | Energy or standing cue | Works effectively for stylized streetwear or product hero pictures |
Instance: Placing all of it collectively
The next instance places collectively what we’ve mentioned on this put up.
Immediate “studio portrait of a mannequin with platinum hair in metallic cargo pants and a cropped mesh hoodie, seated with legs large on (acrylic stairs:1.6), magenta and teal gel lighting from left and behind, dramatic distinction, shot on 50mm, streetwear editorial for Gen Z marketing campaign” Detrimental immediate “blurry, additional limbs, watermark, cartoon, distorted face lacking fingers, unhealthy anatomy” |
![]() |
Let’s break down the previous immediate. We direct the look of the topic (platinum hair, metallic garments), specify their pose (seated wide-legged, assured, unposed), outline the setting (acrylic stairs and studio setup, managed, fashionable), state the lighting (combined gel sources, daring stylization), designate the lens (50mm, portrait realism), and lastly element the aim (for Gen Z marketing campaign, units visible and cultural tone). Collectively, the immediate produces the specified consequence.
Greatest practices and troubleshooting
Prompting isn’t a one-and-done process, particularly for artistic use instances. Most nice photos come from refining an concept over a number of makes an attempt. Contemplate the next methodology to iterate over your prompts:
- Maintain a immediate log
- Change one variable at a time
- Save seeds and base photos
- Use comparability grids
Generally issues go unsuitable—perhaps the mannequin ignores your immediate, or the picture appears to be like messy. These points are widespread and sometimes fast to repair, and you will get sharper, cleaner, and extra intentional outputs with each adjustment. The next desk supplies helpful ideas for troubleshooting your prompts.
Downside | Reason for concern | How you can repair it |
Type feels random | Mannequin is confused or phrases are obscure | Make clear fashion, add weight, take away conflicts |
Face will get warped | Over-styled or lacks facial cues | Add portrait of, headshot, or modify pose or lighting |
Picture is simply too darkish | Lighting not outlined | Add softbox from left, pure mild, or time of day |
Repetitive poses | Identical seed or static construction | Swap seed or change digital camera angle or topic motion |
Lacks realism or feels “AI-ish” | Improper tone or artifacts | Add negatives like cartoon, digital texture, distorted |
Conclusion
Mastering superior prompting methods can flip fundamental picture technology into skilled artistic outputs. Stability AI Picture Companies in Amazon Bedrock present exact management over visible creation and enhancing, serving to companies convert ideas into production-ready belongings. The mix of technical experience and artistic intent may help creators obtain the precision and consistency required in skilled settings. This management proves useful throughout a number of functions, akin to advertising and marketing campaigns, model consistency, and product visualizations. This put up demonstrated the way to optimize Stability AI Picture Companies in Amazon Bedrock to provide high-quality imagery that aligns along with your artistic objectives.
To implement these methods, entry Stability AI Picture Companies via Amazon Bedrock or discover Stability AI’s basis fashions obtainable in Amazon SageMaker JumpStart. It’s also possible to discover sensible code examples in our GitHub repository.
In regards to the authors
Maxfield Hulker is the VP of Group and Enterprise Growth at Stability AI. He’s a longtime chief within the generative AI area. He has helped construct creator-focused platforms like Civitai and Dream Studio. Maxfield frequently publishes guides and tutorials to make superior AI methods extra accessible.
Suleman Patel is a Senior Options Architect at Amazon Net Companies (AWS), with a particular give attention to machine studying and modernization. Leveraging his experience in each enterprise and know-how, Suleman helps prospects design and construct options that deal with real-world enterprise issues. When he’s not immersed in his work, Suleman loves exploring the outside, taking highway journeys, and cooking up scrumptious dishes within the kitchen.
Isha Dua is a Senior Options Architect based mostly within the San Francisco Bay Space working with generative AI mannequin suppliers and serving to buyer optimize their generative AI workloads on AWS. She helps enterprise prospects develop by understanding their objectives and challenges, and guides them on how they will architect their functions in a cloud-based method whereas supporting resilience and scalability. She’s keen about machine studying applied sciences and environmental sustainability.
Fabio Branco is a Senior Buyer Options Supervisor at Amazon Net Companies (AWS) and a strategic advisor, serving to prospects obtain enterprise transformation, drive innovation via generative AI and knowledge options, and efficiently navigate their cloud journeys. Previous to AWS, he held Product Administration, Engineering, Consulting, and Know-how Supply roles throughout a number of Fortune 500 corporations in industries, together with retail and shopper items, oil and fuel, monetary providers, insurance coverage, and aerospace and protection.