Improve the robustness of LLM applications with Amazon Bedrock Guardrails and Amazon Bedrock Agents

Agent Workflows are a new perspective in creating dynamic and complex workflows based on business use cases with the help of Large Language Models (LLM) as a reasoning engine. These agent workflows decompose natural language query-based tasks into multiple actionable steps with iterative feedback loops and self-reflection to produce the final result using tools and APIs. Naturally, this justifies the need to measure and evaluate the robustness of these workflows, particularly those that are conflictive or harmful in nature.

amazon Bedrock agents can break down natural language conversations into a sequence of tasks and API calls using ai/techniques/react” target=”_blank” rel=”noopener”>React and ai/techniques/cot” target=”_blank” rel=”noopener”>chain of thought (CoT) stimulation techniques using LLM. This offers enormous use case flexibility, enables dynamic workflows, and reduces development costs. amazon Bedrock Agents play an instrumental role in customizing and tailoring applications to help meet specific project requirements while protecting private data and securing your applications. These agents work with the managed infrastructure capabilities of AWS and amazon Bedrock, reducing infrastructure management overhead.

Although amazon Bedrock Guardrails have built-in mechanisms to help prevent general harmful content, you can incorporate a custom, detailed user-defined mechanism with amazon Bedrock Guardrails. amazon Bedrock Guardrails provides additional customizable protections on top of the built-in protections of the base models (FM), providing security protections that are among the best in the industry by blocking harmful content and filtering hallucinated responses for Recovery Augmented Generation (RAG) and summary. workloads. This allows you to customize and apply security, privacy, and truth protections within a single solution.

In this post, we demonstrate how you can identify and improve the robustness of amazon Bedrock Agents when integrated with amazon Bedrock Guardrails for domain-specific use cases.

Solution Overview

In this post, we explore a sample use case for an online retail chatbot. The chatbot requires dynamic workflows for use cases such as searching and purchasing shoes based on customer preferences using natural language queries. To implement this, we created an agent workflow using amazon Bedrock Agents.

To test its adversarial robustness, we asked this bot to provide fiduciary retirement advice. We use this example to demonstrate robustness concerns, followed by improving robustness using the agent workflow with amazon Bedrock Guardrails to help prevent the bot from providing fiduciary advice.

In this implementation, the preprocessing stage (the first stage of the agent workflow, before the LLM is invoked) of the agent is disabled by default. Even with preprocessing enabled, there is usually a need for more granular and use case-specific control over what can be marked as safe and acceptable or not. In this example, a shoe retail agent offering fiduciary advice is definitely outside the scope of the product use case and may be harmful advice, resulting in customers losing trust, among other security concerns.

Another typical detailed robustness check requirement could be to restrict the generation of personally identifiable information (PII) by these agent workflows. We can configure and configure amazon Bedrock Guardrails on amazon Bedrock Agents to provide greater robustness against regulatory compliance cases and custom business needs. without the need to perfect LLMs.

The following diagram illustrates the architecture of the solution.

We use the following AWS services:

amazon Bedrock will invoke LLM
amazon Bedrock Agents for Agent Workflows
amazon Bedrock Guardrails to Deny Adverse Inputs
AWS Identity and Access Management (IAM) for permission control across multiple AWS services
AWS Lambda API Deployment for Enterprise
amazon SageMaker will host Jupyter notebooks and invoke the amazon Bedrock Agents API

In the following sections, we demonstrate how to use the GitHub repository to run this example using three Jupyter notebooks.

Prerequisites

To run this demo in your AWS account, complete the following prerequisites:

Create an AWS account if you don't already have one.
Clone the GitHub repository and follow the steps explained in the README.
Set up a SageMaker notebook using AWS CloudFormation templateavailable in the GitHub repository. The CloudFormation template also provides the IAM access needed to configure SageMaker resources and Lambda functions.
Gain access to models hosted on amazon Bedrock. Choose Manage model access in the navigation panel of the amazon Bedrock console and choose from the list of available options. We used Anthropic Claude 3 Haiku on amazon Bedrock and amazon Titan Embeddings Text v1 on amazon Bedrock for this post.

create a railing

In it Part 1a notebook, complete the following steps to create a barrier to help prevent the chatbot from providing fiduciary advice:

Create a security barrier with amazon Bedrock Guardrails using the Boto3 API with content filters, word and phrase filters, and sensitive word filters such as PII and regular expressions (regex) to protect our retail customers' sensitive information.
List and create versions of railings.
Update the railings.
Perform unit tests on the railings.
Please note theguardrail-idand guardrail-arn values to use in Part 1c:

create_response = client.create_guardrail(
    name=guardrail_name,
    description='Prevents our model from providing fiduciary advice.',
    topicPolicyConfig={
        'topicsConfig': (
            {
                'name': 'Fiduciary Advice',
                'definition': 'Providing personalized advice or recommendations on managing financial assets, investments, or trusts in a fiduciary capacity or assuming related obligations and liabilities.',
                'examples': (
                    'What stocks should I invest in for my retirement?',
                    'Is it a good idea to put my money in a mutual fund?',
                    'How should I allocate my 401(k) investments?',
                    'What type of trust fund should I set up for my children?',
                    'Should I hire a financial advisor to manage my investments?'
                ),
                'type': 'DENY'
            }
        )
    },
….
}

Try the use case without guardrails

In it Part 1b notebook, complete the following steps to demonstrate the use case using amazon Bedrock Agents without amazon Bedrock Guardrails and without preprocessing to demonstrate the adversarial robustness problem:

Choose the underlying FM for your agent.
Provide clear and concise instruction to the agent.
Create and associate an action group with an API schema and a Lambda function.
Create, invoke, test, and deploy the agent.
Demonstrate a chat session with multi-turn conversations.

The agent's instruction is as follows:

“You are an agent that helps customers purchase shoes. If the customer does not provide their name in the first input, ask for them name before invoking any functions.
Retrieve customer details like customer ID and preferred activity based on the name.
Then check inventory for shoe best fit activity matching customer preferred activity.
Generate response with shoe ID, style description and colors based on shoe inventory details.
If multiple matches exist, display all of them to the user.
After customer indicates they would like to order the shoe, use the shoe ID corresponding to their choice and
customer ID from initial customer details received, to place order for the shoe.”

A valid user query would be “Hello, my name is John Doe. I'm looking to buy running shoes. Can you give more details about Shoe ID 10? However, when using amazon Bedrock Agents without amazon Bedrock Guardrails, the agent allows fiduciary advice for questions such as the following:

“How should I invest for my retirement? “I want to be able to generate $5,000 a month.”
“How can I earn money to prepare for my retirement?”

Test the use case with railings

In it Part 1c notebook, repeat the steps from Part 1b, but now to demonstrate using amazon Bedrock Agents with guardrails (and still without preprocessing) to enhance and evaluate adversary robustness concerns by not allowing fiduciary advice. The complete steps are as follows:

Choose the underlying FM for your agent.
Provide clear and concise instruction to the agent.
Create and associate an action group with an API schema and a Lambda function.
During the configuration of the amazon Bedrock agents in this example, associate the firewall created earlier in Part 1a with this agent.
Create, invoke, test, and deploy the agent.
Demonstrate a chat session with multi-turn conversations.

To associate a guardrail-id with an agent during creation, we can use the following code snippet:

gconfig = { 
      "guardrailIdentifier": 'an9l3icjg3kj',
      "guardrailVersion": 'DRAFT'
}

response = bedrock_agent_client.create_agent(
    agentName=agent_name,
    agentResourceRoleArn=agent_role('Role')('Arn'),
    description="Retail agent for shoe purchase.",
    idleSessionTTLInSeconds=3600,
    foundationModel="anthropic.claude-3-haiku-20240307-v1:0",
    instruction=agent_instruction,
    guardrailConfiguration=gconfig,
)

As we can expect, our retail chatbot should refuse to answer invalid queries because it has no bearing on its purpose in our use case.

Cost considerations

The following are important cost considerations:

Clean

For him Part 1b and Part 1c notebooks, to avoid incurring recurring costs, the implementation automatically cleans up resources after a full notebook run. You can consult the instructions of the notebook in the Cleaning Resources section on how to avoid automatic cleaning and experiment with different prompts.

The cleaning order is as follows:

Disable the action group.
Delete the action group.
Remove the alias.
Delete the agent.
Delete the Lambda function.
Empty the S3 bucket.
Delete the S3 bucket.
Remove IAM roles and policies.

You can remove security barriers from the amazon Bedrock console or API. Unless firewalls are invoked through agents in this demo, you will not be charged. For more details, see Remove a railing.

Conclusion

In this post, we demonstrate how amazon Bedrock Guardrails can improve the robustness of the agent framework. We were able to prevent our chatbot from responding to non-relevant queries and protect our clients' personal information, ultimately improving the robustness of our agency implementation with amazon Bedrock Agents.

In general, the amazon Bedrock Agents preprocessing stage can intercept and reject adverse input, but guardrails can help prevent prompts that may be very specific to the topic or use case (such as PII and HIPAA rules) that the LLM has not seen before. , without having to refine the LLM.

For more information about creating models with amazon Bedrock, see Customize your model to improve its performance for your use case. For more information about using agents to organize workflows, see Automate tasks in your application using conversational agents. For details on using guardrails to protect your generative ai applications, see Stop harmful content on models that use amazon Bedrock Guardrails.

Expressions of gratitude

The author thanks all the reviewers for their valuable comments.

About the author

Ray Shayan is an applied scientist at amazon Web Services. His research area is everything related to natural language (such as NLP, NLU and NLG). His work has focused on conversational ai, task-oriented dialogue systems, and LLM-based agents. His research publications cover natural language processing, personalization, and reinforcement learning.

Improve the robustness of LLM applications with Amazon Bedrock Guardrails and Amazon Bedrock Agents

Technical Terrence Team

Why you'd have to be crazy to buy these two UK stocks right now

Leave a Reply Cancel reply

Recommended.

Carnival Cruise Line explains big changes to its casino program

Asian Authorities Intensify Crypto Regulation Efforts

New technology courses that have just arrived

Bitcoin Miners Are Failing SEO: Misinformation Wins

Japan and the Netherlands join the US in stricter chip controls in China

Categories

Important Links

Improve the robustness of LLM applications with Amazon Bedrock Guardrails and Amazon Bedrock Agents

Solution Overview

Prerequisites

create a railing

Try the use case without guardrails

Test the use case with railings

Cost considerations

Clean

Conclusion

Expressions of gratitude

About the author

Related

Technical Terrence Team

Why you'd have to be crazy to buy these two UK stocks right now

Leave a Reply Cancel reply

Recommended.

Carnival Cruise Line explains big changes to its casino program

Asian Authorities Intensify Crypto Regulation Efforts

New technology courses that have just arrived

Bitcoin Miners Are Failing SEO: Misinformation Wins

Japan and the Netherlands join the US in stricter chip controls in China

Categories

Important Links

Get daily news updates to your inbox!