Deploy AI Agents to Production - Complete AgentCore Guide

Overview

Deploy the multimodal travel assistant agent to production using Amazon Bedrock AgentCore Runtime. This agent features persistent memory, multimodal content analysis (images, videos, documents), and personalized travel recommendations.

AgentCore Services

AgentCore Runtime ⭐ - Serverless execution with auto-scaling and session management
AgentCore Identity - Secure credential management for API keys and tokens
AgentCore Memory ⭐ - State persistence and conversation history
AgentCore Code Interpreter - Secure code execution sandbox
AgentCore Browser - Cloud browser automation
AgentCore Gateway - API management and tool discovery
AgentCore Observability - Monitoring, tracing, and debugging
AgentCore Policy - Deterministic control and security boundaries for agent-tool interactions
AgentCore Evaluations - Automated assessment and performance measurement for agents

Production Features

Persistent Memory: Cross-session memory using Bedrock AgentCore Memory
- Short-term Memory: Captures turn-by-turn interactions within a single session
- Long-term Memory: Automatically extracts and stores key insights across multiple sessions
Multimodal Analysis: Process images, videos, and documents with built-in tools
Travel Expertise: Personalized recommendations based on user preferences
Production Ready: Secure, scalable deployment on AWS infrastructure

Requirements & Setup

AWS Account with appropriate permissions
Python 3.10+ installed
AWS CLI configured (aws configure)
Model Access: Enable us.anthropic.claude-3-5-sonnet-20241022-v2:0 in Amazon Bedrock console
Basic understanding of AI agents and AWS services

Installation

# Navigate to deployment directory
cd deploy-to-production/deployment

# Create virtual environment (optional, can use parent directory)
python3 -m venv ../.venv
source ../.venv/bin/activate  # Windows: ..\.venv\Scripts\activate

# Install dependencies from requirements.txt
pip install -r requirements.txt

# Verify installation
agentcore --help

3-Step Deployment Process

Step 1: Configure

Navigate to deployment directory and configure the agent with memory enabled:

cd deployment
agentcore configure -e multimodal_agent.py
# Select 'yes' for memory
# Select 'yes' for long-term memory extraction

Or specify a different region:

agentcore configure -e multimodal_agent.py -r us-east-1

Custom Header Configuration

Select YES in Request Header Allow list, and in Request Header Allow paste X-Amzn-Bedrock-AgentCore-Runtime-Custom-Actor-Id

This header allows passing a user identifier from your application to the agent. The agent extracts it from context.request_headers (normalized to lowercase: x-amzn-bedrock-agentcore-runtime-custom-actor-id) and uses it to namespace memory per user.

At the end .bedrock_agentcore.yaml, must look like this:

request_header_configuration:
  requestHeaderAllowlist:
  - X-Amzn-Bedrock-AgentCore-Runtime-Custom-Actor-Id

This creates a .bedrock_agentcore.yaml configuration file.

Note: When you enable memory during configuration, the AgentCore CLI automatically creates the memory resource (if needed) and sets the BEDROCK_AGENTCORE_MEMORY_ID environment variable during deployment. Your agent code reads this variable automatically.

Step 2: Deploy

Deploy to Amazon Bedrock AgentCore Runtime:

agentcore launch

This command:

Builds your container using AWS CodeBuild (no Docker required)
Creates necessary AWS resources (ECR repository, IAM roles)
Deploys your agent to AgentCore Runtime
Configures CloudWatch logging
Sets up memory if enabled

Note the Agent ARN from the output - you'll need it to invoke the agent.

Step 3: Test Memory Functionality

For automated testing, use the provided test applications in sample-test/:

# Set your agent ARN (get from agentcore status or agentcore launch output)
export AGENT_ARN="your-agent-arn-from-step-2"

Test short-term memory (within session):

Captures turn-by-turn interactions within a single session. Agents maintain immediate context without requiring users to repeat information.

cd sample-test
python test_short_memory.py

This script tests:

Information storage within a session
Memory recall in the same session
Session-based context retention

Test long-term memory (across sessions):

Automatically extracts and stores key insights from conversations across multiple sessions, including user preferences, important facts, and session summaries.

cd sample-test
python test_long_memory.py

This script tests:

Information storage in one session
Memory extraction and persistence
Cross-session memory recall
User-specific memory isolation

Important: Long-term memory extraction is an asynchronous background process that can take a minute or more. The test waits 60 seconds between invocations for reliable memory retrieval.

Generate test content (optional):

If you need sample travel content for testing, use the travel content generator from the parent directory:

cd ..
07-travel-content-generator.ipynb
# Generate images, videos, and itineraries for any destination
# Generated content will be saved to data-sample/ directory
cd deploy-to-production/sample-test

Test multimodal capabilities:

python test_image.py path/to/image.jpg

# Run the video test with a sample video
python test_video.py path/to/video.mp4

This script tests:

Video analysis with the agent (visual only, no audio)
Memory of video content in follow-up questions
Multimodal payload format (text + video)
Maximum video size: ~20MB

Interactive Notebook

For an interactive experience, use the Jupyter notebook:

cd sample-test
#go to notebook test_agentcore_memory.ipynb

The notebook demonstrates:

✅ Cross-session memory persistence
✅ Multimodal content (images and videos)
✅ Memory survival across kernel restarts
✅ User isolation testing
✅ Pretty-printed conversations

If you want to start using the agent by creating your own code, keep the following points in mind:

Session IDs must be 33+ characters for proper session management
Use custom headers for user identification: X-Amzn-Bedrock-AgentCore-Runtime-Custom-Actor-Id
Same user ID enables cross-session memory
Different session IDs simulate different conversations
Headers are normalized to lowercase in the agent code

Locate AWS Resources After Deployment

After deployment, find resources in AWS Console:

Resource	Location
Agent Logs	CloudWatch → Log groups → `/aws/bedrock-agentcore/runtimes/{agent-id}-DEFAULT`
Container Images	ECR → Repositories → `bedrock-agentcore-multimodal_agent`
Build Logs	CodeBuild → Build history
IAM Role	IAM → Roles → Search for "BedrockAgentCore"
Memory Store	Bedrock Console → AgentCore → Memory

You can also check your agent status:

cd deployment
agentcore status

3 Deployment Options - Cloud, Local & Hybrid

Default: CodeBuild (Recommended)

No Docker required - builds in the cloud:

agentcore launch

Local Development

Build and run locally (requires Docker):

agentcore launch --local

Hybrid: Local Build + Cloud Runtime

Build locally, deploy to cloud (requires Docker):

agentcore launch --local-build

Agent Code Architecture & Memory Integration

The agent uses AgentCore Memory SDK for integration with Strands Agents.

Automatic Memory Setup

When you run agentcore configure and enable memory, the AgentCore CLI automatically creates the memory resource (if needed) and sets the BEDROCK_AGENTCORE_MEMORY_ID environment variable during agentcore launch. Your agent code reads this variable automatically - no manual configuration needed.

Memory automatically stores:

Travel preferences and interests
Dietary restrictions
Budget considerations
Past travel experiences
User facts and context

Basic Memory Setup

from bedrock_agentcore.memory import MemoryClient
from bedrock_agentcore.memory.integrations.strands.config import AgentCoreMemoryConfig, RetrievalConfig

# Create memory client
client = MemoryClient(region_name="us-west-2") #your region

# Create memory store
basic_memory = client.create_memory(
    name="BasicTestMemory",
    description="Basic memory for testing short-term functionality"
)

# Configure memory with retrieval settings
memory_config = AgentCoreMemoryConfig(
    memory_id=basic_memory.get('id'),
    session_id=session_id,
    actor_id=actor_id,
    retrieval_config={
        f"/users/{actor_id}/facts": RetrievalConfig(top_k=3, relevance_score=0.5),
        f"/users/{actor_id}/preferences": RetrievalConfig(top_k=3, relevance_score=0.5)
    }
)

Memory Integration with Strands Agents

_agent = Agent(
            model=BedrockModel(model_id="us.anthropic.claude-3-5-sonnet-20241022-v2:0"),
            tools=[image_reader, file_read,video_reader_local],
            system_prompt=system_prompt,
            session_manager=session_manager

The invoke function is the main entry point for your AgentCore agent:

Receives user prompts and context from AgentCore Runtime
Extracts session and actor IDs for memory management
Creates or retrieves the agent instance with memory configuration
Processes the user message and returns the response

@app.entrypoint
def invoke(payload, context):
    """AgentCore Runtime entry point with lazy-loaded agent"""
    # Extract user prompt
    prompt = payload.get("prompt", "Hello!")
    
    # Get session/actor info for memory
    actor_id = context.request_headers.get('X-Amzn-Bedrock-AgentCore-Runtime-Custom-Actor-Id', 'whatsapp-user')
    session_id = context.session_id or 'whatsapp-session'
    
    # Get agent with memory
    agent = get_or_create_agent(actor_id, session_id)
    
    # Handle multimodal input (images)
    if "media" in payload:
        media = payload["media"]
        if media.get("type") == "image":
            # Process image with agent tools
            image_data = base64.b64decode(media["data"])
            # ... image processing logic
    
    # Process and return response
    result = agent(prompt)
    return {"result": result.message}

Sending Images, Videos & Text - Payload Examples

Sending Images

To send images to the agent, use this payload structure:

import base64

# Read and encode image
with open("destination.jpg", "rb") as f:
    image_data = base64.b64encode(f.read()).decode('utf-8')

# Create payload
payload = {
    "prompt": "What can you tell me about this destination?",
    "media": {
        "type": "image",
        "format": "jpeg",  # or "png", "jpg", "gif", "webp"
        "data": image_data  # base64-encoded string
    }
}

How it works:

Client sends image as base64 in payload
Agent decodes and saves temporarily to /tmp/
Agent instructs itself to use image_reader tool with the temp file path
Tool reads the file and sends bytes directly to Claude model
Model analyzes the image and responds

Sending Videos

Videos follow the same pattern but are processed by the video_reader_local tool:

import base64

# Read and encode video
with open("travel_vlog.mp4", "rb") as f:
    video_data = base64.b64encode(f.read()).decode('utf-8')

# Create payload
payload = {
    "prompt": "Analyze this travel video and suggest similar destinations",
    "media": {
        "type": "video",
        "format": "mp4",  # or "mov", "avi", "mkv", "webm"
        "data": video_data  # base64-encoded string
    }
}

Video limitations:

Maximum size: ~20MB (for local processing)
Visual content only (no audio analysis)
Supported formats: mp4, mov, avi, mkv, webm

Text-Only Messages

For text-only messages, simply send the prompt:

payload = {
    "prompt": "I want to visit Japan. What should I know?"
}

Common Deployment Issues & Fixes

Permission denied errors

Verify AWS credentials: aws sts get-caller-identity
Check required policies are attached

Docker not found warnings

Ignore if using default CodeBuild deployment
Only needed for --local or --local-build modes

Model access denied

Enable Claude 3.5 Sonnet in Bedrock console
Verify correct AWS Region

Port 8080 in use (local only)

Find process: lsof -ti:8080
Stop process: kill -9 PID

For more troubleshooting, see AgentCore Runtime Troubleshooting.

Delete Resources & Clean Up

Delete all AWS resources created by the toolkit:

cd deployment
agentcore destroy

Note: This will delete the agent runtime but not the memory store. To delete the memory store, go to the Bedrock Console → AgentCore → Memory.

Additional Documentation & Resources

Documentation

Code Examples

AWS Labs AgentCore Samples

Related Notebooks

06-travel-assistant-demo.ipynb - Interactive demo with cross-session memory
07-travel-content-generator.ipynb - Generate test content for multimodal testing
05-s3-vector-memory.ipynb - Memory implementation with S3 Vectors
02-custom-tools.ipynb - Custom tool development

Ready to deploy? Follow the Quick Start guide above to get your multimodal travel agent running in minutes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deploy AI Agents to Production - Complete AgentCore Guide

Overview

AgentCore Services

Production Features

Requirements & Setup

Installation

3-Step Deployment Process

Step 1: Configure

Custom Header Configuration

Step 2: Deploy

Step 3: Test Memory Functionality

Locate AWS Resources After Deployment

3 Deployment Options - Cloud, Local & Hybrid

Default: CodeBuild (Recommended)

Local Development

Hybrid: Local Build + Cloud Runtime

Agent Code Architecture & Memory Integration

Automatic Memory Setup

Basic Memory Setup

Memory Integration with Strands Agents

Sending Images, Videos & Text - Payload Examples

Sending Images

Sending Videos

Text-Only Messages

Common Deployment Issues & Fixes

Delete Resources & Clean Up

Additional Documentation & Resources

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Deploy AI Agents to Production - Complete AgentCore Guide

Overview

AgentCore Services

Production Features

Requirements & Setup

Installation

3-Step Deployment Process

Step 1: Configure

Custom Header Configuration

Step 2: Deploy

Step 3: Test Memory Functionality

Locate AWS Resources After Deployment

3 Deployment Options - Cloud, Local & Hybrid

Default: CodeBuild (Recommended)

Local Development

Hybrid: Local Build + Cloud Runtime

Agent Code Architecture & Memory Integration

Automatic Memory Setup

Basic Memory Setup

Memory Integration with Strands Agents

Sending Images, Videos & Text - Payload Examples

Sending Images

Sending Videos

Text-Only Messages

Common Deployment Issues & Fixes

Delete Resources & Clean Up

Additional Documentation & Resources