Use AWS Bedrock & AI Services (Claude, Nova, Polly, Transcribe) with Your Existing OpenAI Code

Published: December 19, 2025 at 09:58 AM EST
4 min read
Source: Dev.to

J.Goutin

TL;DR

Use your existing OpenAI‑compatible tools (OpenWebUI, n8n, Cline, Cursor) with AWS Bedrock’s 80+ models and AWS AI services (Polly, Transcribe). Just change the endpoint – no code changes needed. Deploy to your AWS account in 15 minutes with Terraform. Data stays in your AWS environment for HIPAA/GDPR compliance.

The Solution in 30 Seconds

from openai import OpenAI

# Just change this line ↓
client = OpenAI(base_url="https://your-aws-api/v1", api_key="your-key")

# Everything else stays the same ↓
response = client.chat.completions.create(
    model="anthropic.claude-sonnet-4-5-20250929-v1:0",  # Claude 4.5 on Bedrock
    messages=[{"role": "user", "content": "Hello from AWS!"}]
)

What this enables

  • Use OpenWebUI, n8n, Cline, Cursor with AWS Bedrock and AWS AI services
  • Access Claude 4.5, Nova, Qwen, DeepSeek, Mistral, Cohere, and 75+ more models
  • Text‑to‑speech (Polly), speech‑to‑text (Transcribe), image generation, embeddings
  • Keep all data in your AWS account (HIPAA, GDPR, FedRAMP ready)
  • Multi‑region model access from a single endpoint

The Problem We Solved

Most modern AI tools are built for OpenAI’s API, but AWS Bedrock uses a completely different SDK. This forces a choice: keep your familiar tools, or move to AWS for compliance and cost reasons and rewrite everything.

We ran into this while working with a customer who needed:

| Requirement | Why it mattered |
|---|---|
| Compliance | All data must stay in their AWS account (GDPR) |
| Choice | Multiple models (Claude, Nova, Qwen, Mistral, DeepSeek, Stability AI, Cohere) |
| Multi‑region access | Different Bedrock models are available in different AWS regions |
| Standard tooling | OpenWebUI, n8n, IDE coding agents (Cline, Cursor) |
| Full AWS AI stack | Bedrock models + Polly (TTS) + Transcribe (STT) |

Our Approach: An OpenAI‑Compatible Translation Layer

stdapi.ai is an API gateway that sits in your AWS account and translates OpenAI API calls to AWS services.

Key Features

OpenAI API Compatibility

Your existing code works without modifications:

from openai import OpenAI

client = OpenAI(
    base_url="https://your-deployment.example.com/v1",
    api_key="your-api-key"
)

# Chat completions
response = client.chat.completions.create(
    model="amazon.nova-pro-v1:0",
    messages=[{"role": "user", "content": "Explain AWS Lambda"}]
)

# Text‑to‑speech with Polly
response = client.audio.speech.create(
    model="amazon.polly-neural",
    input="Welcome to the future of voice technology!"
)

# Speech‑to‑text with Transcribe
with open("meeting-recording.mp3", "rb") as audio_file:
    transcription = client.audio.transcriptions.create(
        model="amazon.transcribe",
        file=audio_file,
        response_format="json"
    )

# Image generation with Bedrock
response = client.images.generate(
    model="stability.stable-image-ultra-v1:0",
    prompt="A serene mountain landscape at sunset"
)

# Embeddings with Bedrock
response = client.embeddings.create(
    model="cohere.embed-v4",
    input="Semantic search transforms how we find information"
)

Multi‑Region Access

Access models across multiple AWS Bedrock regions from a single endpoint. Different models are available in different regions; you can configure which regions to include. Bedrock inference profiles handle automatic routing when needed.
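To make the routing idea concrete, here is a minimal sketch of the kind of model-to-region lookup such a gateway might maintain. The table entries and the `resolve_region` helper are illustrative assumptions, not the gateway's actual routing logic or configuration:

```python
# Illustrative sketch: mapping a requested model ID to the AWS region that
# serves it. The entries below are examples only, not stdapi.ai's real table.
MODEL_REGIONS = {
    "anthropic.claude-sonnet-4-5-20250929-v1:0": "us-east-1",
    "amazon.nova-pro-v1:0": "us-east-1",
    "mistral.mistral-large-2402-v1:0": "eu-west-3",
}

def resolve_region(model_id: str, default_region: str = "us-east-1") -> str:
    """Pick the region configured for a model, falling back to a default."""
    return MODEL_REGIONS.get(model_id, default_region)

print(resolve_region("mistral.mistral-large-2402-v1:0"))  # eu-west-3
```

From the client's point of view none of this is visible: you send one model ID to one endpoint, and the gateway picks the region.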

Multi‑Modal Capabilities

Process text, images, videos, and documents together in a single request. Supports:

  • HTTP URLs
  • S3 URLs (direct access to your data)
  • Base64 data URIs

Ideal for vision tasks, document analysis, and Retrieval‑Augmented Generation (RAG) applications.
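As a sketch of the base64 path, here is how an OpenAI-format multimodal message carrying an inline image might be assembled. The image bytes are a placeholder; in practice you would read a real file, and for the HTTP or S3 paths you would simply put that URL in the `url` field instead:

```python
import base64

# Placeholder bytes standing in for real image data (PNG file signature).
image_bytes = b"\x89PNG\r\n\x1a\n"
data_uri = "data:image/png;base64," + base64.b64encode(image_bytes).decode()

# Standard OpenAI multimodal message: text part plus image part.
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe this image."},
        {"type": "image_url", "image_url": {"url": data_uri}},
    ],
}
```

You would then pass `messages=[message]` to `client.chat.completions.create(...)` exactly as in the earlier examples.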

Deployment

Deploy to your AWS account in 5‑15 minutes using Terraform.

Sample configurations set up:

  • Application Load Balancer with ECS/Fargate
  • IAM roles with least‑privilege access
  • CloudWatch logging
  • S3 storage
  • Optional custom domain with ACM certificate

Get started: Sample Terraform configurations

Use Cases

  • Chat interfaces – OpenWebUI or LibreChat for private ChatGPT alternatives
  • Workflow automation – n8n connecting AI to 400+ services
  • Developer tools – IDE coding agents (Cline, Cursor, Continue)
  • Knowledge management – AI‑powered note‑taking and semantic search
  • Internal AI tools – Custom chatbots for Slack, Discord, Teams

Technical Details

How It Works

  1. Converts OpenAI API format to Bedrock’s format
  2. Maps model IDs to the appropriate AWS service and region
  3. Provides unified access to models across configured AWS regions
  4. Converts Bedrock responses back to OpenAI format
  5. Supports streaming via Server‑Sent Events (SSE)
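Step 1 can be sketched in a few lines. Bedrock's Converse API takes the system prompt separately and wraps each message's text in content blocks; the toy translator below handles only plain-text messages, whereas a real translation layer also covers tools, images, and streaming:

```python
# Illustrative sketch of step 1: converting OpenAI-style chat messages into
# the shape of Bedrock's Converse API. Text-only; real translation does more.
def to_converse(openai_messages):
    system, messages = [], []
    for m in openai_messages:
        if m["role"] == "system":
            # Converse takes system prompts as a separate list of text blocks.
            system.append({"text": m["content"]})
        else:
            # Each message's content becomes a list of content blocks.
            messages.append({"role": m["role"], "content": [{"text": m["content"]}]})
    return {"system": system, "messages": messages}

req = to_converse([
    {"role": "system", "content": "Be concise."},
    {"role": "user", "content": "Hello from AWS!"},
])
```

Step 4 is the same mapping in reverse, applied to the Converse response.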

Security

  • All traffic stays within your AWS account
  • IAM role‑based access control
  • Optional API‑key authentication
  • VPC deployment supported
  • Integrates with AWS Bedrock guardrails

Performance

  • Low‑latency translation layer
  • Streaming response support
  • Scales with ECS/Fargate auto‑scaling
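For a sense of what the streaming wire format looks like, here is a minimal parser for the Server-Sent Events lines that OpenAI-compatible endpoints emit. The sample chunks below are illustrative, but the `data: ...` framing and the `[DONE]` sentinel follow the standard OpenAI streaming convention:

```python
import json

def parse_sse(lines):
    """Yield decoded JSON chunks from OpenAI-style SSE lines until [DONE]."""
    for line in lines:
        if line.startswith("data: "):
            payload = line[len("data: "):]
            if payload == "[DONE]":
                return
            yield json.loads(payload)

chunks = list(parse_sse([
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    'data: [DONE]',
]))
text = "".join(c["choices"][0]["delta"]["content"] for c in chunks)
print(text)  # Hello
```

In practice the OpenAI SDK handles this for you when you pass `stream=True`; the sketch just shows what travels over the wire.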

Open Source & Commercial Options

stdapi.ai is available as open source (AGPL‑3.0) for experimentation and as a commercial offering with additional support, SLA, and managed‑service options.

Feel free to reach out with questions or contributions!

For production deployments, we recommend the AWS Marketplace version, which includes:

  • Hardened container images
  • Regular security updates
  • Production support
  • 14‑day free trial

The commercial license also removes AGPL obligations for proprietary applications.

Get Started

🏢 Recommended: AWS Marketplace (14‑day free trial, production‑ready)

🚀 Open source: GitHub (AGPL‑3.0 for experimentation)

📦 Sample deployments: Terraform configurations

📚 Documentation: Full guides and API reference

Conclusion

This translation‑layer approach solves the AWS Bedrock compatibility gap while keeping everything AWS‑native.

Key benefits

  • Zero code changes – Your OpenAI SDK code works as‑is
  • AWS‑native – Fully integrated with Bedrock, Polly, Transcribe
  • Multi‑region – Access models across AWS regions from one endpoint
  • Production‑ready – Terraform deployment, CloudWatch integration

We’d love to hear your feedback! Have you solved this problem differently? What challenges have you faced integrating Bedrock into your workflow?

What’s Next

We’re planning detailed tutorials on:

  • Deploying OpenWebUI on AWS – Private ChatGPT alternative with Bedrock
  • Building n8n workflows – AWS AI services in automation pipelines
  • IDE coding agents setup – Configuring Cline and Cursor with Bedrock
  • RAG applications – Document search with Bedrock embeddings

Want a specific tutorial? Drop a comment or open an issue on GitHub!
