AWS's kitchen-sink chatbot: Bedrock, RAG, and 20 services in one CDK stack
An AWS-sanctioned blueprint that wires together every managed service you can name to get a multi-LLM chatbot running in your account.

What it does
This is a CDK-deployed reference architecture for an enterprise chatbot with RAG. It supports multiple LLM backends—Amazon Bedrock (Claude, Llama 2), SageMaker endpoints, and custom models—plus vector search via OpenSearch, document storage in S3, Cognito auth, Lambda compute, and a React frontend. The README also mentions a “Nexus Gateway” (or “GenAIEH Gateway”—the two readmes disagree) for additional model access, plus cost tracking and conversation memory.
The interesting bit
The value here is less any single clever component and more the wiring: someone at AWS already did the painful work of getting Cognito, API Gateway, Lambda, OpenSearch, and Bedrock to talk to each other in a permission-safe way. If you’ve ever hand-rolled IAM policies for a multi-service AI app, you know why this matters.
Key highlights
- Multi-LLM support: Bedrock, SageMaker, custom endpoints, and gateway integrations
- Full RAG pipeline with OpenSearch vector storage and S3 document ingestion
- Enterprise guardrails: fine-grained access control, audit logging, encryption
- Token usage tracking and cost management built in
- React UI plus API endpoints for headless integration
- Deployed via AWS CDK (requires aws-cdk-lib 2.206.0+) and SeedFarmer
Caveats
- The README is vague on specifics: no architecture diagram, no benchmark claims, no cost estimates, and the gateway integration name literally changes between two copies of the same file
- “Production-ready” is asserted but undefined—what scale, what latency, what testing?
- Heavy AWS lock-in: if you’re not already committed to the AWS ecosystem, this is a lot of managed service surface area to adopt
Verdict
Worth a look if you’re an AWS shop that needs to stand up a RAG chatbot fast and would rather not debug IAM between OpenSearch and Bedrock for three days. Skip it if you want multi-cloud portability, detailed operational docs, or a lightweight stack.