Sample Apps for AWS Generative AI CDK Constructs

February 12, 2026 ยท View on GitHub

This repo provides samples to demonstrate how to build your own Generative AI solutions using AWS Generative AI CDK Constructs.

Getting started

Use CaseDescriptionTypeLanguage
SageMaker JumpStart modelThis sample provides a sample application which deploys a SageMaker real-time endpoint hosting a Llama 2 foundation model developed by Meta from Amazon JumpStart, and an AWS Lambda function to run inference requests against that endpoint.BackendTypeScript
SageMaker Hugging Face modelThis sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model (Mistral 7B) from Hugging Face, and an AWS Lambda function to run inference requests against that endpoint.BackendTypeScript
SageMaker Hugging Face model on AWS Inferentia2This sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model (Zephyr 7B) from Hugging Face, and an AWS Lambda function to run inference requests against that endpoint. This sample uses Inferentia 2 as the hardware accelerator.BackendTypeScript
SageMaker custom endpointThis sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model with artifacts stored in an Amazon Simple Storage Service (S3) bucket, and an AWS Lambda function to run inference requests against that endpoint. This sample uses Inferentia2 as the hardware accelerator.BackendTypeScript
SageMaker multimodal custom endpointThis sample provides a sample application which deploys a SageMaker real-time endpoint hosting llava-1.5-7b, with artifacts stored in an Amazon Simple Storage Service (S3) bucket, a custom inference script, and an AWS Lambda function to run inference requests against that endpoint.BackendTypeScript
SageMaker image to video endpointThis sample provides a sample application which deploys a SageMaker async endpoint hosting stable-video-diffusion-img2vid-xt-1-1, with artifacts stored in an Amazon Simple Storage Service (S3) bucket, a custom inference script, and an AWS Lambda function to run inference requests against that endpoint.BackendTypeScript
LLM on SageMaker in GovCloud PDTThis sample provides a sample application which deploys a SageMaker real-time endpoint hosting Falcon-40b on GovCloud PDT.BackendTypeScript
Amazon Bedrock AgentsThis sample provides a sample application which deploys an Amazon Bedrock Agent and Knowledge Base backed by an OpenSearch Serverless Collection and documents in S3. It demonstrates how to use the Amazon Bedrock CDK construct.BackendTypeScript
Python SamplesThis project showcases the utilization of the 'generative-ai-cdk-constructs' package from the Python Package Index (PyPI).BackendPython
.NET SamplesThis project showcases the utilization of the 'Cdklabs.GenerativeAiCdkConstructs' package from nuget library.Backend.NET
Contract Compliance AnalysisThis prototype can potentially help you make your contract compliance process more efficient, leveraging Generative Artificial Intelligence (GenAI) to evaluate contract clauses against predefined guidelines and to provide feedback on adherence to standards applicable to you, based on your instructions. This is achieved through workflows that leverages Large Language Models via Amazon Bedrock.Backend + FrontendPython for Backend, TypeScript (React) for Frontend
Multimodal RAG solutionThis project showcases Amazon Bedrock multimodal features using Amazon Bedrock Data Automation (BDA) combined with Amazon Bedrock Knowledge Bases for a RAG-based chatbot.Backend + FrontendPython for Backend, TypeScript (React) for Frontend
RFP Answers with GenAIThis project automates RFP (Request for Proposal) responses by using Amazon Bedrock's LLMs to generate answers based on your company's existing knowledge base. Key features include direct Excel file processing for questions and answers, Bedrock AgentCore Runtime with agents and tools for sophisticated question answering, and S3 Vectors for simplified vector storage.Backend + FrontendPython for Backend, TypeScript (React) for Frontend
Code ExpertThis project addresses the scalability limitations of manual code reviews by leveraging artificial intelligence to perform expert code reviews automatically. It leverages the Bedrock Batch Step Functions CDK construct.BackendPython for Backend and Demo, TypeScript for CDK
Bedrock Agent UI WrapperThis sample provides a CDK construct that creates an API layer and frontend application for Amazon Bedrock Agents. It includes authentication with Amazon Cognito, agent trace streaming, and can be deployed locally or on ECS Fargate.API layer + FrontendPython
Stateless MCP Server on AWS LambdaSample MCP Server running natively on AWS Lambda and API Gateway without any extra bridging components or custom transports and a test MCP client.API layerTypeScript
Stateless MCP Server on ECSSample stateless MCP Server running natively on ECS Fargate and ALB without any extra bridging components or custom transports and a test MCP client.API layerTypeScript
Stateful MCP Server on ECSSample stateful MCP Server running natively on ECS Fargate and ALB without any extra bridging components or custom transports and a test MCP client.API layerTypeScript
Speech to speechReal-time Speech to Speech solution with Amazon Nova Sonic, featuring a Java WebSocket server and React frontend.Backend + FrontendPython for Backend, TypeScript (React) for Frontend
Strands Lambda DurableThis folder contains demos showcasing the Strands Agents Python SDK integrated with AWS Lambda durable execution.BackendPython for Lambda, Ts for IaC

Contributing

Please refer to the CONTRIBUTING document for further details on contributing to this repository.