Use AWS Generative AI CDK constructs to speed up app development
Assemble and deploy the infrastructure for a RAG solution using AWS CDK for Python
- Knowledge Bases for Amazon Bedrock: This is the foundation for the RAG solution.
- OpenSearch Serverless collection: It supports the vector search collection type that provides similarity search capability.
- An S3 bucket: This will act as the data source for the Knowledge Base.
- AWS Lambda function (written in Python) along with an API Gateway that uses the RetrieveAndGenerate API to query the knowledge base and generate responses from the information it retrieves.

- L1 constructs are the lowest-level construct and offer no abstraction. Each L1 construct maps directly to a single AWS CloudFormation resource.
- L2 constructs provide a higher-level abstraction along with helper methods for most resources that make it simpler to define resources.
- L3 constructs, also known as patterns, are the highest-level of abstraction, and are used to create entire AWS architectures for particular use cases in your application.
1
2
git clone https://github.com/abhirockzz/aws-cdk-generative-ai-rag
cd aws-cdk-generative-ai-rag
1
2
3
4
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
1
cdk deploy

/query
at the end of the API Gateway URL. I had uploaded the Amazon 2022 shareholder document and asked the following question: "What is Amazon doing in the field of generative AI?"1
2
3
export APIGW_URL=<enter URL>
curl -X POST -d 'What is Amazon doing in the field of generative AI?' $APIGW_URL/query
RetrieveAndGenerate
API:1
2
3
4
{
"question": "what is amazon doing in the field of generative AI?",
"response": "Amazon is investing heavily in Large Language Models (LLMs) and Generative AI. The company believes that Generative AI will transform and improve virtually every customer experience across its businesses. Amazon has been working on developing its own LLMs for a while now. Amazon Web Services (AWS) is democratizing Generative AI technology by offering price-performant machine learning chips like Trainium and Inferentia, so that companies of all sizes can afford to train and run their LLMs in production. AWS also enables companies to choose from various LLMs and build applications with AWS's security, privacy and other features. One example is AWS's CodeWhisperer, which uses Generative AI to generate code suggestions in real time, boosting developer productivity."
}
1
cdk destroy
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
kb = bedrock.KnowledgeBase(self, 'DocKnowledgeBase',
embeddings_model= bedrock.BedrockFoundationModel.TITAN_EMBED_TEXT_V1,
)
documentBucket = s3.Bucket(self, 'DocumentBucket')
deployment = s3deploy.BucketDeployment(self, "DeployDocuments",
sources=[s3deploy.Source.asset("docs")],
destination_bucket=documentBucket
)
bedrock.S3DataSource(self, 'KBS3DataSource',
bucket= deployment.deployed_bucket,
knowledge_base=kb,
data_source_name='documents',
chunking_strategy= bedrock.ChunkingStrategy.FIXED_SIZE,
max_tokens=500,
overlap_percentage=20
)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
kbQueryLambdaFunction = _lambda.Function(
self, 'KBQueryFunction',
runtime=_lambda.Runtime.PYTHON_3_12,
code=_lambda.Code.from_asset('lambda'),
handler='app.handler',
environment={
'KB_ID': kb.knowledge_base_id,
'KB_MODEL_ARN': 'arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0',
},
timeout=Duration.seconds(15)
)
kbArn = f'arn:aws:bedrock:{Stack.of(self).region}:{Stack.of(self).account}:knowledge-base/{kb.knowledge_base_id}'
# Create an IAM policy statement
policy_statement = iam.PolicyStatement(
actions=[
"bedrock:Retrieve",
"bedrock:RetrieveAndGenerate",
"bedrock:InvokeModel"
],
resources=[
"arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0",
kbArn
]
)
kbQueryLambdaFunction.add_to_role_policy(policy_statement)
query
endpoint:1
2
3
4
5
6
7
8
api = apigateway.LambdaRestApi(
self, 'KBQueryApiGW',
handler=kbQueryLambdaFunction,
proxy=False
)
kb_query = api.root.add_resource('query')
kb_query.add_method('POST')
Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.