A single API for all your conversational generative AI applications
Use the Converse API to create conversational generative AI applications with a single API across multiple models
Abhishek Gupta
Amazon Employee
Published Jun 10, 2024
You can now use the Converse API in Amazon Bedrock to create conversational applications like chatbots and support assistants. It is a consistent, unified API that works with all Amazon Bedrock models that support messages. The benefit is that you have a single code-base (application) and use it with different models – this makes it preferable to use the
Converse
API over InvokeModel (or InvokeModelWithResponseStream) APIs.I will walk you through how to use this API with the AWS SDK for Go v2.
Here is a super-high level overview of the API - you will see these in action when we go through some of the examples.
- The API consists of two operations -
Converse
andConverseStream
- The conversations are in the form of a
Message
object, which are encapsulated in aContentBlock
. - A
ContentBlock
can also have images, which are represented by anImageBlock
. - A message can have one of two roles -
user
orassistant
- For streaming response, use the
ConverseStream
API - The streaming output (
ConverseStreamOutput
) has multiple events, each of which has different response items such as the text output, metadata etc.
Let's explore a few sample apps now.
Refer to Before You Begin section in this blog post to complete the prerequisites for running the examples. This includes installing Go, configuring Amazon Bedrock access and providing necessary IAM permissions.
Let's start off with a simple example. You can refer to the complete code here.
To run the example:
The response may be different in your case:
The crux of the app is a
for
loop in which:- Sent using the
Converse
API - The response is collected and added to existing list of messages
- The conversation continues, until the app is exited
I used the Claude Sonnet model in the example. Refer to Supported models and model features for a complete list.
You can also use the
Converse
API to build multi-modal application that work images - note that they only return text, for now.You can refer to the complete code here.
To run the example:
I used the following picture of pizza and asked "what's in the image?":
Here is the output:
The is a simple single-turn exchange, but feel free to continue using a combination of images and text to continue the conversation.
The conversation for loop is similar to the previous example, but it has an added benefit of using the image data type with the help of types.ImageBlock:
Note:
imageContents
is nothing but a []byte
representation of the image.Streaming provide a better user experience because the client application does not need to wait for the complete response to be generated for it start showing up in the conversation.
You can refer to the complete code here.
To run the example:
Streaming based implementations can be a bit complicated. But in this case, it was simplified due to the clear API abstractions that the Converse API provided, including partial response types such as types.ContentBlockDeltaMemberText.
The application invokes ConverseStream API and then processes the output components in bedrockruntime.ConverseStreamOutput.
There are a few other awesome things the
Converse
API does to make your life easier.- It allows you to pass inference parameters specific to a model.
- You can also use the
Converse
API to implement tool use in your applications. - If you are using Mistral AI or Llama 2 Chat models, the
Converse
API will embed your input in a model-specific prompt template that enables conversations - one less thing to worry about!
Like I always say, Python does not have to be the only way to build generative AI powered machine learning applications. As an AI engineer, choose the right tools (including foundation models) and programming languages for your solutions. I maybe biased towards Go but this applies equally well to Java, JS/TS, C# etc.
Happy building!
Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.