GenAI on Outposts: Bringing AI to the Edge
Run GenAI workloads on AWS Outposts with Amazon EKS for low-latency AI at the edge while meeting data privacy and residency requirements.
- An AWS account with access to EKS and AWS Outposts.
- Appropriate IAM permissions to create EKS clusters, manage EC2 instances, and work with other AWS services.
- Availability of the g4dn.12xlarge GPU instance type in your AWS Outposts configuration.
- The latest versions of the AWS Command Line Interface (AWS CLI, v2 recommended), kubectl, and eksctl installed.
Disclaimer: Before following the steps outlined below, please ensure they comply with your organization's security protocols and corporate policies.
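The tooling and instance-type prerequisites can be checked from a terminal before going further. The following is a minimal sketch rather than part of the original walkthrough: the function names and the Outpost ID in the usage comments are placeholders, and the query path assumes the response shape of `aws outposts get-outpost-instance-types`.

```shell
# Report which of the given command-line tools are on PATH.
# Returns non-zero if at least one is missing.
check_tools() {
  missing=0
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "found: $tool"
    else
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# Succeed only if the Outpost ($1) lists the instance type ($2).
# Assumes the response contains InstanceTypes[].InstanceType.
outpost_has_instance_type() {
  aws outposts get-outpost-instance-types --outpost-id "$1" \
    --query 'InstanceTypes[].InstanceType' --output text |
    tr '\t' '\n' | grep -Fxq "$2"
}

# Usage (the Outpost ID below is a placeholder):
#   check_tools aws kubectl eksctl
#   outpost_has_instance_type op-0123456789abcdef0 g4dn.12xlarge && echo "GPU type available"
```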
Make sure to update the region and OutpostARN before saving this config as cluster-config.yaml.
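A quick sanity check on those two values can catch a copy-paste mistake before eksctl runs. This sketch assumes the usual Outpost ARN layout (`arn:<partition>:outposts:<region>:<account-id>:outpost/op-...`) and that the cluster region should match the region embedded in the ARN; the function names and sample values are illustrative only.

```shell
# Print the region field (4th colon-separated field) of an Outpost ARN.
outpost_arn_region() {
  printf '%s\n' "$1" | cut -d: -f4
}

# Check that $1 looks like an Outpost ARN and that its region equals $2.
check_outpost_arn() {
  arn="$1"
  region="$2"
  case "$arn" in
    arn:*:outposts:*:*:outpost/op-*) ;;
    *) echo "not an Outpost ARN: $arn"; return 1 ;;
  esac
  arn_region=$(outpost_arn_region "$arn")
  if [ "$arn_region" != "$region" ]; then
    echo "region mismatch: config uses $region, ARN uses $arn_region"
    return 1
  fi
  echo "ok: Outpost ARN is in $region"
}

# Usage (sample values are placeholders):
#   check_outpost_arn "arn:aws:outposts:us-west-2:123456789012:outpost/op-0123456789abcdef0" us-west-2
#   eksctl create cluster -f cluster-config.yaml
```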
- Add your self-managed node group specifically configured for your AWS Outpost:
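A minimal sketch of this step, assuming the node group is described in an eksctl config file. The file name `nodegroup-config.yaml` is a placeholder, and the helper function is only a convenience for confirming that the Outpost nodes have joined the cluster.

```shell
# Create the node group from a config file (file name is hypothetical):
#   eksctl create nodegroup --config-file=nodegroup-config.yaml

# Count nodes whose STATUS column is exactly "Ready".
count_ready_nodes() {
  kubectl get nodes --no-headers |
    awk '$2 == "Ready" { n++ } END { print n + 0 }'
}

# Usage:
#   count_ready_nodes
```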
- Install the NVIDIA GPU Operator:
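One common way to do this is with NVIDIA's Helm chart. The sketch below assumes Helm is installed (it is not listed in the prerequisites above); the release name and namespace `gpu-operator` are choices made for this example, not values taken from the original post.

```shell
# Install the GPU Operator from NVIDIA's Helm repository.
# Release name and namespace are assumptions for this sketch.
install_gpu_operator() {
  helm repo add nvidia https://helm.ngc.nvidia.com/nvidia &&
    helm repo update &&
    helm install gpu-operator nvidia/gpu-operator \
      --namespace gpu-operator --create-namespace --wait
}

# Print each node name with its allocatable nvidia.com/gpu count
# (the count is empty for nodes without GPUs).
show_gpu_capacity() {
  kubectl get nodes \
    -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.allocatable.nvidia\.com/gpu}{"\n"}{end}'
}

# Usage:
#   install_gpu_operator
#   show_gpu_capacity
```

Once the operator pods are running, the GPU nodes should report a non-empty `nvidia.com/gpu` value.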
- Create an IAM policy from the downloaded policy document.
- Create an IAM role. Create a Kubernetes service account named aws-load-balancer-controller in the kube-system namespace for the AWS Load Balancer Controller and annotate the Kubernetes service account with the name of the IAM role.
- Install the AWS Load Balancer Controller as follows:
- Verify that the controller is installed.
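The four steps above can be sketched as a single function. This follows the general shape of the AWS Load Balancer Controller install procedure, with several assumptions: the policy document is already saved locally as `iam_policy.json`, the cluster has an IAM OIDC provider associated, Helm is installed, and the policy name, role name, cluster name, and account ID shown are placeholders.

```shell
# $1 = EKS cluster name, $2 = AWS account ID (both supplied by the caller).
install_lb_controller() {
  cluster="$1"
  account_id="$2"
  policy_name="AWSLoadBalancerControllerIAMPolicy"   # assumed name
  role_name="AmazonEKSLoadBalancerControllerRole"    # assumed name

  # 1. Create the IAM policy from the downloaded document.
  aws iam create-policy \
    --policy-name "$policy_name" \
    --policy-document file://iam_policy.json &&
  # 2. Create the IAM role and the annotated service account.
  eksctl create iamserviceaccount \
    --cluster="$cluster" \
    --namespace=kube-system \
    --name=aws-load-balancer-controller \
    --role-name="$role_name" \
    --attach-policy-arn="arn:aws:iam::${account_id}:policy/${policy_name}" \
    --approve &&
  # 3. Install the controller, reusing the service account from step 2.
  helm repo add eks https://aws.github.io/eks-charts &&
  helm repo update &&
  helm install aws-load-balancer-controller eks/aws-load-balancer-controller \
    -n kube-system \
    --set clusterName="$cluster" \
    --set serviceAccount.create=false \
    --set serviceAccount.name=aws-load-balancer-controller &&
  # 4. Verify that the controller deployment exists.
  kubectl get deployment -n kube-system aws-load-balancer-controller
}

# Usage (placeholder values):
#   install_lb_controller my-outposts-cluster 111122223333
```

The commands are chained with `&&` so that a failure in an earlier step stops the later ones from running against a half-configured cluster.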
- Check Pod Status: Start by checking the status of the pods to ensure they are running correctly. You can use the following command to monitor the pods continuously:
- Check Service and Ingress: Because the WebUI's readiness depends on both the service and the ingress being set up properly, verify that these resources are configured correctly:
- Monitor Ingress Readiness: Since ingress provisioning can take a few minutes, especially when integrating with AWS ALB, you can monitor the readiness by observing the annotations and events associated with the ingress:
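The three checks above map onto standard kubectl commands. In this sketch the namespace and ingress name are passed in as arguments because the original values are not shown here; `wait_for_ingress` is a hypothetical helper that polls until the ALB hostname appears in the ingress status.

```shell
# Show pods, services, ingresses, and ingress events for a namespace.
# $1 = namespace, $2 = ingress name.
# To watch pods continuously instead, run: kubectl get pods -n <namespace> -w
show_webui_status() {
  kubectl get pods -n "$1"
  kubectl get svc,ingress -n "$1"
  kubectl describe ingress "$2" -n "$1"
}

# Poll until the ingress reports a load balancer hostname, then print it.
# $1 = namespace, $2 = ingress name,
# $3 = max attempts (default 30), $4 = seconds between attempts (default 10).
wait_for_ingress() {
  namespace="$1"
  name="$2"
  attempts="${3:-30}"
  delay="${4:-10}"
  i=0
  while [ "$i" -lt "$attempts" ]; do
    host=$(kubectl get ingress "$name" -n "$namespace" \
      -o jsonpath='{.status.loadBalancer.ingress[0].hostname}' 2>/dev/null)
    if [ -n "$host" ]; then
      echo "$host"
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Usage (namespace and ingress name are placeholders):
#   show_webui_status genai webui-ingress
#   wait_for_ingress genai webui-ingress
```

The hostname printed by `wait_for_ingress` is the address to open in the browser in the next step.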
- Open the ingress address in your browser and start the conversation with the model you choose:

Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.