
Efficiently download LLM weights from HuggingFace to S3
Interactive CFN-wrapped utility on AWS console to easily and efficiently download multiple models from HuggingFace and store them on S3
that can be triggered from the AWS console whenever a new model needs to be downloaded.

- If needed before its instantiation, update the stack template with the default values for the parameters named LlmModels, PathToModels, BucketName with your own values. The list of models is a comma-separated string. Note: do not update the HF token in the CFN template, you’ll update the corresponding AWS SSM parameter after the CFN created it.
- Instantiate the CloudFormation stack via AWS Console, AWS CLI or any SDK. In standard situation, you’ll never have to update it again: everything happens via change of values for the parameters.
- If needed update the various parameters (s3 bucket, model list and path, access token) in SSM Parameter Store via the AWS Console interactive UI.
- Start the build project via the AWS CodeBuild console to download the models to your S3 bucket. The HF CLI will handle the download and save it.
- the IAM CodeBuild execution role limits the authorization of the Codebuild project. It can only execute CloudWatch logs, SSM and S3 commands to log activity, access SSM parameters and save models in the S3 bucket. If your environment requires it, you can further tighten credentials by restricting the resources to the specific log streams, parameters and bucket in use.
- On the cost efficiency side, CodeBuild pricing is charged with the pay-as-you-go model: "There are no upfront costs or minimum fees. You pay only for the resources you use. You are charged for compute resources based on the duration it takes your build to run. The rate depends on the selected compute type". So, the CodeBuild project in the CFN template won't cost you if you don't use it. Same thing for standard SSM parameters and IAM roles. For all those services, only API calls (from console or CLI) will be charged.
- The GitHub workflow lint_cfn.yaml attached to the repository checks the quality of the CFN template each time we commit a new version. It raises an error as soon as cfn-lint finds an error or a warning in template syntax. See Actions page for details.
Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.