AWS open source newsletter, #203
A round up of the latest open source news, projects, and events that every open source developer should know about.
aws-config-rule-rds-logging-enabled-remediation
improve-employee-productivity-using-genai
Demos, Samples, Solutions and Workshops
build-an-agentic-llm-assistant
deploy-langfuse-on-ecs-with-fargate
sample-code-for-a-secure-vault-using-aws-nitro-enclaves
streamlit-bedrock-claude-sample
What I am reading from around the Community
Getting Started with OpenTofu in AWS
An Intro to AWS Deployments with OpenTofu, Scalr, & GitHub!
Cedar Policies: Default Deny and Avoiding Evaluation Errors with Validation
- The AWS SDK for Java 1.x is in maintenance mode, effective July 31, 2024 essential you read this if you are using the v1.x version of the AWS SDK for Java to find out what your options are, including test out the...
- Preview Release of the Migration Tool for the AWS SDK for Java 2.x looks at how this tool might be able to help in migrating your 1.x application to 2.x. [hands on]
- Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 provides a fantastic look into what it takes to migrate from Apache Spark over to Ray at scale, and how this helped reduce both data processing time and cost - essential reading
- Automate monitoring for your Amazon EKS cluster using CloudWatch Container Insights presents an event-driven, automation solution for monitoring your Amazon EKS cluster using Amazon CloudWatch Container Insights metrics as well as bunch of other AWS services and resources [hands on]
- Diving into OCI Image and Distribution 1.1 Support in Amazon ECR looks into how Amazon ECR is one of the first registry cloud services to support the Referrers API and OCI 1.1 (read the post to find out what that means!)
- Cordial’s journey implementing Bottlerocket and Karpenter in Amazon EKS is a case study in how Cordial implemented Bottlerocket as the Operating System (OS) and Karpenter as the Node Lifecycle Manager within their Amazon EKS environments to achieve operational efficiency and improve security posture (graphs ahoy!)
- How Getir optimized their Amazon EKS compute using Karpenter provides another case study, this time Getir, and how Karpenter helped them address the challenges of spiky workloads, which require faster responses for dynamic scaling
- Developing Custom Processors using OpenTelemetry in .NET 8 explores how to develop a custom processor using OpenTelemetry in the .NET ecosystem [hands on]
- Making sense of secrets management on Amazon EKS for regulated institutions provides a summary of three options for managing secrets in Amazon EKS - ESO, Sealed Secrets, and AWS Secrets and Configuration Provider (ASCP), and the key considerations for FSI customers when choosing between them
- How to Run WebAssembly on Amazon EKS is a step by step guid on how AWS empowers users to harness the full potential of Wasm by providing a seamless integration with Amazon EKS [hands on]
- Build a real-time analytics solution with Apache Pinot on AWSshows how to deploy a scalable Apache Pinot-based near real-time user facing analytics solution on AWS [hands on]
- Stream change data in a multicloud environment using AWS DMS, Amazon MSK, and Amazon Managed Service for Apache Flink dives deep into architecture, approaches, and considerations for streaming data changes from the transactional databases deployed in other cloud providers to a streaming data solution deployed on AWS [hands on]
- Improve the resilience of Amazon Managed Service for Apache Flink application with system-rollback feature looks at how to use the system-rollback feature in Managed Service for Apache Flink, and how this functionality improves your application’s resilience by providing a highly available Flink application [hands on]
- Build and deploy knowledge graphs faster with RDF and openCypher shows you how to use openCypher with RDF graphs [hands on]
- Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch discusses how the metadata layer of Apache Iceberg can be used to make data lakes more efficient. You will learn about an open-source solution that can collect important metrics from the Iceberg metadata layer [hands on]
- Integrate Amazon MWAA with Microsoft Entra ID using SAML authentication illustrates how to configure an Amazon MWAA environment deployed in private network access mode with customer managed VPC endpoints and authenticate users using SAML federated identity using Microsoft Entra ID and Application Load Balancer (ALB) [hands on]
- Optimize cost and performance for Amazon MWAA provides guidance on how you can optimise performance and save cost of your Apache Airflow environments by following some good practices [hands on]
- How Amazon GTTS runs large-scale ETL jobs on AWS using Amazon MWAA is a great case study on how Amazon GTTS were able to see 70% improvement in end-to-end runtime on their ETL jobs, among other benefits - a great read
- Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK illustrates the construction of a comprehensive CDC system, enabling the processing of CDC data sourced from an Amazon RDS for MySQL source [hands on]
- Improve Apache Kafka scalability and resiliency using Amazon MSK tiered storage dives deep into how tiered storage helps with faster broker recovery and quicker partition migrations, facilitating faster load balancing and broker scaling in your Apache Kafka clusters (plenty of graphs in this one!)
- AWS Glue mutual TLS authentication for Amazon MSKdescribes how to set up AWS Glue jobs to produce, consume, and process messages on an MSK cluster using mutual TLS authentication [hands on]
- Migrate an on-premises MySQL database to Amazon Aurora MySQL over a private network using AWS DMS homogeneous data migration and Network Load Balancerguides you through the steps of performing a homogeneous migration from an on-premises MySQL database to Amazon Aurora MySQL using AWS DMS homogeneous data migrations over a private network using network load balancer [hands on]
- Implement a rollback strategy after an Amazon Aurora MySQL blue/green deployment switchover looks at the steps needed to perform a blue/green deployment switchover and how to set up and perform a rollback strategy post switchover for Amazon Aurora MySQL-Compatible Edition [hands on]
- Build a serverless data quality pipeline using Deequ on AWS Lambda shows how to run Deequ (an open source framework for data quality) on Lambda [hands on]
- Build a custom HTTP client in Amazon Aurora PostgreSQL and Amazon RDS for PostgreSQL: An alternative to Oracle’s UTL_HTTP is a neat post that demonstrates how you can use PL/pgSQL custom wrapper functions to convert Oracle UTL_HTTP referenced custom code to the Amazon Aurora PostgreSQL and Amazon RDS for PostgreSQL equivalent [hands on]
- From Data Chaos to Cohesion: How OCSF is Optimizing Cyber Threat Detection is an in depth overview of the OCSF’s progress since its inception in 2022, and share insights from organisations participating in the project and adopting OCSF
- GraphStorm 0.3: Scalable, multi-task learning on graphs with user-friendly APIs provides an overview of what GraphStorm is and how it can help you tackle your large-scale graph ML challenges [hands on]
- Installing Red Hat OpenShift on AWS in a restricted network using AWS Secure Token Service is a detailed walk through on how to install a Red Hat OpenShift Container Platform cluster in a restricted Amazon VPC network to comply with regulatory requirements and organisational controls [hands on]
- How DXC creates application consistent EBS Snapshots for Linux looks at how DXC and AWS collaborated to implement a serverless solution to create application consistent backups for Linux instances [hands on]
JSON_TABLE
features that can convert JSON to a standard PostgreSQL table. The MERGE
command now supports the RETURNING
clause, letting you further work with modified rows. PostgreSQL 17 also includes general improvements to query performance and adds more flexibility to partition management with the ability to SPLIT/MERGE partitions.Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.