Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

AWS Logo
Menu

Unlock Observability with AI: AWS CloudWatch's Powerful Tools Revealed

"Observability helps us understand our system's behavior - is it functioning correctly, is it fast enough? Are there any errors? It's about being able to ask new questions and investigate problems we didn't even anticipate having." - Helen Ashton

Steph Gooch
Amazon Employee
Published Mar 4, 2025
Last Modified Mar 5, 2025
In this episode of the "Keys to AWS Optimization" show, hosts Stephanie Gooch (Senior Optimization Solutions Architect Advocate) and expert guests Helen Ashton (Sr. Solutions Architect) and Kevin Lewin (Solutions Architect) dive deep into how AWS is leveraging AI and machine learning to enhance real-time monitoring and incident response through Amazon CloudWatch. They explore the power of anomaly detection on metrics and logs, as well as a new AI-driven investigative tool called Amazon Q Operational Investigations.
Key Takeaways:
1. Observability is crucial for understanding system behavior, reducing downtime, and improving customer experience.
2. AWS CloudWatch's anomaly detection capabilities use AI models to automatically identify patterns and anomalies in metrics and log data.
3. The blog solution demonstrated uses AWS Bedrock's generative AI to automatically summarize log data, providing valuable context for troubleshooting.
4. Amazon Q Operational Investigations leverages AI to rapidly identify the root cause of incidents, by analyzing metrics, logs, and traces across various AWS services.
5. Integrating observability tools with incident management platforms like Jira or ServiceNow can streamline the investigation and resolution process.
6. Customizing and training these AI models can further enhance their capabilities for specific use cases.
7. These observability solutions are designed to be cost-effective, with pricing based on actual usage.
Call to Action:
Don't miss out on the valuable insights from this episode! You can watch the on-demand recording on the AWS YouTube channel or the Twitch channel. And if you have any additional questions, feel free to reach out to the AWS cost optimization team at costoptimization@amazon.com.

Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.

Comments