Monitoring AWS Lambdas with Metricly

In a previous article, I discussed the key metrics and best practices that should be employed when monitoring AWS Lambdas. In this article, we’ll walk step-by- step through the process of configuring a Metricly monitoring solution for AWS Lambdas. We’ll cover the connection between AWS and Metricly, review the Lambda dashboard, and the configuration of policies to alert you so you know when the Lambda isn’t performing as expected.

Before You Start

If you don’t have a Metricly account already, you’ll want to set up an account before proceeding through this article. Metricly offers a 21-day free trial which you can sign up for from this link. As part of the signup process, you will have the opportunity to watch a video containing an overview of Netuitive from the co-founder, Bob Farzami. I would highly recommend watching this to get a broad understanding of the environment and capabilities of the product. You can also watch the Metricly demos here.

You will also want to have an AWS environment configured which includes the Lambda functions that you would like to monitor. AWS currently offers new users a year of access to the AWS free tier, which includes the ability to create and execute Lambda functions. Configuration of this environment and the creation of Lambda functions is well beyond the scope of this post, but excellent documentation is available from AWS which will assist with both.

Amazon Web Services Free Tier

AWS Lambda Getting Started Guide

Integration Between Metricly and Your AWS Account

The first thing we need to do is create a connection, or integration between Metricly and our AWS account. Log in to your Metricly account and navigate to the Integrations home page. You’ll have a wide range of integrations to choose from, and you’ll want to click the first option, Amazon Web Services.

Determine and enter a name for your integration, ensure that the Data Collection option is checked, and then decide on the method of authentication to AWS. There are two ways to connect the accounts. The first is by creating an IAM role in your AWS account, and the second is to create a new user with read-only access in your AWS account. I would recommend the IAM role approach, although both are clearly explained in the detailed instructions provided by Metricly.

IAM Role-Based Integration(Recommended)

AWS User-Based Integration

If you’re using the recommended IAM role approach, you should have an ARN to paste into the appropriate field on the page. The final step is to ensure that the Lambda type is included in the Integration. It is not checked by default, so you’ll want to scroll down the list of types and ensure that it is checked. With that complete, your screen should look similar to the image below, and you can click on Save.

AWS Lambda Summary Dashboard

Metricly automatically generates a simple Lambda dashboard for you named “AWS Lambda Summary.” This dashboard includes widgets which show:

The top five functions based on count of invocations during the time period.
The top five functions with the highest latency, or duration.
Events, showing events related to policies, which we’ll discuss shortly.

Your dashboard may look similar to the one shown below. I have two Lambdas currently active in my account. From what I have observed, metrics are gathered from AWS every five minutes, and may take an additional minute or two for processing. (Also note that the graphs may not populate until two data points are available.)

In addition to the graph view, each of the widgets also has a “Table” view which will list the specific lambdas being shown with the corresponding values.

Element Detail Dashboard

Metricly also provides an Element Detail dashboard for each Lambda function which can be accessed through the Inventory option on the navigation panel at the top of the page.

By selecting a specific element from the list of elements, you will be taken to a detailed view of the Lambda function which shows both the current state of the function in terms of Invocations and Latency, and the state of each of those metrics over time.

The dashboard above shows the metrics for a simple Lambda, which I created to accept a request, wait for a random time period between 0 and 300ms, and return a true response. I call the Lambda by executing the aws invoke-async command on my local workstation. Two key metrics are shown in each graphic. The light grey/blue color indicates the average for the metric being displayed, but the darker and more distinct line indicates the current state of the Lambda, or at least the current state over the preceding five minutes.

How to Know When Things Go Awry

Dashboards are great to look at, but not if you have to watch them continuously, looking for anomalies. We need a way to automate the process of monitoring AWS Lambdas so that we can focus on other tasks, but still know when something is happening which demands our attention. Metricly Policies fill that need, and the key policies have already been created for you.

While looking at the Detailed Element Report, you should notice a Policies link right above the graphics. You can also access these by clicking on the Policies option on the navigation panel at the top of the page.

Three policies are provided for you automatically by Metricly. The descriptions are taken from the Policy summary for each, and provided for convenience here.

Monitoring Policy	Description
AWS Lambda – Depressed Invocation Count	The number of calls to the function (invocations) have been lower than expected for at least the last 10 minutes.
AWS Lambda – Elevated Invocation Count	The number of calls to the function (invocations) have been greater than expected for at least the last 30 minutes.
AWS Lambda – Elevated Latency	The average duration per function call (latency) has been higher than expected for at least the past 30 minutes.

Each of the policies can be viewed in more detail by clicking on the policy name. Some of the aspects which can be configured include:

Scope, which can be used to filter which elements this policy will be applied to.
Conditions, which specify the duration of a condition that qualifies as an alertable event, as well as the conditions themselves.
Description
Notifications, which are not configured by default, but can be configured to produce notifications through:
- Email
- HipChat
- OpsGenie
- PagerDuty
- SNS
- Slack
- Webhook

Going Beyond Monitoring AWS Lambdas

Hopefully this article provides enough information to get started with monitoring AWS Lambdas using Metricly. What I’ve covered here barely scratches the surface of what Metricly offers in terms of monitoring and alerting. Aside from the integration and monitoring of Lambdas, EC2 instances, and other offerings of the AWS ecosystem, one thing which really impressed me with Metricly is the content and the quality of the context-specific help system. Clicking on the Help link in the top right corner of each page opens a new tab with definitions, screenshots, and helpful tips related to the topic you’re viewing.

Learn more

About Metricly

Metricly coaches users throughout their cloud journey to organize, plan, analyze, and optimize their public cloud resources.

Try Metricly Free

About the Author

Mike Mackrory

Mike Mackrory is a Global citizen who has settled down in the Pacific Northwest – for now. By day he works as a Senior Engineer on a Quality Engineering team and by night he writes, consults on several web based projects and runs a marginally successful eBay sticker business.