Important
Enable the AWS CloudWatch Metric Streams integration to monitor all CloudWatch metrics from your AWS services, including custom namespaces. Individual integrations are no longer our recommended option.
Currently, AWS CloudWatch metrics support only AWS Glue Spark ETL jobs. They do not support AWS Glue Python Shell jobs. An active feature request has been submitted to the AWS Glue team to include this support.
New Relic offers an integration for reporting your AWS Glue data. This document explains how to activate this integration and describes the data that can be reported.
Activate integration
To enable this integration follow standard procedures to Connect AWS services to New Relic.
Configuration and polling
You can change the polling frequency and filter data using configuration options.
Default polling information for the AWS Glue integration:
- New Relic polling interval: 5 minutes
- Amazon CloudWatch data interval: 1 minute
Find and use data
To find your integration data, go to one.newrelic.com > All capabilities > Infrastructure > AWS and select an integration.
You can query and explore your data using the following event type:
| Entity | Event Type | Provider | 
|---|---|---|
| Job | 
 | 
 | 
For more on how to use your data, see Understand and use integration data.
Metric data
This integration collects AWS Glue data for Job.
Glue Job data
| Metric | Unit | Description | 
|---|---|---|
| 
 | Bytes | The number of bytes read from all data sources by all completed Spark tasks running in all executors. | 
| 
 | Milliseconds | The ETL elapsed time in milliseconds (does not include the job bootstrap times). | 
| 
 | Count | The number of completed stages in the job. | 
| 
 | Count | The number of completed tasks in the job. | 
| 
 | Count | The number of failed tasks. | 
| 
 | Count | The number of tasks killed. | 
| 
 | Count | The number of records read from all data sources by all completed Spark tasks running in all executors. | 
| 
 | Bytes | The number of bytes written by all executors to shuffle data between them since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes written for this purpose during the previous minute). | 
| 
 | Bytes | The number of bytes read by all executors to shuffle data between them since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes read for this purpose during the previous minute). | 
| 
 | Bytes | The number of megabytes of disk space used across all executors. | 
| 
 | Count | The number of actively running job executors. | 
| 
 | Count | The number of maximum (actively running and pending) job executors needed to satisfy the current load. | 
| 
 | Percent | The fraction of memory used by the JVM heap for this driver (scale: 0-1) for driver. | 
| 
 | Percent | The fraction of memory used by the JVM heap for this driver (scale: 0-1) for ALL executors. | 
| 
 | Bytes | The number of memory bytes used by the JVM heap for the driver. | 
| 
 | Bytes | The number of memory bytes used by the JVM heap for ALL executors. | 
| 
 | Bytes | The number of bytes read from Amazon S3 by the driver since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes read during the previous minute). | 
| 
 | Bytes | The number of bytes read from Amazon S3 by ALL executors since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes read during the previous minute). | 
| 
 | Bytes | The number of bytes written to Amazon S3 by the driver since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes written during the previous minute). | 
| 
 | Bytes | The number of bytes written to Amazon S3 by ALL executors since the previous report (aggregated by the AWS Glue Metrics Dashboard as the number of bytes written during the previous minute). | 
| 
 | Percent | The fraction of CPU system load used (scale: 0-1) by the driver. | 
| 
 | Percent | The fraction of CPU system load used (scale: 0-1) by the ALL executors. |