Cassandra monitoring integration

Our Cassandra integration sends performance metrics and inventory data from your Cassandra database to the New Relic platform. You can view pre-built dashboards of your Cassandra metric data, create alert policies, and create your own custom queries and charts.

Read on to install the integration, and to see what data we collect.

Compatibility and requirements

Our integration is compatible with Apache Cassandra version 3.x or higher.

Before installing the integration, make sure that you meet the following requirements:

Quick start: Use our guided install

Instrument your Cassandra database quickly and send your telemetry data with guided install. Our guided install creates a customized CLI command for your environment that downloads and installs the New Relic CLI and the infrastructure agent.

Ready to get started? Click one of these buttons to try it out.

Guided install

Our guided install uses the infrastructure agent to set up the Cassandra integration. Not only that, it discovers other applications and log sources running in your environment and then recommends which ones you should instrument.

The guided install works with most setups. But if it doesn't suit your needs, you can find other methods below to get started monitoring your Cassandra database.

Install and activate

To install the Cassandra integration, follow the instructions for your environment:

Additional notes:

Configure the integration

An integration's YAML-format configuration is where you place required login credentials and configure how data is collected. Which options you change depends on your setup and preferences.

There are several ways to configure the integration, depending on how it was installed:

Commands

The configuration accepts the following commands:

  • metrics: Captures the metrics for a particular Cassandra node, including required login info.
  • inventory: Captures the configuration parameters set in the Cassandra config file as inventory data. To disable the collection of inventory data, delete the inventory parameter.
  • labels: The env label controls the environment attribute. The default value is production.

Arguments

The metrics command accepts the following arguments:

  • hostname: The Cassandra node hostname.
  • port: The port where Cassandra is listening and exposing metrics and variables via JMX.
  • username: The username to connect to Cassandra via JMX.
  • password: The password to connect to Cassandra via JMX.
  • timeout: The request timeout, in milliseconds.
  • key_store: The filepath of the keystore containing the JMX client's SSL certificate.
  • key_store_password: The password for the JMX SSL key store.
  • trust_store: The filepath of the trust keystore containing the JMX server's SSL certificate.
  • trust_store_password: The password for the JMX trust store.

The inventory command accepts the following arguments:

  • hostname: The Cassandra node hostname.
  • config_path: The path to the Cassandra config file.
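As a quick sanity check before deploying a configuration, the argument lists above can be encoded and used to flag misspelled or misplaced keys. This is a minimal sketch, not part of the integration: the function name `unknown_arguments` and the dictionary layout are hypothetical, and only the command and argument names come from the lists above.

```python
# Argument names accepted by each command, copied from the lists above.
ALLOWED_ARGUMENTS = {
    "metrics": {
        "hostname", "port", "username", "password", "timeout",
        "key_store", "key_store_password",
        "trust_store", "trust_store_password",
    },
    "inventory": {"hostname", "config_path"},
}


def unknown_arguments(command, arguments):
    """Return the sorted argument names that `command` does not document."""
    if command not in ALLOWED_ARGUMENTS:
        raise ValueError(f"unknown command: {command!r}")
    return sorted(set(arguments) - ALLOWED_ARGUMENTS[command])


# Hypothetical values for illustration; 7199 is Cassandra's default JMX port.
print(unknown_arguments("metrics", {"hostname": "localhost", "port": 7199}))
print(unknown_arguments("inventory", {"hostname": "localhost", "port": 7199}))
```

The second call reports `port`, because the inventory command only documents hostname and config_path.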

Activate remote monitoring

The remote_monitoring parameter enables remote monitoring and multi-tenancy for this integration.

Activating remote_monitoring may change some attributes and/or affect your configured alerts. For more information, see remote monitoring in on-host integrations.

Important

Infrastructure agent version 1.2.25 or higher is required to use remote_monitoring.
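When checking the version requirement in a script, compare the version components numerically rather than as strings, since a plain string comparison would rank "1.10.0" below "1.2.25". The sketch below assumes the agent version is available as a plain dotted numeric string; the function names are hypothetical.

```python
# Minimum infrastructure agent version for remote_monitoring, as stated above.
MINIMUM_AGENT_VERSION = (1, 2, 25)


def parse_version(text):
    """Turn a dotted numeric version such as '1.2.25' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def supports_remote_monitoring(agent_version):
    """Return True if the given agent version meets the minimum."""
    return parse_version(agent_version) >= MINIMUM_AGENT_VERSION


for version in ("1.2.9", "1.2.25", "1.10.0"):
    print(version, supports_remote_monitoring(version))
```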

Environment variable passthroughs

Environment variables can be used to control config settings, and are then passed through to the infrastructure agent. For instructions, see Configure the infrastructure agent.

Important

With secrets management, you can configure on-host integrations with New Relic Infrastructure's agent to use sensitive data (such as passwords) without having to write them as plain text into the integration's configuration file. For more information, see Secrets management.

For more about the general structure of on-host integration configuration, see Configuration.

Find and use data

Data from this service is reported to an integration dashboard.

Cassandra data is attached to the CassandraSample and CassandraColumnFamilySample event types. You can query this data for troubleshooting purposes or to create charts and dashboards.

For more on how to find and use your data, see Understand integration data.
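To illustrate querying these event types, the sketch below assembles NRQL query strings in Python. The helper name `average_query` and the 30-minute window are illustrative choices; the event types and attribute names are the ones documented on this page. You would paste the resulting string into the query builder or send it through whichever API client you use.

```python
def average_query(attribute, event_type, facet=None, since="30 minutes ago"):
    """Build a NRQL query charting the average of one attribute over time."""
    query = f"SELECT average({attribute}) FROM {event_type}"
    if facet:
        # FACET splits the chart into one series per distinct value.
        query += f" FACET {facet}"
    query += f" SINCE {since} TIMESERIES"
    return query


# Node-level read throughput.
print(average_query("query.readRequestsPerSecond", "CassandraSample"))

# Per-column-family read throughput.
print(average_query(
    "query.readRequestsPerSecond",
    "CassandraColumnFamilySample",
    facet="db.keyspaceAndColumnFamily",
))
```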

Metric data

The Cassandra integration collects the following metrics.

Node metrics

Cassandra node metrics are attached to the CassandraSample event type. The Cassandra integration collects these node metrics:

Name

Description

db.allMemtablesOffHeapSizeBytes

Total number of bytes stored in the memtables (2i and pending flush memtables included) that reside off-heap.

db.allMemtablesOnHeapSizeBytes

Total number of bytes stored in the memtables (2i and pending flush memtables included) that reside on-heap.

db.commitLogCompletedTasksPerSecond

The number of commit log messages written per second.

db.commitLogPendingTasks

Number of commit log messages written but yet to be fsync’ed.

db.commitLogTotalSizeBytes

Current size, in bytes, used by all the commit log segments.

db.droppedRequestTypeMessagesPerSecond

Dropped messages per second for this type of request. RequestType can be any of the following: BatchRemove, BatchStore, CounterMutation, Hint, Mutation, PagedRange, RangeSlice, Read, ReadRepair, RequestResponse, or Trace.

db.keyCacheCapacityBytes

Key cache capacity in bytes.

db.keyCacheHitRate

One-minute key cache hit rate.

db.keyCacheHitsPerSecond

Number of key cache hits per second.

db.keyCacheRequestsPerSecond

Number of requests to the key cache per second.

db.keyCacheSizeBytes

Size of occupied cache in bytes.

db.liveSSTableCount

Number of SSTables on disk for this column family.

db.loadBytes

Size, in bytes, of the on-disk data this node manages.

db.rowCacheCapacityBytes

Row cache capacity in bytes.

db.rowCacheHitRate

One-minute row cache hit rate.

db.rowCacheHitsPerSecond

Number of row cache hits per second.

db.rowCacheRequestsPerSecond

Number of requests to the row cache per second.

db.rowCacheSizeBytes

Total size of occupied row cache, in bytes.

db.threadpool.poolActiveTasks

Number of tasks being actively worked on by this pool. pool can be one of the following:

  • internalAntiEntropyStage
  • internalCacheCleanupExecutor
  • internalCompactionExecutor
  • internalGossipStage
  • internalHintsDispatcher
  • internalInternalResponseStage
  • internalMemtableFlushWriter
  • internalMemtablePostFlush
  • internalMemtableReclaimMemory
  • internalMigrationStage
  • internalMiscStage
  • internalPendingRangeCalculator
  • internalSampler
  • internalSecondaryIndexManagement
  • internalValidationExecutor
  • requestCounterMutationStage
  • requestMutationStage
  • requestReadRepairStage
  • requestReadStage
  • requestRequestResponse
  • requestViewMutationStage

db.threadpool.poolPendingTasks

Number of tasks queued up on this pool. pool can be any of the items in the list provided in the description of db.threadpool.poolActiveTasks.

db.totalHintsInProgress

Number of hints currently attempting to be sent.

db.totalHintsPerSecond

Number of hint messages per second written to this node. Includes one entry for each host to be hinted per hint.

query.CASReadRequestsPerSecond

Number of transaction (compare-and-set) read requests per second.

query.CASWriteRequestsPerSecond

Number of transaction (compare-and-set) write requests per second.

query.rangeSliceRequestsPerSecond

Number of range slice requests per second.

query.rangeSliceTimeoutsPerSecond

Number of timeouts encountered per second when processing token range read requests.

query.rangeSliceUnavailablesPerSecond

Number of unavailable exceptions encountered per second when processing token range read requests.

query.readLatency50thPercentileMilliseconds

Read latency in milliseconds, 50th percentile.

query.readLatency75thPercentileMilliseconds

Read latency in milliseconds, 75th percentile.

query.readLatency95thPercentileMilliseconds

Read latency in milliseconds, 95th percentile.

query.readLatency98thPercentileMilliseconds

Read latency in milliseconds, 98th percentile.

query.readLatency999thPercentileMilliseconds

Read latency in milliseconds, 99.9th percentile.

query.readLatency99thPercentileMilliseconds

Read latency in milliseconds, 99th percentile.

query.readRequestsPerSecond

Number of read requests per second.

query.readTimeoutsPerSecond

Number of timeouts encountered per second when processing standard read requests.

query.readUnavailablesPerSecond

Number of unavailable exceptions encountered per second when processing standard read requests.

query.viewWriteRequestsPerSecond

Number of view write requests per second.

query.writeLatency50thPercentileMilliseconds

Write latency in milliseconds, 50th percentile.

query.writeLatency75thPercentileMilliseconds

Write latency in milliseconds, 75th percentile.

query.writeLatency95thPercentileMilliseconds

Write latency in milliseconds, 95th percentile.

query.writeLatency98thPercentileMilliseconds

Write latency in milliseconds, 98th percentile.

query.writeLatency999thPercentileMilliseconds

Write latency in milliseconds, 99.9th percentile.

query.writeLatency99thPercentileMilliseconds

Write latency in milliseconds, 99th percentile.

query.writeRequestsPerSecond

Number of write requests per second.

query.writeTimeoutsPerSecond

Number of timeouts encountered per second when processing regular write requests.

query.writeUnavailablesPerSecond

Number of unavailable exceptions encountered per second when processing regular write requests.
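Several names in the table above are patterns rather than literal attribute names. The sketch below expands them, assuming the placeholder is substituted directly into the name (RequestType in db.droppedRequestTypeMessagesPerSecond, pool in db.threadpool.poolActiveTasks). That substitution rule is inferred from the naming on this page, so confirm the resulting names against your own data before relying on them. The function names are hypothetical.

```python
# Request types listed in the description of db.droppedRequestTypeMessagesPerSecond.
REQUEST_TYPES = [
    "BatchRemove", "BatchStore", "CounterMutation", "Hint", "Mutation",
    "PagedRange", "RangeSlice", "Read", "ReadRepair", "RequestResponse", "Trace",
]

# Percentile labels as they appear in the latency metric names ("999th" is 99.9).
PERCENTILES = ["50th", "75th", "95th", "98th", "99th", "999th"]


def dropped_message_metrics():
    """One dropped-message metric name per request type (assumed pattern)."""
    return [f"db.dropped{rt}MessagesPerSecond" for rt in REQUEST_TYPES]


def latency_metrics(operation):
    """Latency percentile metric names for 'read' or 'write'."""
    return [f"query.{operation}Latency{p}PercentileMilliseconds" for p in PERCENTILES]


def active_task_metric(pool):
    """Active-task metric name for one thread pool (assumed pattern)."""
    return f"db.threadpool.{pool}ActiveTasks"


print(dropped_message_metrics()[0])
print(latency_metrics("read")[-1])
print(active_task_metric("requestReadStage"))
```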

Cassandra column family metrics and metadata

The Cassandra integration retrieves column family metrics and attaches them to the CassandraColumnFamilySample event type. The integration skips system keyspaces (system, system_auth, system_distributed, system_schema, system_traces, and OpsCenter). To limit the performance impact, it captures metrics for a maximum of 20 column families.
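The selection rule can be pictured with the following sketch, which is not the integration's actual code. It assumes column families arrive as (keyspace, column family) pairs and that the first 20 non-system entries are kept in input order; this page does not say which 20 are chosen, so treat the ordering as an assumption.

```python
# Keyspaces the integration skips, as listed above.
SYSTEM_KEYSPACES = {
    "system", "system_auth", "system_distributed",
    "system_schema", "system_traces", "OpsCenter",
}
MAX_COLUMN_FAMILIES = 20


def select_column_families(column_families):
    """Drop system keyspaces, then keep at most MAX_COLUMN_FAMILIES entries.

    `column_families` is an iterable of (keyspace, column_family) tuples.
    Keeping the first entries in input order is an assumption of this sketch.
    """
    selected = []
    for keyspace, column_family in column_families:
        if keyspace in SYSTEM_KEYSPACES:
            continue
        selected.append((keyspace, column_family))
        if len(selected) == MAX_COLUMN_FAMILIES:
            break
    return selected


# Hypothetical input: one system table plus 25 application tables.
candidates = [("system", "local")] + [("shop", f"table{i}") for i in range(25)]
print(len(select_column_families(candidates)))
```

If a cluster has more than 20 non-system column families, some of them will have no CassandraColumnFamilySample data, which is worth keeping in mind when a dashboard looks incomplete.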

The following metadata indicates the keyspace and column family associated with the sample metrics:

Name

Description

db.columnFamily

The Cassandra column family these metrics refer to.

db.keyspace

The Cassandra keyspace that contains this column family.

db.keyspaceAndColumnFamily

The keyspace and column family in a single metadata attribute in the following format: keyspace.columnFamily.
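If you export db.keyspaceAndColumnFamily values and need the two parts separately, you can split on the first dot. This sketch assumes the keyspace name itself contains no dot; the function name is hypothetical and the sample value is made up.

```python
def split_keyspace_and_column_family(value):
    """Split 'keyspace.columnFamily' into a (keyspace, column_family) tuple."""
    keyspace, separator, column_family = value.partition(".")
    if not separator or not keyspace or not column_family:
        raise ValueError(f"expected 'keyspace.columnFamily', got {value!r}")
    return keyspace, column_family


print(split_keyspace_and_column_family("shop.orders"))
```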

The metrics below refer to the keyspace and column family specified in the metadata above:

Name

Description

db.allMemtablesOffHeapSizeBytes

Total number of bytes stored in the memtables (2i and pending flush memtables included) that reside off-heap.

db.allMemtablesOnHeapSizeBytes

Total number of bytes stored in the memtables (2i and pending flush memtables included) that reside on-heap.

db.liveDiskSpaceUsedBytes

Disk space, in bytes, used by SSTables belonging to this column family.

db.liveSSTableCount

Number of SSTables on disk for this column family.

db.pendingCompactions

Estimate of number of pending compactions for this column family.

db.SSTablesPerRead50thPercentileMilliseconds

Number of SSTable data files accessed per read, 50th percentile.

db.SSTablesPerRead75thPercentileMilliseconds

Number of SSTable data files accessed per read, 75th percentile.

db.SSTablesPerRead95thPercentileMilliseconds

Number of SSTable data files accessed per read, 95th percentile.

db.SSTablesPerRead98thPercentileMilliseconds

Number of SSTable data files accessed per read, 98th percentile.

db.SSTablesPerRead999thPercentileMilliseconds

Number of SSTable data files accessed per read, 99.9th percentile.

db.SSTablesPerRead99thPercentileMilliseconds

Number of SSTable data files accessed per read, 99th percentile.

query.readLatency50thPercentileMilliseconds

Local read latency in milliseconds for this column family, 50th percentile.

query.readLatency75thPercentileMilliseconds

Local read latency in milliseconds for this column family, 75th percentile.

query.readLatency95thPercentileMilliseconds

Local read latency in milliseconds for this column family, 95th percentile.

query.readLatency98thPercentileMilliseconds

Local read latency in milliseconds for this column family, 98th percentile.

query.readLatency999thPercentileMilliseconds

Local read latency in milliseconds for this column family, 99.9th percentile.

query.readLatency99thPercentileMilliseconds

Local read latency in milliseconds for this column family, 99th percentile.

query.readRequestsPerSecond

Number of read requests per second for this column family.

query.writeLatency50thPercentileMilliseconds

Local write latency in milliseconds for this column family, 50th percentile.

query.writeLatency75thPercentileMilliseconds

Local write latency in milliseconds for this column family, 75th percentile.

query.writeLatency95thPercentileMilliseconds

Local write latency in milliseconds for this column family, 95th percentile.

query.writeLatency98thPercentileMilliseconds

Local write latency in milliseconds for this column family, 98th percentile.

query.writeLatency999thPercentileMilliseconds

Local write latency in milliseconds for this column family, 99.9th percentile.

query.writeLatency99thPercentileMilliseconds

Local write latency in milliseconds for this column family, 99th percentile.

query.writeRequestsPerSecond

Number of write requests per second for this column family.

Inventory

The integration captures configuration options defined in the Cassandra configuration and reports them as inventory data in the New Relic UI.

System metadata

The Cassandra integration also collects these attributes about the service and its configuration:

Name

Description

software.version

The Cassandra version.

cluster.name

The name of the cluster this Cassandra node belongs to.

Troubleshooting

Troubleshooting via jmxterm

JMXTerm is an interactive command-line tool bundled with the integration package.

Docs for JMXTerm can be found on our nrjmx page on GitHub.

Check the source code

This integration is open source software. That means you can browse its source code and send improvements, or create your own fork and build it.

For more help

If you need more help, check out these support and learning resources:

Copyright © 2020 New Relic Inc.