Hadoop JMX π
The Splunk Distribution of OpenTelemetry Collector uses the Smart Agent receiver with the Hadoop JMX monitor type to collect metrics from Hadoop 2.0 or higher clusters.
This integration produces metrics from a set of built-in MBeans available for the respective nodeTypes
:
Name Nodes (
nameNode
)Resource Manager (
resourceManager
)Node Manager (
nodeManager
)Data Nodes (
dataNode
)
This integration uses the collectd GenericJMX plugin. You can also configure the hadoop
monitor to collect additional metrics about the Hadoop cluster using the REST API.
This integration is only available on Kubernetes and Linux.
Benefits π
After you configure the integration, you can access these features:
View metrics. You can create your own custom dashboards, and most monitors provide built-in dashboards as well. For information about dashboards, see View dashboards in Observability Cloud.
View a data-driven visualization of the physical servers, virtual machines, AWS instances, and other resources in your environment that are visible to Infrastructure Monitoring. For information about navigators, see Splunk Infrastructure Monitoring navigators.
Access the Metric Finder and search for metrics sent by the monitor. For information, see Use the Metric Finder.
Requirements π
To activate JMX in Hadoop, add the following Java Virtual Machine (JVM) options to hadoop-env.sh and yarn-env.sh:
hadoop-env.sh:
export HADOOP_NAMENODE_OPTS="-Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.port=5677 $HADOOP_NAMENODE_OPTS"
export HADOOP_DATANODE_OPTS="-Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.port=5679 $HADOOP_DATANODE_OPTS"
yarn-env.sh:
export YARN_NODEMANAGER_OPTS="-Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.port=8002 $YARN_NODEMANAGER_OPTS"
export YARN_RESOURCEMANAGER_OPTS="-Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.port=5680 $YARN_RESOURCEMANAGER_OPTS"
Installation π
Follow these steps to deploy this integration:
Deploy the Splunk Distribution of OpenTelemetry Collector to your host or container platform:
Configure the monitor, as described in the Configuration section.
Restart the Splunk Distribution of OpenTelemetry Collector.
Configuration π
To use this integration of a Smart Agent monitor with the Collector:
Include the Smart Agent receiver in your configuration file.
Add the monitor type to the Collector configuration, both in the receiver and pipelines sections.
Read more on how to Use Smart Agent monitors with the Collector.
See how to set up the Smart Agent receiver.
Learn about config options in Collector default configuration.
Example π
To activate this integration, add the following to your Collector configuration:
receivers:
smartagent/ collectd/hadoopjmx:
type: collectd/hadoopjmx
... # Additional config
Next, add the monitor to the service > pipelines > metrics > receivers
section of your configuration file:
service:
pipelines:
metrics:
monitors: [smartagent/collectd/hadoopjmx]
Configuration options π
The following table shows the configuration options for the collectd/hadoopjmx receiver:
Option |
Required |
Type |
Description |
---|---|---|---|
|
yes |
|
Host to connect to. JMX must be configured for remote access and be accessible from the agent. |
|
yes |
|
JMX connection port on the application. Not the RMI port. This correponds to the |
|
no |
|
|
|
no |
|
How the service type is identified in Splunk Observability Cloud so that you can get built-in content for it. For custom JMX integrations, set it to an arbitrary value. |
|
no |
|
The JMX connection string. Rendered as a Go template. Has access to the other values in this configuration. Under normal circumstances, donβt set this string directly and use the host and port settings instead. The default value is |
|
no |
|
Prefixes the generated plugin instance with a prefix. If a second |
|
no |
|
Username to authenticate to the server. |
|
no |
|
User password to authenticate to the server. |
|
no |
|
Takes in key-values pairs of custom dimensions at the connection level. |
|
no |
|
A list of the MBeans to be collected, as defined in |
|
no |
|
A list of the MBeans to omit. This can be useful when only a few MBeans need to omitted from the default list. |
|
no |
|
Specifies how to map JMX MBean values to metrics. Specific service monitors such as Cassandra, Kafka, or Activemq, are configured with a set of mappings: additional mappings are merged with those. See the official collectd GenericJMX documentation for more details. |
|
yes |
|
Hadoop nodeType. |
The nested mBeanDefinitions
configuration object has the following fields:
Option |
Required |
Type |
Description |
---|---|---|---|
|
no |
|
Sets the pattern used to retrieve MBeans from the MBeanServer. If more than one MBean is returned, use the |
|
no |
|
Prefixes the generated plugin instance with a prefix. |
|
no |
|
The object names used by JMX to identify MBeans include properties, which are in the form of key-value-pairs. If the given object name is not unique and multiple MBeans are returned, the values of those properties might differ. Use this option to build the plugin instance from the appropriate property values. To generate the plugin instance from multiple property values, use multiple instances of this setting. |
|
no |
|
The |
|
no |
|
A list of strings for the dimensions. |
The nested values
config object has the following fields:
Option |
Required |
Type |
Description |
---|---|---|---|
|
no |
|
Sets the dataset used within collectd to handle the values of the MBean attribute. |
|
no |
|
Whether the returned attribute is a composite type. If set to |
|
no |
|
Similar to the |
|
no |
|
Similar to the |
|
no |
|
The name of the attribute from which the value is read. You can access the keys of composite types by using a dot to concatenate the key name to the attribute name. For example, |
|
no |
|
The plural form of the |
Metrics π
The following metrics are available for this integration:
Notes π
Learn more about the available metric types in Observability Cloud.
Default metrics are those metrics included in host-based subscriptions in Observability Cloud, such as host, container, or bundled metrics. Custom metrics are not provided by default and might be subject to charges. See more about metric categories.
To add additional metrics, see how to configure
extraMetrics
using the Collector.
Troubleshooting π
If you are not able to see your data in Splunk Observability Cloud, try these tips:
Submit a case in the Splunk Support Portal
Available to Splunk Observability Cloud customers
-
Available to Splunk Observability Cloud customers
Ask a question and get answers through community support at Splunk Answers
Available to Splunk Observability Cloud customers and free trial users
Join the Splunk #observability user group Slack channel to communicate with customers, partners, and Splunk employees worldwide
Available to Splunk Observability Cloud customers and free trial users
To learn how to join, see Get Started with Splunk Community - Chat groups
To learn about even more support options, see Splunk Customer Success.