
Pivotal Cloud Foundry

Metadata associated with the Pivotal Cloud Foundry Integration can be found here. The relevant code for the integration can be found here.

DESCRIPTION

Use this integration to monitor a Pivotal Cloud Foundry deployment. This integration provides metrics about the performance of the various components that make up Pivotal Cloud Foundry.

FEATURES

Infrastructure Page

  • Infrastructure Navigator: On the Infrastructure page in SignalFx, the Infrastructure Navigator visualizes Cloud Foundry instances as squares, colored by metrics including CPU, disk, and network. Additional views are provided for focus on the health and performance of specific Cloud Foundry services, as well as Garden Containers. Click here to read more about the Infrastructure Page.

You can quickly see the status of all of the VMs in your Cloud Foundry cluster:

image1

Here is a sample of the view for Garden Containers:

image2

Built-in dashboards

This integration includes built-in dashboards listed under Cloud Foundry on the Dashboards page in SignalFx. Here are some examples:

  • Key Capacity Scaling Indicators: Helps you figure out whether you need to add resources to your cluster.

image3

  • Diego: Metrics about Diego, including many key performance indicators (KPIs).

image4

  • Garden Containers: High-level look at the Garden container system

image5

And many more...

REQUIREMENTS AND DEPENDENCIES

This integration requires administrative access to a Pivotal Cloud Foundry deployment. Pivotal Web Services is not supported. Versions known to work are:

Software                  Version
Pivotal Ops Manager       1.9.1+
Pivotal Elastic Runtime   1.9.0+
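
The "+" in the table means that version or any later one. As a minimal sketch of that comparison, the Python below assumes version strings are plain dotted integers such as "1.9.1". The names MINIMUM_VERSIONS, parse_version, and meets_minimum are hypothetical and are not part of any Pivotal or SignalFx tooling.

```python
from typing import Dict, Tuple

# Minimum versions copied from the requirements table.
MINIMUM_VERSIONS: Dict[str, Tuple[int, ...]] = {
    "Pivotal Ops Manager": (1, 9, 1),
    "Pivotal Elastic Runtime": (1, 9, 0),
}


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a dotted version such as "1.9" into a tuple padded to three parts."""
    parts = [int(part) for part in text.strip().split(".")]
    parts += [0] * (3 - len(parts))  # "1.9" is treated as "1.9.0"
    return tuple(parts)


def meets_minimum(software: str, installed: str) -> bool:
    """Return True if the installed version is at or above the listed minimum."""
    return parse_version(installed) >= MINIMUM_VERSIONS[software]


# Hypothetical installed versions, for illustration only.
print(meets_minimum("Pivotal Ops Manager", "1.9.0"))      # False
print(meets_minimum("Pivotal Elastic Runtime", "1.10.2"))  # True
```

Comparing integer tuples rather than raw strings avoids ranking "1.10" below "1.9".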

INSTALLATION

Follow these steps to enable this integration:

  1. Download the product file from Pivotal Network.
  2. Follow the installation instructions for the tile.

Metrics from Pivotal Cloud Foundry should begin streaming into SignalFx.

To monitor services running within Garden containers (e.g. web servers), you will
need to deploy our agent in those containers along with the CF meta-buildpack.
To get our agent onto your own BOSH deployments, you can use our BOSH
release.

METRICS

Below is a list of all metrics.
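
Each entry in the list has a type, either gauge or cumulative_counter. A gauge is a point-in-time reading, while a cumulative counter reports a running total, so a rate has to be derived from successive samples. Some counters also reset when the emitting process restarts (the cc.http_status.* descriptions note this). The Python below is a minimal sketch of that arithmetic, assuming samples are available as (timestamp in seconds, value) pairs. The function name and the sample data are hypothetical and are not part of the integration.

```python
from typing import Iterable, List, Optional, Tuple

Sample = Tuple[float, float]  # (timestamp in seconds, cumulative value)


def counter_rates(samples: Iterable[Sample]) -> List[float]:
    """Return per-second rates between consecutive cumulative-counter samples.

    A drop in the value is treated as a counter reset (for example after a
    process restart), in which case the new value is counted from zero.
    """
    rates: List[float] = []
    prev: Optional[Sample] = None
    for ts, value in samples:
        if prev is not None:
            prev_ts, prev_value = prev
            elapsed = ts - prev_ts
            if elapsed > 0:
                # After a reset, only the new value is known to have accrued.
                delta = value - prev_value if value >= prev_value else value
                rates.append(delta / elapsed)
        prev = (ts, value)
    return rates


# Hypothetical samples taken 30 seconds apart; the counter resets before the last one.
samples = [(0, 100), (30, 160), (60, 190), (90, 15)]
print(counter_rates(samples))  # [2.0, 1.0, 0.5]
```

Treating a decrease as a reset slightly undercounts, because anything counted between the last sample and the restart is lost, but it avoids reporting a large negative rate.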

Metric Name Brief Type
DopplerServer.LinuxFileDescriptor Number of file handles for Doppler's process. gauge
DopplerServer.TruncatingBuffer.DroppedMessages Number of messages intentionally dropped by Doppler for a specific sink. This counter event corresponds with the log message "Log message output is too high." Emitted every 5 seconds. gauge
DopplerServer.TruncatingBuffer.totalDroppedMessages Lifetime total number of messages intentionally dropped by Doppler from all of its sinks due to back pressure. Emitted every 5 seconds. gauge
DopplerServer.Uptime Uptime for Doppler's process. gauge
DopplerServer.dropsondeListener.currentBufferCount DEPRECATED cumulative_counter
DopplerServer.dropsondeListener.receivedByteCount DEPRECATED in favor of DopplerServer.udpListener.receivedByteCount. cumulative_counter
DopplerServer.dropsondeListener.receivedMessageCount DEPRECATED in favor of DopplerServer.udpListener.receivedMessageCount. cumulative_counter
DopplerServer.dropsondeUnmarshaller.containerMetricReceived Lifetime number of ContainerMetric messages unmarshalled. cumulative_counter
DopplerServer.dropsondeUnmarshaller.counterEventReceived Lifetime number of CounterEvent messages unmarshalled. cumulative_counter
DopplerServer.dropsondeUnmarshaller.errorReceived Lifetime number of Error messages unmarshalled. gauge
DopplerServer.dropsondeUnmarshaller.heartbeatReceived DEPRECATED gauge
DopplerServer.dropsondeUnmarshaller.httpStartStopReceived Lifetime number of HttpStartStop messages unmarshalled. cumulative_counter
DopplerServer.dropsondeUnmarshaller.logMessageTotal Lifetime number of LogMessage messages unmarshalled. cumulative_counter
DopplerServer.dropsondeUnmarshaller.unmarshalErrors Lifetime number of errors when unmarshalling messages. gauge
DopplerServer.dropsondeUnmarshaller.valueMetricReceived Lifetime number of ValueMetric messages unmarshalled. cumulative_counter
DopplerServer.httpServer.receivedMessages Number of messages received by Doppler's internal MessageRouter. Emitted every 5 seconds. cumulative_counter
DopplerServer.listeners.totalReceivedMessageCount Total number of messages received across all of Doppler's listeners (UDP, TCP, TLS). cumulative_counter
DopplerServer.memoryStats.lastGCPauseTimeNS Duration of the last Garbage Collector pause in nanoseconds. gauge
DopplerServer.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
DopplerServer.memoryStats.numBytesAllocatedHeap Number of bytes allocated on the heap gauge
DopplerServer.memoryStats.numBytesAllocatedStack Number of bytes allocated on the stack gauge
DopplerServer.memoryStats.numFrees Lifetime number of memory deallocations. gauge
DopplerServer.memoryStats.numMallocs Lifetime number of memory allocations. gauge
DopplerServer.messageRouter.numberOfContainerMetricSinks Instantaneous number of container metric sinks known to the SinkManager. Emitted every 5 seconds. gauge
DopplerServer.messageRouter.numberOfDumpSinks Instantaneous number of dump sinks known to the SinkManager. Emitted every 5 seconds. gauge
DopplerServer.messageRouter.numberOfFirehoseSinks Instantaneous number of firehose sinks known to the SinkManager. Emitted every 5 seconds. gauge
DopplerServer.messageRouter.numberOfSyslogSinks Instantaneous number of syslog sinks known to the SinkManager. gauge
DopplerServer.messageRouter.numberOfWebsocketSinks Instantaneous number of WebSocket sinks known to the SinkManager. Emitted every 5 seconds. gauge
DopplerServer.messageRouter.totalDroppedMessages Lifetime number of messages dropped inside Doppler for various reasons (downstream consumer can't keep up, internal object wasn't ready for message, etc.). gauge
DopplerServer.numCpus Number of CPUs on the machine. gauge
DopplerServer.numGoRoutines Instantaneous number of active goroutines in the Doppler process. gauge
DopplerServer.sentMessagesFirehose.SUBSCRIPTION_ID Number of sent messages through the firehose per subscription id. Emitted every 5 seconds. gauge
DopplerServer.sentMessagesFirehose Number of messages sent on the Firehose cumulative_counter
DopplerServer.signatureVerifier.invalidSignatureErrors Lifetime number of messages received with an invalid signature. gauge
DopplerServer.signatureVerifier.missingSignatureErrors Lifetime number of messages received that are too small to contain a signature. gauge
DopplerServer.signatureVerifier.validSignatures Lifetime number of messages received with valid signatures. cumulative_counter
DopplerServer.tcpListener.receivedByteCount Lifetime number of bytes received by Doppler's TCP Listener. Emitted every 5 seconds. cumulative_counter
DopplerServer.tcpListener.receivedErrorCount Lifetime number of errors encountered by Doppler's TCP Listener while handshaking, decoding or reading from the connection. cumulative_counter
DopplerServer.tcpListener.receivedMessageCount Lifetime number of messages received by Doppler's TCP Listener. Emitted every 5 seconds. cumulative_counter
DopplerServer.tlsListener.receivedByteCount Lifetime number of bytes received by Doppler's TLS Listener. Emitted every 5 seconds. cumulative_counter
DopplerServer.tlsListener.receivedErrorCount Lifetime number of errors encountered by Doppler's TLS Listener while handshaking, decoding or reading from the connection. cumulative_counter
DopplerServer.tlsListener.receivedMessageCount Lifetime number of messages received by Doppler's TLS Listener. Emitted every 5 seconds. cumulative_counter
DopplerServer.udpListener.receivedByteCount Lifetime number of bytes received by Doppler's UDP Listener. cumulative_counter
DopplerServer.udpListener.receivedErrorCount Lifetime number of errors encountered by Doppler's UDP Listener while reading from the connection. cumulative_counter
DopplerServer.udpListener.receivedMessageCount Lifetime number of messages received by Doppler's UDP Listener. cumulative_counter
LoggregatorTrafficController.LinuxFileDescriptor Number of file handles for the Traffic Controller's process. gauge
LoggregatorTrafficController.Uptime Uptime for the Traffic Controller's process. Emitted every 30 seconds. gauge
LoggregatorTrafficController.dopplerProxy.containermetricsLatency Duration for serving container metrics via the containermetrics endpoint (milliseconds). Emitted every 30 seconds. gauge
LoggregatorTrafficController.dopplerProxy.recentlogsLatency Duration for serving recent logs via the recentLogs endpoint (milliseconds). Emitted every 30 seconds. gauge
LoggregatorTrafficController.memoryStats.lastGCPauseTimeNS Duration of the last Garbage Collector pause in nanoseconds. gauge
LoggregatorTrafficController.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
LoggregatorTrafficController.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
LoggregatorTrafficController.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
LoggregatorTrafficController.memoryStats.numFrees Lifetime number of memory deallocations. gauge
LoggregatorTrafficController.memoryStats.numMallocs Lifetime number of memory allocations. gauge
LoggregatorTrafficController.numCPUS Number of CPUs on the machine. gauge
LoggregatorTrafficController.numGoRoutines Instantaneous number of active goroutines in the Traffic Controller process. gauge
MetronAgent.DopplerForwarder.sentMessages Lifetime number of messages sent to Doppler regardless of protocol. Emitted every 30 seconds. cumulative_counter
MetronAgent.MessageAggregator.counterEventReceived Lifetime number of CounterEvents aggregated in Metron. cumulative_counter
MetronAgent.MessageBuffer.droppedMessageCount Lifetime number of intentionally dropped messages from Metron's batch writer buffer. Batch writing is performed over TCP/TLS only. cumulative_counter
MetronAgent.dropsondeAgentListener.currentBufferCount Instantaneous number of Dropsonde messages read by UDP socket but not yet unmarshalled. cumulative_counter
MetronAgent.dropsondeAgentListener.receivedByteCount Lifetime number of bytes of Dropsonde messages read by UDP socket. cumulative_counter
MetronAgent.dropsondeAgentListener.receivedMessageCount Lifetime number of Dropsonde messages read by UDP socket. cumulative_counter
MetronAgent.dropsondeMarshaller.containerMetricMarshalled Lifetime number of ContainerMetric messages marshalled. gauge
MetronAgent.dropsondeMarshaller.counterEventMarshalled Lifetime number of CounterEvent messages marshalled. cumulative_counter
MetronAgent.dropsondeMarshaller.errorMarshalled Lifetime number of Error messages marshalled. gauge
MetronAgent.dropsondeMarshaller.heartbeatMarshalled Lifetime number of Heartbeat messages marshalled. gauge
MetronAgent.dropsondeMarshaller.httpStartStopMarshalled Lifetime number of HttpStartStop messages marshalled. gauge
MetronAgent.dropsondeMarshaller.logMessageMarshalled Lifetime number of LogMessage messages marshalled. gauge
MetronAgent.dropsondeMarshaller.marshalErrors Lifetime number of errors when marshalling messages. gauge
MetronAgent.dropsondeMarshaller.valueMetricMarshalled Lifetime number of ValueMetric messages marshalled. gauge
MetronAgent.dropsondeUnmarshaller.containerMetricReceived Lifetime number of ContainerMetric messages unmarshalled. cumulative_counter
MetronAgent.dropsondeUnmarshaller.counterEventReceived Lifetime number of CounterEvent messages unmarshalled. cumulative_counter
MetronAgent.dropsondeUnmarshaller.errorReceived Lifetime number of Error messages unmarshalled. gauge
MetronAgent.dropsondeUnmarshaller.heartbeatReceived DEPRECATED gauge
MetronAgent.dropsondeUnmarshaller.httpStartStopReceived Lifetime number of HttpStartStop messages unmarshalled. cumulative_counter
MetronAgent.dropsondeUnmarshaller.logMessageTotal Lifetime number of LogMessage messages unmarshalled. cumulative_counter
MetronAgent.dropsondeUnmarshaller.unmarshalErrors Lifetime number of errors when unmarshalling messages. gauge
MetronAgent.dropsondeUnmarshaller.valueMetricReceived Lifetime number of ValueMetric messages unmarshalled. cumulative_counter
MetronAgent.legacyAgentListener.currentBufferCount Instantaneous number of Legacy messages read by UDP socket but not yet unmarshalled. cumulative_counter
MetronAgent.legacyAgentListener.receivedByteCount Lifetime number of bytes of Legacy messages read by UDP socket. cumulative_counter
MetronAgent.legacyAgentListener.receivedMessageCount Lifetime number of Legacy messages read by UDP socket. cumulative_counter
MetronAgent.memoryStats.lastGCPauseTimeNS Duration of the last Garbage Collector pause in nanoseconds. gauge
MetronAgent.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
MetronAgent.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
MetronAgent.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
MetronAgent.memoryStats.numFrees Lifetime number of memory deallocations. gauge
MetronAgent.memoryStats.numMallocs Lifetime number of memory allocations. gauge
MetronAgent.numCpus Number of CPUs on the machine. gauge
MetronAgent.numGoRoutines Instantaneous number of active goroutines in the Metron Agent process. gauge
MetronAgent.tcp.sendErrorCount Lifetime number of errors if writing to Doppler over TCP fails. cumulative_counter
MetronAgent.tcp.sentByteCount Lifetime number of sent bytes to Doppler over TCP. cumulative_counter
MetronAgent.tcp.sentMessageCount Lifetime number of sent messages to Doppler over TCP. cumulative_counter
MetronAgent.tls.sendErrorCount Lifetime number of errors if writing to Doppler over TLS fails. cumulative_counter
MetronAgent.tls.sentByteCount Lifetime number of sent bytes to Doppler over TLS. Emitted every 30 seconds. cumulative_counter
MetronAgent.tls.sentMessageCount Lifetime number of sent messages to Doppler over TLS. Emitted every 30 seconds. cumulative_counter
MetronAgent.udp.sendErrorCount Lifetime number of errors if writing to Doppler over UDP fails. cumulative_counter
MetronAgent.udp.sentByteCount Lifetime number of sent bytes to Doppler over UDP. cumulative_counter
MetronAgent.udp.sentMessageCount Lifetime number of sent messages to Doppler over UDP. cumulative_counter
auctioneer.AuctioneerFetchStatesDuration Time in nanoseconds that the auctioneer took to fetch state from all the cells when running its auction. Emitted every 30 seconds during each auction. gauge
auctioneer.AuctioneerLRPAuctionsFailed Cumulative number of LRP instances that the auctioneer failed to place on Diego cells. Emitted every 30 seconds during each auction. cumulative_counter
auctioneer.AuctioneerLRPAuctionsStarted Cumulative number of LRP instances that the auctioneer successfully placed on Diego cells. Emitted every 30 seconds during each auction. cumulative_counter
auctioneer.AuctioneerTaskAuctionsFailed Cumulative number of Tasks that the auctioneer failed to place on Diego cells. Emitted every 30 seconds during each auction. cumulative_counter
auctioneer.AuctioneerTaskAuctionsStarted Cumulative number of Tasks that the auctioneer successfully placed on Diego cells. Emitted every 30 seconds during each auction. cumulative_counter
auctioneer.LockHeld.v1-locks-auctioneer_lock Whether an auctioneer holds the auctioneer lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active auctioneer. gauge
auctioneer.LockHeldDuration.v1-locks-auctioneer_lock Time in nanoseconds that the active auctioneer has held the auctioneer lock. Emitted every 30 seconds by the active auctioneer. gauge
auctioneer.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
auctioneer.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
auctioneer.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
auctioneer.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
auctioneer.memoryStats.numFrees Lifetime number of memory deallocations. gauge
auctioneer.memoryStats.numMallocs Lifetime number of memory allocations. gauge
auctioneer.numCPUS Number of CPUs on the machine. gauge
auctioneer.numGoRoutines Instantaneous number of active goroutines in the process. gauge
bbs.BBSMasterElected Emitted once when the BBS is elected as master. gauge
bbs.ConvergenceLRPDuration Time in nanoseconds that the BBS took to run its LRP convergence pass. Emitted every 30 seconds when LRP convergence runs. gauge
bbs.ConvergenceLRPPreProcessingActualLRPsDeleted Cumulative number of times the BBS has detected and deleted a malformed ActualLRP in its LRP convergence pass. Emitted every 30 seconds. gauge
bbs.ConvergenceLRPPreProcessingMalformedRunInfos Cumulative number of times the BBS has detected a malformed DesiredLRP RunInfo in its LRP convergence pass. Emitted every 30 seconds. gauge
bbs.ConvergenceLRPPreProcessingMalformedSchedulingInfos Cumulative number of times the BBS has detected a malformed DesiredLRP SchedulingInfo in its LRP convergence pass. Emitted every 30 seconds. gauge
bbs.ConvergenceLRPRuns Cumulative number of times BBS has run its LRP convergence pass. Emitted every 30 seconds. cumulative_counter
bbs.ConvergenceTaskDuration Time in nanoseconds that the BBS took to run its Task convergence pass. Emitted every 30 seconds when Task convergence runs. gauge
bbs.ConvergenceTaskRuns Cumulative number of times the BBS has run its Task convergence pass. Emitted every 30 seconds. cumulative_counter
bbs.ConvergenceTasksKicked Cumulative number of times the BBS has updated a Task during its Task convergence pass. Emitted every 30 seconds. cumulative_counter
bbs.ConvergenceTasksPruned Cumulative number of times the BBS has deleted a malformed Task during its Task convergence pass. Emitted every 30 seconds. cumulative_counter
bbs.CrashedActualLRPs Total number of LRP instances that have crashed. Emitted every 30 seconds. gauge
bbs.CrashingDesiredLRPs Total number of DesiredLRPs that have at least one crashed instance. Emitted every 30 seconds. gauge
bbs.Domain.cf-apps Whether the cf-apps domain is up-to-date, so that CF apps from CC have been synchronized with DesiredLRPs for Diego to run. 1 means the domain is up-to-date, no data means it is not. Emitted every 30 seconds. gauge
bbs.Domain.cf-tasks Whether the cf-tasks domain is up-to-date, so that CF tasks from CC have been synchronized with tasks for Diego to run. 1 means the domain is up-to-date, no data means it is not. Emitted every 30 seconds. gauge
bbs.ETCDLeader Index of the leader node in the etcd cluster. Emitted every 30 seconds. gauge
bbs.ETCDRaftTerm Raft term of the etcd cluster. Emitted every 30 seconds. gauge
bbs.ETCDReceivedBandwidthRate Number of bytes per second received by the follower etcd node. Emitted every 30 seconds. gauge
bbs.ETCDReceivedRequestRate Number of requests per second received by the follower etcd node. Emitted every 30 seconds. gauge
bbs.ETCDSentBandwidthRate Number of bytes per second sent by the leader etcd node. Emitted every 30 seconds. gauge
bbs.ETCDSentRequestRate Number of requests per second sent by the leader etcd node. Emitted every 30 seconds. gauge
bbs.ETCDWatchers Number of watches set against the etcd cluster. Emitted every 30 seconds. gauge
bbs.LRPsClaimed Total number of LRP instances that have been claimed by some cell. Emitted every 30 seconds. gauge
bbs.LRPsDesired Total number of LRP instances desired across all LRPs. Emitted periodically. gauge
bbs.LRPsExtra Total number of LRP instances that are no longer desired but still have a BBS record. Emitted every 30 seconds. gauge
bbs.LRPsMissing Total number of LRP instances that are desired but have no record in the BBS. Emitted every 30 seconds. gauge
bbs.LRPsRunning Total number of LRP instances that are running on cells. Emitted every 30 seconds. gauge
bbs.LRPsUnclaimed Total number of LRP instances that have not yet been claimed by a cell. Emitted every 30 seconds. gauge
bbs.LockHeld.v1-locks-bbs_lock Whether a BBS holds the BBS lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active BBS server. gauge
bbs.LockHeldDuration.v1-locks-bbs_lock Time in nanoseconds that the active BBS has held the BBS lock. Emitted every 30 seconds by the active BBS server. gauge
bbs.MetricsReportingDuration Time in nanoseconds that the BBS took to emit metrics about etcd. Emitted every 30 seconds. gauge
bbs.MigrationDuration Time in nanoseconds that the BBS took to run migrations against its persistence store. Emitted each time a BBS becomes the active master. gauge
bbs.RequestCount Cumulative number of requests the BBS has handled through its API. Emitted for each BBS request. cumulative_counter
bbs.RequestLatency Time in nanoseconds that the BBS took to handle requests to its API endpoints. Emitted when the BBS API handles requests. gauge
bbs.TasksCompleted Total number of Tasks that have completed. Emitted every 30 seconds. gauge
bbs.TasksPending Total number of Tasks that have not yet been placed on a cell. Emitted every 30 seconds. gauge
bbs.TasksResolving Total number of Tasks locked for deletion. Emitted every 30 seconds. gauge
bbs.TasksRunning Total number of Tasks running on cells. Emitted every 30 seconds. gauge
bbs.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
bbs.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
bbs.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
bbs.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
bbs.memoryStats.numFrees Lifetime number of memory deallocations. gauge
bbs.memoryStats.numMallocs Lifetime number of memory allocations. gauge
bbs.numCPUS Number of CPUs on the machine. gauge
bbs.numGoRoutines Instantaneous number of active goroutines in the process. gauge
cc.failed_job_count.VM_NAME-VM_INDEX Number of failed jobs in the <VM_NAME>-<VM_INDEX> queue. This is the number of delayed jobs where the failed_at column is populated with the time of the most recently failed attempt at the job. The failed job count is not specific to the jobs run by the Cloud Controller worker. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM. cumulative_counter
cc.failed_job_count.cc-generic Number of failed jobs in the cc-generic queue. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM. cumulative_counter
cc.failed_job_count.total Number of failed jobs in all queues. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM. gauge
cc.http_status.1XX Number of HTTP response status codes of type 1xx (informational). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. cumulative_counter
cc.http_status.2XX Number of HTTP response status codes of type 2xx (success). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request. cumulative_counter
cc.http_status.3XX Number of HTTP response status codes of type 3xx (redirection). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request. cumulative_counter
cc.http_status.4XX Number of HTTP response status codes of type 4xx (client error). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request. cumulative_counter
cc.http_status.5XX Number of HTTP response status codes of type 5xx (server error). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. cumulative_counter
cc.job_queue_length.cc-VM_NAME-VM_INDEX Number of background jobs in the <VM_NAME>-<VM_INDEX> queue that have yet to run for the first time. Emitted every 30 seconds per VM. gauge
cc.job_queue_length.cc-generic Number of background jobs in the cc-generic queue that have yet to run for the first time. Emitted every 30 seconds per VM. gauge
cc.job_queue_length.total Total number of background jobs in the queues that have yet to run for the first time. Emitted every 30 seconds per VM. gauge
cc.log_count.all Total number of log messages, sum of messages of all severity levels. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM. gauge
cc.log_count.debug Number of log messages of severity debug. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM. gauge
cc.log_count.debug1 Not used. gauge
cc.log_count.debug2 Number of log messages of severity debug2. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM. gauge
cc.log_count.error Number of error log messages cumulative_counter
cc.log_count.fatal Number of fatal log messages cumulative_counter
cc.log_count.info Number of log messages of severity info. Examples of info messages are droplet created, copying package, uploading package, access denied due to insufficient scope, job logging, blobstore actions, staging requests, and app running requests. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM. gauge
cc.log_count.off Number of log messages of severity off. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM. gauge
cc.log_count.warn Number of warn log messages cumulative_counter
cc.requests.completed Number of Cloud Controller API requests completed cumulative_counter
cc.requests.outstanding Number of Cloud Controller requests made but not completed cumulative_counter
cc.tasks_running.count Number of tasks currently running gauge
cc.tasks_running.memory_in_mb Memory being consumed by all currently running tasks. Emitted every 30 seconds per VM. This metric is only seen in version 3 of the Cloud Foundry API. gauge
cc.thread_info.event_machine.connection_count Number of open connections to event machine. Emitted every 30 seconds per VM. gauge
cc.thread_info.event_machine.resultqueue.num_waiting Number of scheduled tasks in the resultqueue. Emitted every 30 seconds per VM. gauge
cc.thread_info.event_machine.resultqueue.size Number of unscheduled tasks in the resultqueue. Emitted every 30 seconds per VM. gauge
cc.thread_info.event_machine.threadqueue.num_waiting Number of scheduled tasks in the threadqueue. Emitted every 30 seconds per VM. gauge
cc.thread_info.event_machine.threadqueue.size Number of unscheduled tasks in the threadqueue. Emitted every 30 seconds per VM. gauge
cc.thread_info.thread_count Total number of threads that are either runnable or stopped. Emitted every 30 seconds per VM. gauge
cc.total_users Total number of users ever created, including inactive users. Emitted every 10 minutes per VM. gauge
cc.vcap_sinatra.recent_errors 50 most recent errors. DEPRECATED gauge
cc.vitals.cpu Percentage of CPU used by the Cloud Controller process. Emitted every 30 seconds per VM. gauge
cc.vitals.cpu_load_avg System CPU load averaged over the last 1 minute according to the OS. Emitted every 30 seconds per VM. gauge
cc.vitals.mem_bytes The RSS bytes (resident set size) or real memory of the Cloud Controller process. Emitted every 30 seconds per VM. gauge
cc.vitals.mem_free_bytes Total memory available according to the OS. Emitted every 30 seconds per VM. gauge
cc.vitals.mem_used_bytes Total memory used (active + wired) according to the OS. Emitted every 30 seconds per VM. gauge
cc.vitals.num_cores The number of CPUs of a host machine. Emitted every 30 seconds per VM. gauge
cc.vitals.uptime The uptime of the Cloud Controller process in seconds. Emitted every 30 seconds per VM. gauge
cc_uploader.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
cc_uploader.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
cc_uploader.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
cc_uploader.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
cc_uploader.memoryStats.numFrees Lifetime number of memory deallocations. gauge
cc_uploader.memoryStats.numMallocs Lifetime number of memory allocations. gauge
cc_uploader.numCPUS Number of CPUs on the machine. gauge
cc_uploader.numGoRoutines Instantaneous number of active goroutines in the process. gauge
container.cpu_percentage Percentage of CPU used by this container gauge
container.disk_bytes Number of bytes of disk used by this container gauge
container.disk_bytes_quota Number of bytes of disk allowed for this container gauge
container.memory_bytes Number of bytes of RAM used by this container gauge
container.memory_bytes_quota Number of bytes of RAM allocated to this container gauge
etcd.CompareAndDeleteFail CompareAndDeleteFail operation count. Emitted every 30 seconds. gauge
etcd.CompareAndDeleteSuccess CompareAndDeleteSuccess operation count. Emitted every 30 seconds. gauge
etcd.CompareAndSwapFail CompareAndSwapFail operation count. Emitted every 30 seconds. gauge
etcd.CompareAndSwapSuccess CompareAndSwapSuccess operation count. Emitted every 30 seconds. gauge
etcd.CreateFail CreateFail operation count. Emitted every 30 seconds. gauge
etcd.CreateSuccess CreateSuccess operation count. Emitted every 30 seconds. gauge
etcd.DeleteFail DeleteFail operation count. Emitted every 30 seconds. gauge
etcd.DeleteSuccess DeleteSuccess operation count. Emitted every 30 seconds. gauge
etcd.EtcdIndex X-Etcd-Index value from the /stats/store endpoint. Emitted every 30 seconds. gauge
etcd.ExpireCount ExpireCount operation count. Emitted every 30 seconds. gauge
etcd.Followers Number of etcd followers. Emitted every 30 seconds. gauge
etcd.GetsFail GetsFail operation count. Emitted every 30 seconds. gauge
etcd.GetsSuccess GetsSuccess operation count. Emitted every 30 seconds. gauge
etcd.IsLeader 1 if the current server is the leader, 0 if it is a follower. Emitted every 30 seconds. gauge
etcd.Latency Current latency in milliseconds from leader to a specific follower. Emitted every 30 seconds. gauge
etcd.RaftIndex X-Raft-Index value from the /stats/store endpoint. Emitted every 30 seconds. gauge
etcd.RaftTerm X-Raft-Term value from the /stats/store endpoint. Emitted every 30 seconds. gauge
etcd.ReceivedAppendRequests Number of append requests this node has processed. Emitted every 30 seconds. gauge
etcd.ReceivingBandwidthRate Number of bytes per second this node is receiving (follower only). Emitted every 30 seconds. gauge
etcd.ReceivingRequestRate Number of requests per second this node is receiving (follower only). Emitted every 30 seconds. gauge
etcd.SendingBandwidthRate Number of bytes per second this node is sending (leader only). This value is undefined on single member clusters. Emitted every 30 seconds. gauge
etcd.SendingRequestRate Number of requests per second this node is sending (leader only). This value is undefined on single member clusters. Emitted every 30 seconds. gauge
etcd.SentAppendRequests Number of requests that this node has sent. Emitted every 30 seconds. gauge
etcd.SetsFail SetsFail operation count. Emitted every 30 seconds. gauge
etcd.SetsSuccess SetsSuccess operation count. Emitted every 30 seconds. gauge
etcd.UpdateFail UpdateFail operation count. Emitted every 30 seconds. gauge
etcd.UpdateSuccess UpdateSuccess operation count. Emitted every 30 seconds. gauge
etcd.Watchers Watchers operation count. Emitted every 30 seconds. gauge
file_server.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
file_server.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
file_server.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
file_server.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
file_server.memoryStats.numFrees Lifetime number of memory deallocations. gauge
file_server.memoryStats.numMallocs Lifetime number of memory allocations. gauge
file_server.numCPUS Number of CPUs on the machine. gauge
file_server.numGoRoutines Instantaneous number of active goroutines in the process. gauge
garden_linux.BackingStores Number of container backing store files. Emitted every 30 seconds. gauge
garden_linux.DepotDirs Number of directories in the Garden depot. Emitted every 30 seconds. gauge
garden_linux.LoopDevices Number of attached loop devices. Emitted every 30 seconds. gauge
garden_linux.MetricsReporting How long it took to emit the BackingStores, DepotDirs, and LoopDevices metrics. Emitted every 30 seconds. gauge
garden_linux.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
garden_linux.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
garden_linux.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
garden_linux.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
garden_linux.memoryStats.numFrees Lifetime number of memory deallocations. gauge
garden_linux.memoryStats.numMallocs Lifetime number of memory allocations. gauge
garden_linux.numCPUS Number of CPUs on the machine. gauge
garden_linux.numGoRoutines Instantaneous number of active goroutines in the process. gauge
gorouter.bad_gateways Number of bad gateway events cumulative_counter
gorouter.responses Number of router responses cumulative_counter
gorouter.total_requests Number of router requests received cumulative_counter
gorouter.total_routes Number of registered routes gauge
nsync_bulker.DesiredLRPSyncDuration Time in nanoseconds that the nsync-bulker took to synchronize CF apps and Diego DesiredLRPs. Emitted every 30 seconds. gauge
nsync_bulker.LRPsDesired Cumulative number of LRPs desired through the nsync API. Emitted on each request desiring a new LRP, every 30 seconds. gauge
nsync_bulker.LockHeld.v1-locks-nsync_bulker_lock Whether an nsync-bulker holds the nsync-bulker lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active nsync-bulker. gauge
nsync_bulker.LockHeldDuration.v1-locks-nsync_bulker_lock Time in nanoseconds that the active nsync-bulker has held the nsync-bulker lock. Emitted every 30 seconds by the active nsync-bulker. gauge
nsync_bulker.NsyncInvalidDesiredLRPsFound Number of invalid DesiredLRPs found during nsync-bulker periodic synchronization. Emitted every 30 seconds. gauge
nsync_bulker.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
nsync_bulker.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
nsync_bulker.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
nsync_bulker.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
nsync_bulker.memoryStats.numFrees Lifetime number of memory deallocations. gauge
nsync_bulker.memoryStats.numMallocs Lifetime number of memory allocations. gauge
nsync_bulker.numCPUS Number of CPUs on the machine. gauge
nsync_bulker.numGoRoutines Instantaneous number of active goroutines in the process. gauge
nsync_listener.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
nsync_listener.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
nsync_listener.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
nsync_listener.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
nsync_listener.memoryStats.numFrees Lifetime number of memory deallocations. gauge
nsync_listener.memoryStats.numMallocs Lifetime number of memory allocations. gauge
nsync_listener.numCPUS Number of CPUs on the machine. gauge
nsync_listener.numGoRoutines Instantaneous number of active goroutines in the process. gauge
rep.CM Emitted every 30 seconds. gauge
rep.CapacityRemainingContainers Remaining number of containers this cell can host. Emitted every 60 seconds. gauge
rep.CapacityRemainingDisk Amount of disk available to allocate in the cell, in megabytes gauge
rep.CapacityRemainingMemory Amount of memory available to allocate in the cell, in megabytes gauge
rep.CapacityTotalContainers Total number of containers this cell can host. Emitted every 60 seconds. gauge
rep.CapacityTotalDisk Total amount of disk in a cell, in megabytes gauge
rep.CapacityTotalMemory Total amount of memory in a cell, in megabytes gauge
rep.ContainerCount Number of Diego containers currently running gauge
rep.GardenContainerCreationDuration Time in nanoseconds that the rep Garden backend took to create a container. Emitted after every successful container creation. gauge
rep.LogMessage Emitted every 30 seconds. gauge
rep.RepBulkSyncDuration Time in nanoseconds that the cell rep took to synchronize the ActualLRPs it has claimed with its actual garden containers. Emitted every 30 seconds by each rep. gauge
rep.UnhealthyCell Number of unhealthy Diego cells gauge
rep.logSenderTotalMessagesRead Count of application log messages sent by Diego Executor. Emitted every 30 seconds. cumulative_counter
rep.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
rep.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
rep.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
rep.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
rep.memoryStats.numFrees Lifetime number of memory deallocations. gauge
rep.memoryStats.numMallocs Lifetime number of memory allocations. gauge
rep.numCPUS Number of CPUs on the machine. gauge
rep.numGoRoutines Instantaneous number of active goroutines in the process. gauge
route_emitter.LockHeld.v1-locks-route_emitter_lock Whether a route-emitter holds the route-emitter lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active route-emitter. gauge
route_emitter.LockHeldDuration.v1-locks-route_emitter_lock Time in nanoseconds that the active route-emitter has held the route-emitter lock. Emitted every 30 seconds by the active route-emitter. gauge
route_emitter.MessagesEmitted The cumulative number of registration messages that this process has sent. Emitted every 30 seconds. cumulative_counter
route_emitter.RouteEmitterSyncDuration Time in nanoseconds that the active route-emitter took to perform its synchronization pass. Emitted every 60 seconds. gauge
route_emitter.RoutesRegistered Cumulative number of route registrations emitted from the route-emitter as it reacts to changes to LRPs. Emitted every 30 seconds. cumulative_counter
route_emitter.RoutesSynced Cumulative number of route registrations emitted from the route-emitter during its periodic route-table synchronization. Emitted every 30 seconds. cumulative_counter
route_emitter.RoutesTotal Number of routes in the route-emitter's routing table. Emitted every 30 seconds. gauge
route_emitter.RoutesUnregistered Cumulative number of route unregistrations emitted from the route-emitter as it reacts to changes to LRPs. Emitted every 30 seconds. cumulative_counter
route_emitter.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
route_emitter.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
route_emitter.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
route_emitter.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
route_emitter.memoryStats.numFrees Lifetime number of memory deallocations. gauge
route_emitter.memoryStats.numMallocs Lifetime number of memory allocations. gauge
route_emitter.numCPUS Number of CPUs on the machine. gauge
route_emitter.numGoRoutines Instantaneous number of active goroutines in the process. gauge
ssh_proxy.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
ssh_proxy.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
ssh_proxy.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
ssh_proxy.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
ssh_proxy.memoryStats.numFrees Lifetime number of memory deallocations. gauge
ssh_proxy.memoryStats.numMallocs Lifetime number of memory allocations. gauge
ssh_proxy.numCPUS Number of CPUs on the machine. gauge
ssh_proxy.numGoRoutines Instantaneous number of active goroutines in the process. gauge
stager.StagingRequestFailedDuration Time in nanoseconds that the failed staging task took to run. Emitted each time a staging task fails. gauge
stager.StagingRequestSucceededDuration Time in nanoseconds that the successful staging task took to run. Emitted each time a staging task completes successfully. gauge
stager.StagingRequestsFailed Cumulative number of failed staging tasks handled by each stager. Emitted every time a staging task fails. gauge
stager.StagingRequestsSucceeded Cumulative number of successful staging tasks handled by each stager. Emitted every time a staging task completes successfully. gauge
stager.StagingStartRequestsReceived Cumulative number of requests to start a staging task. Emitted by a stager each time it handles a request. gauge
stager.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
stager.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
stager.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
stager.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
stager.memoryStats.numFrees Lifetime number of memory deallocations. gauge
stager.memoryStats.numMallocs Lifetime number of memory allocations. gauge
stager.numCPUS Number of CPUs on the machine. gauge
stager.numGoRoutines Instantaneous number of active goroutines in the process. gauge
syslog_drain_binder.memoryStats.lastGCPauseTimeNS Duration of the last Garbage Collector pause in nanoseconds. gauge
syslog_drain_binder.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
syslog_drain_binder.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
syslog_drain_binder.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
syslog_drain_binder.memoryStats.numFrees Lifetime number of memory deallocations. gauge
syslog_drain_binder.memoryStats.numMallocs Lifetime number of memory allocations. gauge
syslog_drain_binder.numCPUS Number of CPUs on the machine. gauge
syslog_drain_binder.numGoRoutines Instantaneous number of active goroutines in the process. gauge
syslog_drain_binder.pollCount Number of times the syslog drain binder has polled the cloud controller for syslog drain bindings. Emitted every 30 seconds. cumulative_counter
syslog_drain_binder.totalDrains Number of syslog drains returned by cloud controller. Emitted every 30 seconds. gauge
system.cpu.sys CPU load consumed by the kernel gauge
system.cpu.user CPU load consumed by userspace gauge
system.cpu.wait Time CPU spent waiting for IO gauge
system.disk.ephemeral.percent Percentage of the ephemeral disk used gauge
system.disk.system.percent Percentage of the system disk used gauge
system.healthy Overall status of system health gauge
system.mem.percent Percentage of RAM used gauge
system.swap.percent Percentage of swap space used gauge
tps_listener.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
tps_listener.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
tps_listener.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
tps_listener.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
tps_listener.memoryStats.numFrees Lifetime number of memory deallocations. gauge
tps_listener.memoryStats.numMallocs Lifetime number of memory allocations. gauge
tps_listener.numCPUS Number of CPUs on the machine. gauge
tps_listener.numGoRoutines Instantaneous number of active goroutines in the process. gauge
tps_watcher.LockHeld.v1-locks-tps_watcher_lock Whether a tps-watcher holds the tps-watcher lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active tps-watcher. gauge
tps_watcher.LockHeldDuration.v1-locks-tps_watcher_lock Time in nanoseconds that the active tps-watcher has held the tps-watcher lock. Emitted every 30 seconds by the active tps-watcher. gauge
tps_watcher.memoryStats.lastGCPauseTimeNS Duration in nanoseconds of the last garbage collector pause. gauge
tps_watcher.memoryStats.numBytesAllocated Instantaneous count of bytes allocated and still in use. gauge
tps_watcher.memoryStats.numBytesAllocatedHeap Instantaneous count of bytes allocated on the main heap and still in use. gauge
tps_watcher.memoryStats.numBytesAllocatedStack Instantaneous count of bytes used by the stack allocator. gauge
tps_watcher.memoryStats.numFrees Lifetime number of memory deallocations. gauge
tps_watcher.memoryStats.numMallocs Lifetime number of memory allocations. gauge
tps_watcher.numCPUS Number of CPUs on the machine. Emitted every 30 seconds. gauge
tps_watcher.numGoRoutines Instantaneous number of active goroutines in the process. gauge
uaa.audit_service.client_authentication_count Number of client authentication attempts cumulative_counter
uaa.audit_service.client_authentication_failure_count Number of failed client authentication attempts cumulative_counter
uaa.audit_service.principal_authentication_failure_count Number of failed principal authentication attempts cumulative_counter
uaa.audit_service.principal_not_found_count Number of times a non-user principal was not found cumulative_counter
uaa.audit_service.user_authentication_count Number of times a user has successfully authenticated cumulative_counter
uaa.audit_service.user_authentication_failure_count Number of failed user authentication attempts cumulative_counter
uaa.audit_service.user_not_found_count Number of times a user was not found cumulative_counter
uaa.audit_service.user_password_changes Number of times a user password has changed cumulative_counter
uaa.audit_service.user_password_failures Number of times a user password change has failed cumulative_counter
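
Metrics of type cumulative_counter in the table above report running totals, so what usually matters is how fast the value grows rather than the raw number. As a minimal sketch (the function name, its arguments, and the reset handling are illustrative assumptions, not part of the integration), a per-second rate could be derived from two consecutive samples in Python:

```python
from typing import Optional


def counter_rate(prev_value: float, prev_ts: float,
                 curr_value: float, curr_ts: float) -> Optional[float]:
    """Return the per-second rate between two samples of a cumulative counter.

    Timestamps are in seconds. Returns None when the samples are not in
    increasing time order, because no rate can be computed in that case.
    """
    elapsed = curr_ts - prev_ts
    if elapsed <= 0:
        return None
    delta = curr_value - prev_value
    if delta < 0:
        # Assumption: a decreasing value means the emitting process restarted
        # and the counter began again from zero, so only the new value counts.
        delta = curr_value
    return delta / elapsed
```

For example, two samples of gorouter.total_requests taken 30 seconds apart with values 100 and 160 would give a rate of 2.0 requests per second under these assumptions.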

DopplerServer.LinuxFileDescriptor

gauge

Number of file handles for the Doppler process.

DopplerServer.TruncatingBuffer.DroppedMessages

gauge

Number of messages intentionally dropped by Doppler for a specific sink. This counter event corresponds to log messages reading "Log message output is too high." Emitted every 5 seconds.

DopplerServer.TruncatingBuffer.totalDroppedMessages

gauge

Lifetime total number of messages intentionally dropped by Doppler from all of its sinks due to back pressure. Emitted every 5 seconds.

DopplerServer.Uptime

gauge

Uptime for the Doppler process.

DopplerServer.dropsondeListener.currentBufferCount

cumulative_counter

DEPRECATED

DopplerServer.dropsondeListener.receivedByteCount

cumulative_counter

DEPRECATED in favor of DopplerServer.udpListener.receivedByteCount.

DopplerServer.dropsondeListener.receivedMessageCount

cumulative_counter

DEPRECATED in favor of DopplerServer.udpListener.receivedMessageCount.

DopplerServer.dropsondeUnmarshaller.containerMetricReceived

cumulative_counter

Lifetime number of ContainerMetric messages unmarshalled.

DopplerServer.dropsondeUnmarshaller.counterEventReceived

cumulative_counter

Lifetime number of CounterEvent messages unmarshalled.

DopplerServer.dropsondeUnmarshaller.errorReceived

gauge

Lifetime number of Error messages unmarshalled.

DopplerServer.dropsondeUnmarshaller.heartbeatReceived

gauge

DEPRECATED

DopplerServer.dropsondeUnmarshaller.httpStartStopReceived

cumulative_counter

Lifetime number of HttpStartStop messages unmarshalled.

DopplerServer.dropsondeUnmarshaller.logMessageTotal

cumulative_counter

Lifetime number of LogMessage messages unmarshalled.

DopplerServer.dropsondeUnmarshaller.unmarshalErrors

gauge

Lifetime number of errors when unmarshalling messages.

DopplerServer.dropsondeUnmarshaller.valueMetricReceived

cumulative_counter

Lifetime number of ValueMetric messages unmarshalled.

DopplerServer.httpServer.receivedMessages

cumulative_counter

Number of messages received by Doppler's internal MessageRouter. Emitted every 5 seconds.

DopplerServer.listeners.totalReceivedMessageCount

cumulative_counter

Total number of messages received across all of Doppler's listeners (UDP, TCP, TLS).

DopplerServer.memoryStats.lastGCPauseTimeNS

gauge

Duration of the last Garbage Collector pause in nanoseconds.

DopplerServer.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

DopplerServer.memoryStats.numBytesAllocatedHeap

gauge

Number of bytes allocated on the heap.

DopplerServer.memoryStats.numBytesAllocatedStack

gauge

Number of bytes allocated on the stack.

DopplerServer.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

DopplerServer.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

DopplerServer.messageRouter.numberOfContainerMetricSinks

gauge

Instantaneous number of container metric sinks known to the SinkManager. Emitted every 5 seconds.

DopplerServer.messageRouter.numberOfDumpSinks

gauge

Instantaneous number of dump sinks known to the SinkManager. Emitted every 5 seconds.

DopplerServer.messageRouter.numberOfFirehoseSinks

gauge

Instantaneous number of firehose sinks known to the SinkManager. Emitted every 5 seconds.

DopplerServer.messageRouter.numberOfSyslogSinks

gauge

Instantaneous number of syslog sinks known to the SinkManager.

DopplerServer.messageRouter.numberOfWebsocketSinks

gauge

Instantaneous number of WebSocket sinks known to the SinkManager. Emitted every 5 seconds.

DopplerServer.messageRouter.totalDroppedMessages

gauge

Lifetime number of messages dropped inside Doppler for various reasons (downstream consumer can't keep up, internal object wasn't ready for message, etc.).

DopplerServer.numCpus

gauge

Number of CPUs on the machine.

DopplerServer.numGoRoutines

gauge

Instantaneous number of active goroutines in the Doppler process.

DopplerServer.sentMessagesFirehose.SUBSCRIPTION_ID

gauge

Number of sent messages through the firehose per subscription id. Emitted every 5 seconds.

DopplerServer.sentMessagesFirehose

cumulative_counter

Number of messages sent on the Firehose.

DopplerServer.signatureVerifier.invalidSignatureErrors

gauge

Lifetime number of messages received with an invalid signature.

DopplerServer.signatureVerifier.missingSignatureErrors

gauge

Lifetime number of messages received that are too small to contain a signature.

DopplerServer.signatureVerifier.validSignatures

cumulative_counter

Lifetime number of messages received with valid signatures.

DopplerServer.tcpListener.receivedByteCount

cumulative_counter

Lifetime number of bytes received by Doppler's TCP Listener. Emitted every 5 seconds.

DopplerServer.tcpListener.receivedErrorCount

cumulative_counter

Lifetime number of errors encountered by Doppler's TCP Listener while handshaking, decoding, or reading from the connection.

DopplerServer.tcpListener.receivedMessageCount

cumulative_counter

Lifetime number of messages received by Doppler's TCP Listener. Emitted every 5 seconds.

DopplerServer.tlsListener.receivedByteCount

cumulative_counter

Lifetime number of bytes received by Doppler's TLS Listener. Emitted every 5 seconds.

DopplerServer.tlsListener.receivedErrorCount

cumulative_counter

Lifetime number of errors encountered by Doppler's TLS Listener while handshaking, decoding, or reading from the connection.

DopplerServer.tlsListener.receivedMessageCount

cumulative_counter

Lifetime number of messages received by Doppler's TLS Listener. Emitted every 5 seconds.

DopplerServer.udpListener.receivedByteCount

cumulative_counter

Lifetime number of bytes received by Doppler's UDP Listener.

DopplerServer.udpListener.receivedErrorCount

cumulative_counter

Lifetime number of errors encountered by Doppler's UDP Listener while reading from the connection.

DopplerServer.udpListener.receivedMessageCount

cumulative_counter

Lifetime number of messages received by Doppler's UDP Listener.

LoggregatorTrafficController.LinuxFileDescriptor

gauge

Number of file handles for the Traffic Controller process.

LoggregatorTrafficController.Uptime

gauge

Uptime for the Traffic Controller process. Emitted every 30 seconds.

LoggregatorTrafficController.dopplerProxy.containermetricsLatency

gauge

Duration for serving container metrics via the containermetrics endpoint (milliseconds). Emitted every 30 seconds.

LoggregatorTrafficController.dopplerProxy.recentlogsLatency

gauge

Duration for serving recent logs via the recentLogs endpoint (milliseconds). Emitted every 30 seconds.

LoggregatorTrafficController.memoryStats.lastGCPauseTimeNS

gauge

Duration of the last Garbage Collector pause in nanoseconds.

LoggregatorTrafficController.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

LoggregatorTrafficController.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

LoggregatorTrafficController.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

LoggregatorTrafficController.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

LoggregatorTrafficController.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

LoggregatorTrafficController.numCPUS

gauge

Number of CPUs on the machine.

LoggregatorTrafficController.numGoRoutines

gauge

Instantaneous number of active goroutines in the Traffic Controller process.

MetronAgent.DopplerForwarder.sentMessages

cumulative_counter

Lifetime number of messages sent to Doppler regardless of protocol. Emitted every 30 seconds.

MetronAgent.MessageAggregator.counterEventReceived

cumulative_counter

Lifetime number of CounterEvents aggregated in Metron.

MetronAgent.MessageBuffer.droppedMessageCount

cumulative_counter

Lifetime number of messages intentionally dropped from Metron's batch writer buffer. Batch writing is performed over TCP/TLS only.

MetronAgent.dropsondeAgentListener.currentBufferCount

cumulative_counter

Instantaneous number of Dropsonde messages read by UDP socket but not yet unmarshalled.

MetronAgent.dropsondeAgentListener.receivedByteCount

cumulative_counter

Lifetime number of bytes of Dropsonde messages read by UDP socket.

MetronAgent.dropsondeAgentListener.receivedMessageCount

cumulative_counter

Lifetime number of Dropsonde messages read by UDP socket.

MetronAgent.dropsondeMarshaller.containerMetricMarshalled

gauge

Lifetime number of ContainerMetric messages marshalled.

MetronAgent.dropsondeMarshaller.counterEventMarshalled

cumulative_counter

Lifetime number of CounterEvent messages marshalled.

MetronAgent.dropsondeMarshaller.errorMarshalled

gauge

Lifetime number of Error messages marshalled.

MetronAgent.dropsondeMarshaller.heartbeatMarshalled

gauge

Lifetime number of Heartbeat messages marshalled.

MetronAgent.dropsondeMarshaller.httpStartStopMarshalled

gauge

Lifetime number of HttpStartStop messages marshalled.

MetronAgent.dropsondeMarshaller.logMessageMarshalled

gauge

Lifetime number of LogMessage messages marshalled.

MetronAgent.dropsondeMarshaller.marshalErrors

gauge

Lifetime number of errors when marshalling messages.

MetronAgent.dropsondeMarshaller.valueMetricMarshalled

gauge

Lifetime number of ValueMetric messages marshalled.

MetronAgent.dropsondeUnmarshaller.containerMetricReceived

cumulative_counter

Lifetime number of ContainerMetric messages unmarshalled.

MetronAgent.dropsondeUnmarshaller.counterEventReceived

cumulative_counter

Lifetime number of CounterEvent messages unmarshalled.

MetronAgent.dropsondeUnmarshaller.errorReceived

gauge

Lifetime number of Error messages unmarshalled.

MetronAgent.dropsondeUnmarshaller.heartbeatReceived

gauge

DEPRECATED

MetronAgent.dropsondeUnmarshaller.httpStartStopReceived

cumulative_counter

Lifetime number of HttpStartStop messages unmarshalled.

MetronAgent.dropsondeUnmarshaller.logMessageTotal

cumulative_counter

Lifetime number of LogMessage messages unmarshalled.

MetronAgent.dropsondeUnmarshaller.unmarshalErrors

gauge

Lifetime number of errors when unmarshalling messages.

MetronAgent.dropsondeUnmarshaller.valueMetricReceived

cumulative_counter

Lifetime number of ValueMetric messages unmarshalled.

MetronAgent.legacyAgentListener.currentBufferCount

cumulative_counter

Instantaneous number of Legacy messages read by UDP socket but not yet unmarshalled.

MetronAgent.legacyAgentListener.receivedByteCount

cumulative_counter

Lifetime number of bytes of Legacy messages read by UDP socket.

MetronAgent.legacyAgentListener.receivedMessageCount

cumulative_counter

Lifetime number of Legacy messages read by UDP socket.

MetronAgent.memoryStats.lastGCPauseTimeNS

gauge

Duration of the last Garbage Collector pause in nanoseconds.

MetronAgent.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

MetronAgent.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

MetronAgent.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

MetronAgent.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

MetronAgent.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

MetronAgent.numCpus

gauge

Number of CPUs on the machine.

MetronAgent.numGoRoutines

gauge

Instantaneous number of active goroutines in the Metron process.

MetronAgent.tcp.sendErrorCount

cumulative_counter

Lifetime number of errors encountered while writing to Doppler over TCP.

MetronAgent.tcp.sentByteCount

cumulative_counter

Lifetime number of bytes sent to Doppler over TCP.

MetronAgent.tcp.sentMessageCount

cumulative_counter

Lifetime number of messages sent to Doppler over TCP.

MetronAgent.tls.sendErrorCount

cumulative_counter

Lifetime number of errors encountered while writing to Doppler over TLS.

MetronAgent.tls.sentByteCount

cumulative_counter

Lifetime number of bytes sent to Doppler over TLS. Emitted every 30 seconds.

MetronAgent.tls.sentMessageCount

cumulative_counter

Lifetime number of messages sent to Doppler over TLS. Emitted every 30 seconds.

MetronAgent.udp.sendErrorCount

cumulative_counter

Lifetime number of errors encountered while writing to Doppler over UDP.

MetronAgent.udp.sentByteCount

cumulative_counter

Lifetime number of bytes sent to Doppler over UDP.

MetronAgent.udp.sentMessageCount

cumulative_counter

Lifetime number of messages sent to Doppler over UDP.

auctioneer.AuctioneerFetchStatesDuration

gauge

Time in nanoseconds that the auctioneer took to fetch state from all the cells when running its auction. Emitted every 30 seconds during each auction.

auctioneer.AuctioneerLRPAuctionsFailed

cumulative_counter

Cumulative number of LRP instances that the auctioneer failed to place on Diego cells. Emitted every 30 seconds during each auction.

auctioneer.AuctioneerLRPAuctionsStarted

cumulative_counter

Cumulative number of LRP instances that the auctioneer successfully placed on Diego cells. Emitted every 30 seconds during each auction.

auctioneer.AuctioneerTaskAuctionsFailed

cumulative_counter

Cumulative number of Tasks that the auctioneer failed to place on Diego cells. Emitted every 30 seconds during each auction.

auctioneer.AuctioneerTaskAuctionsStarted

cumulative_counter

Cumulative number of Tasks that the auctioneer successfully placed on Diego cells. Emitted every 30 seconds during each auction.

auctioneer.LockHeld.v1-locks-auctioneer_lock

gauge

Whether an auctioneer holds the auctioneer lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active auctioneer.

auctioneer.LockHeldDuration.v1-locks-auctioneer_lock

gauge

Time in nanoseconds that the active auctioneer has held the auctioneer lock. Emitted every 30 seconds by the active auctioneer.

auctioneer.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

auctioneer.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

auctioneer.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

auctioneer.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

auctioneer.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

auctioneer.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

auctioneer.numCPUS

gauge

Number of CPUs on the machine.

auctioneer.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

bbs.BBSMasterElected

gauge

Emitted once when the BBS is elected as master.

bbs.ConvergenceLRPDuration

gauge

Time in nanoseconds that the BBS took to run its LRP convergence pass. Emitted every 30 seconds when LRP convergence runs.

bbs.ConvergenceLRPPreProcessingActualLRPsDeleted

gauge

Cumulative number of times the BBS has detected and deleted a malformed ActualLRP in its LRP convergence pass. Emitted every 30 seconds.

bbs.ConvergenceLRPPreProcessingMalformedRunInfos

gauge

Cumulative number of times the BBS has detected a malformed DesiredLRP RunInfo in its LRP convergence pass. Emitted every 30 seconds.

bbs.ConvergenceLRPPreProcessingMalformedSchedulingInfos

gauge

Cumulative number of times the BBS has detected a malformed DesiredLRP SchedulingInfo in its LRP convergence pass. Emitted every 30 seconds.

bbs.ConvergenceLRPRuns

cumulative_counter

Cumulative number of times BBS has run its LRP convergence pass. Emitted every 30 seconds.

bbs.ConvergenceTaskDuration

gauge

Time in nanoseconds that the BBS took to run its Task convergence pass. Emitted every 30 seconds when Task convergence runs.

bbs.ConvergenceTaskRuns

cumulative_counter

Cumulative number of times the BBS has run its Task convergence pass. Emitted every 30 seconds.

bbs.ConvergenceTasksKicked

cumulative_counter

Cumulative number of times the BBS has updated a Task during its Task convergence pass. Emitted every 30 seconds.

bbs.ConvergenceTasksPruned

cumulative_counter

Cumulative number of times the BBS has deleted a malformed Task during its Task convergence pass. Emitted every 30 seconds.

bbs.CrashedActualLRPs

gauge

Total number of LRP instances that have crashed. Emitted every 30 seconds.

bbs.CrashingDesiredLRPs

gauge

Total number of DesiredLRPs that have at least one crashed instance. Emitted every 30 seconds.

bbs.Domain.cf-apps

gauge

Whether the cf-apps domain is up-to-date, so that CF apps from CC have been synchronized with DesiredLRPs for Diego to run. 1 means the domain is up-to-date; no data means it is not. Emitted every 30 seconds.

bbs.Domain.cf-tasks

gauge

Whether the cf-tasks domain is up-to-date, so that CF tasks from CC have been synchronized with tasks for Diego to run. 1 means the domain is up-to-date; no data means it is not. Emitted every 30 seconds.
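
Because the bbs.Domain.* gauges report 1 when a domain is up-to-date and stop reporting otherwise, a check has to treat missing data as "not up-to-date". The following Python sketch illustrates one way to do that. The function name, its parameters, and the 60-second staleness threshold (two emission intervals) are assumptions made for illustration and are not part of the integration.

```python
from typing import Optional


def domain_is_fresh(
    last_value: Optional[float],
    age_seconds: Optional[float],
    max_age_seconds: float = 60.0,  # assumed threshold: two 30-second intervals
) -> bool:
    """Return True only if a recent datapoint with value 1 exists.

    last_value:  most recent value seen for the gauge, or None if no data.
    age_seconds: how long ago that datapoint was emitted, or None if no data.
    """
    if last_value is None or age_seconds is None:
        # No data at all means the domain is not up-to-date.
        return False
    return last_value == 1 and age_seconds <= max_age_seconds


# Hypothetical readings for bbs.Domain.cf-apps.
print(domain_is_fresh(1, 10.0))     # recent datapoint
print(domain_is_fresh(1, 120.0))    # datapoint too old
print(domain_is_fresh(None, None))  # no data
```

A detector built on these gauges would typically alert on the absence of data rather than on a value of 0.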

bbs.ETCDLeader

gauge

Index of the leader node in the etcd cluster. Emitted every 30 seconds.

bbs.ETCDRaftTerm

gauge

Raft term of the etcd cluster. Emitted every 30 seconds.

bbs.ETCDReceivedBandwidthRate

gauge

Number of bytes per second received by the follower etcd node. Emitted every 30 seconds.

bbs.ETCDReceivedRequestRate

gauge

Number of requests per second received by the follower etcd node. Emitted every 30 seconds.

bbs.ETCDSentBandwidthRate

gauge

Number of bytes per second sent by the leader etcd node. Emitted every 30 seconds.

bbs.ETCDSentRequestRate

gauge

Number of requests per second sent by the leader etcd node. Emitted every 30 seconds.

bbs.ETCDWatchers

gauge

Number of watches set against the etcd cluster. Emitted every 30 seconds.

bbs.LRPsClaimed

gauge

Total number of LRP instances that have been claimed by some cell. Emitted every 30 seconds.

bbs.LRPsDesired

gauge

Total number of LRP instances desired across all LRPs. Emitted periodically.

bbs.LRPsExtra

gauge

Total number of LRP instances that are no longer desired but still have a BBS record. Emitted every 30 seconds.

bbs.LRPsMissing

gauge

Total number of LRP instances that are desired but have no record in the BBS. Emitted every 30 seconds.

bbs.LRPsRunning

gauge

Total number of LRP instances that are running on cells. Emitted every 30 seconds.

bbs.LRPsUnclaimed

gauge

Total number of LRP instances that have not yet been claimed by a cell. Emitted every 30 seconds.

bbs.LockHeld.v1-locks-bbs_lock

gauge

Whether a BBS holds the BBS lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active BBS server.

bbs.LockHeldDuration.v1-locks-bbs_lock

gauge

Time in nanoseconds that the active BBS has held the BBS lock. Emitted every 30 seconds by the active BBS server.

bbs.MetricsReportingDuration

gauge

Time in nanoseconds that the BBS took to emit metrics about etcd. Emitted every 30 seconds.

bbs.MigrationDuration

gauge

Time in nanoseconds that the BBS took to run migrations against its persistence store. Emitted each time a BBS becomes the active master.

bbs.RequestCount

cumulative_counter

Cumulative number of requests the BBS has handled through its API. Emitted for each BBS request.

bbs.RequestLatency

gauge

Time in nanoseconds that the BBS took to handle requests to its API endpoints. Emitted when the BBS API handles requests.
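
Many of the duration gauges in this list, including bbs.RequestLatency, are reported in nanoseconds. A minimal Python sketch of converting such a value to milliseconds for display follows; the function name and the sample value are illustrative assumptions.

```python
NANOSECONDS_PER_MILLISECOND = 1_000_000


def ns_to_ms(duration_ns: float) -> float:
    """Convert a duration from nanoseconds to milliseconds."""
    return duration_ns / NANOSECONDS_PER_MILLISECOND


# Hypothetical bbs.RequestLatency sample of 2,500,000 ns.
print(ns_to_ms(2_500_000))
```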

bbs.TasksCompleted

gauge

Total number of Tasks that have completed. Emitted every 30 seconds.

bbs.TasksPending

gauge

Total number of Tasks that have not yet been placed on a cell. Emitted every 30 seconds.

bbs.TasksResolving

gauge

Total number of Tasks locked for deletion. Emitted every 30 seconds.

bbs.TasksRunning

gauge

Total number of Tasks running on cells. Emitted every 30 seconds.

bbs.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

bbs.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

bbs.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

bbs.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

bbs.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

bbs.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

bbs.numCPUS

gauge

Number of CPUs on the machine.

bbs.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

cc.failed_job_count.VM_NAME-VM_INDEX

cumulative_counter

Number of failed jobs in the <VM_NAME>-<VM_INDEX> queue. This is the number of delayed jobs where the failed_at column is populated with the time of the most recently failed attempt at the job. The failed job count is not specific to the jobs run by the Cloud Controller worker. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM.

cc.failed_job_count.cc-generic

cumulative_counter

Number of failed jobs in the cc-generic queue. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM.

cc.failed_job_count.total

gauge

Number of failed jobs in all queues. By default, Cloud Controller deletes failed jobs after 31 days. Emitted every 30 seconds per VM.

cc.http_status.1XX

cumulative_counter

Number of HTTP response status codes of type 1xx (informational). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle.

cc.http_status.2XX

cumulative_counter

Number of HTTP response status codes of type 2xx (success). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request.

cc.http_status.3XX

cumulative_counter

Number of HTTP response status codes of type 3xx (redirection). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request.

cc.http_status.4XX

cumulative_counter

Number of HTTP response status codes of type 4xx (client error). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle. Emitted for each Cloud Controller request.

cc.http_status.5XX

cumulative_counter

Number of HTTP response status codes of type 5xx (server error). This resets when the Cloud Controller process is restarted and is incremented at the end of each request cycle.
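
Because the cc.http_status.* counters reset when the Cloud Controller process restarts, a plain difference between two samples can be negative. The Python sketch below shows one way to compute a reset-aware increase over a series of samples. The function name and the sample values are assumptions for illustration; it also assumes that a drop between samples always means the counter restarted from zero.

```python
from typing import Sequence


def counter_increase(samples: Sequence[float]) -> float:
    """Sum the increases between consecutive samples of a cumulative counter.

    A drop between two samples is treated as a process restart: the counter
    is assumed to have restarted from zero, so the new value is counted as
    the increase for that interval.
    """
    total = 0.0
    for previous, current in zip(samples, samples[1:]):
        if current >= previous:
            total += current - previous
        else:
            # Counter went backwards: assume a restart reset it to zero.
            total += current
    return total


# Hypothetical cc.http_status.5XX samples with a restart after the third point.
samples_5xx = [10, 12, 15, 2, 4]
print(counter_increase(samples_5xx))
```

Increments that happened between the last sample before a restart and the restart itself are not visible in the data, so the result is a lower bound.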

cc.job_queue_length.cc-VM_NAME-VM_INDEX

gauge

Number of background jobs in the <VM_NAME>-<VM_INDEX> queue that have yet to run for the first time. Emitted every 30 seconds per VM.

cc.job_queue_length.cc-generic

gauge

Number of background jobs in the cc-generic queue that have yet to run for the first time. Emitted every 30 seconds per VM.

cc.job_queue_length.total

gauge

Total number of background jobs in the queues that have yet to run for the first time. Emitted every 30 seconds per VM.

cc.log_count.all

gauge

Total number of log messages, sum of messages of all severity levels. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM.

cc.log_count.debug

gauge

Number of log messages of severity debug. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM.

cc.log_count.debug1

gauge

Not used.

cc.log_count.debug2

gauge

Number of log messages of severity debug2. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM.

cc.log_count.error

cumulative_counter

Number of error log messages.

cc.log_count.fatal

cumulative_counter

Number of fatal log messages.

cc.log_count.info

gauge

Number of log messages of severity info. Examples of info messages are droplet created, copying package, uploading package, access denied due to insufficient scope, job logging, blobstore actions, staging requests, and app running requests. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM.

cc.log_count.off

gauge

Number of log messages of severity off. The count resets when the Cloud Controller process is restarted. Emitted every 30 seconds per VM.

cc.log_count.warn

cumulative_counter

Number of warn log messages.

cc.requests.completed

cumulative_counter

Number of Cloud Controller API requests completed.

cc.requests.outstanding

cumulative_counter

Number of Cloud Controller requests made but not completed.

cc.tasks_running.count

gauge

Number of tasks currently running.

cc.tasks_running.memory_in_mb

gauge

Memory being consumed by all currently running tasks. Emitted every 30 seconds per VM. This metric is only seen in version 3 of the Cloud Foundry API.

cc.thread_info.event_machine.connection_count

gauge

Number of open connections to event machine. Emitted every 30 seconds per VM.

cc.thread_info.event_machine.resultqueue.num_waiting

gauge

Number of scheduled tasks in the resultqueue. Emitted every 30 seconds per VM.

cc.thread_info.event_machine.resultqueue.size

gauge

Number of unscheduled tasks in the resultqueue. Emitted every 30 seconds per VM.

cc.thread_info.event_machine.threadqueue.num_waiting

gauge

Number of scheduled tasks in the threadqueue. Emitted every 30 seconds per VM.

cc.thread_info.event_machine.threadqueue.size

gauge

Number of unscheduled tasks in the threadqueue. Emitted every 30 seconds per VM.

cc.thread_info.thread_count

gauge

Total number of threads that are either runnable or stopped. Emitted every 30 seconds per VM.

cc.total_users

gauge

Total number of users ever created, including inactive users. Emitted every 10 minutes per VM.

cc.vcap_sinatra.recent_errors

gauge

50 most recent errors. Deprecated.

cc.vitals.cpu

gauge

Percentage of CPU used by the Cloud Controller process. Emitted every 30 seconds per VM.

cc.vitals.cpu_load_avg

gauge

System CPU load averaged over the last 1 minute according to the OS. Emitted every 30 seconds per VM.

cc.vitals.mem_bytes

gauge

The RSS bytes (resident set size) or real memory of the Cloud Controller process. Emitted every 30 seconds per VM.

cc.vitals.mem_free_bytes

gauge

Total memory available according to the OS. Emitted every 30 seconds per VM.

cc.vitals.mem_used_bytes

gauge

Total memory used (active + wired) according to the OS. Emitted every 30 seconds per VM.

cc.vitals.num_cores

gauge

The number of CPUs of a host machine. Emitted every 30 seconds per VM.

cc.vitals.uptime

gauge

The uptime of the Cloud Controller process in seconds. Emitted every 30 seconds per VM.

cc_uploader.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

cc_uploader.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

cc_uploader.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

cc_uploader.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

cc_uploader.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

cc_uploader.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

cc_uploader.numCPUS

gauge

Number of CPUs on the machine.

cc_uploader.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

container.cpu_percentage

gauge

Percentage of CPU used by this container.

container.disk_bytes

gauge

Number of bytes of disk used by this container.

container.disk_bytes_quota

gauge

Number of bytes of disk allowed for this container.

container.memory_bytes

gauge

Number of bytes of RAM used by this container.

container.memory_bytes_quota

gauge

Number of bytes of RAM allocated to this container.
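
The usage and quota gauges for a container can be combined to show how close it is to its limit. A minimal Python sketch follows, assuming both values come from the same container at roughly the same time; the function name and sample values are hypothetical.

```python
def quota_used_percent(used_bytes: float, quota_bytes: float) -> float:
    """Return usage as a percentage of the quota.

    used_bytes:  e.g. a container.memory_bytes or container.disk_bytes value.
    quota_bytes: the matching *_quota value for the same container.
    """
    if quota_bytes <= 0:
        raise ValueError("quota must be positive")
    return 100.0 * used_bytes / quota_bytes


# Hypothetical container using 256 MiB of a 1 GiB memory quota.
used = 256 * 1024 * 1024
quota = 1024 * 1024 * 1024
print(quota_used_percent(used, quota))
```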

etcd.CompareAndDeleteFail

gauge

CompareAndDeleteFail operation count. Emitted every 30 seconds.

etcd.CompareAndDeleteSuccess

gauge

CompareAndDeleteSuccess operation count. Emitted every 30 seconds.

etcd.CompareAndSwapFail

gauge

CompareAndSwapFail operation count. Emitted every 30 seconds.

etcd.CompareAndSwapSuccess

gauge

CompareAndSwapSuccess operation count. Emitted every 30 seconds.

etcd.CreateFail

gauge

CreateFail operation count. Emitted every 30 seconds.

etcd.CreateSuccess

gauge

CreateSuccess operation count. Emitted every 30 seconds.

etcd.DeleteFail

gauge

DeleteFail operation count. Emitted every 30 seconds.

etcd.DeleteSuccess

gauge

DeleteSuccess operation count. Emitted every 30 seconds.

etcd.EtcdIndex

gauge

X-Etcd-Index value from the /stats/store endpoint. Emitted every 30 seconds.

etcd.ExpireCount

gauge

ExpireCount operation count. Emitted every 30 seconds.

etcd.Followers

gauge

Number of etcd followers. Emitted every 30 seconds.

etcd.GetsFail

gauge

GetsFail operation count. Emitted every 30 seconds.

etcd.GetsSuccess

gauge

GetsSuccess operation count. Emitted every 30 seconds.

etcd.IsLeader

gauge

1 if the current server is the leader, 0 if it is a follower. Emitted every 30 seconds.
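
Since etcd.IsLeader is 1 on the leader and 0 on followers, summing the latest value across nodes gives the number of nodes that currently consider themselves leader. The Python sketch below illustrates that check; the function names and node names are assumptions for illustration only.

```python
from typing import Mapping


def count_leaders(is_leader_by_node: Mapping[str, float]) -> int:
    """Count the nodes whose latest etcd.IsLeader value is 1."""
    return sum(1 for value in is_leader_by_node.values() if value == 1)


def has_single_leader(is_leader_by_node: Mapping[str, float]) -> bool:
    """Return True when exactly one node reports itself as leader."""
    return count_leaders(is_leader_by_node) == 1


# Hypothetical latest etcd.IsLeader values keyed by node name.
latest = {"etcd-0": 0, "etcd-1": 1, "etcd-2": 0}
print(has_single_leader(latest))
```

A count of zero or more than one would be worth investigating, although brief transitions during an election are possible.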

etcd.Latency

gauge

Current latency in milliseconds from leader to a specific follower. Emitted every 30 seconds.

etcd.RaftIndex

gauge

X-Raft-Index value from the /stats/store endpoint. Emitted every 30 seconds.

etcd.RaftTerm

gauge

X-Raft-Term value from the /stats/store endpoint. Emitted every 30 seconds.

etcd.ReceivedAppendRequests

gauge

Number of append requests this node has processed. Emitted every 30 seconds.

etcd.ReceivingBandwidthRate

gauge

Number of bytes per second this node is receiving (follower only). Emitted every 30 seconds.

etcd.ReceivingRequestRate

gauge

Number of requests per second this node is receiving (follower only). Emitted every 30 seconds.

etcd.SendingBandwidthRate

gauge

Number of bytes per second this node is sending (leader only). This value is undefined on single member clusters. Emitted every 30 seconds.

etcd.SendingRequestRate

gauge

Number of requests per second this node is sending (leader only). This value is undefined on single member clusters. Emitted every 30 seconds.

etcd.SentAppendRequests

gauge

Number of append requests that this node has sent. Emitted every 30 seconds.

etcd.SetsFail

gauge

SetsFail operation count. Emitted every 30 seconds.

etcd.SetsSuccess

gauge

SetsSuccess operation count. Emitted every 30 seconds.

etcd.UpdateFail

gauge

UpdateFail operation count. Emitted every 30 seconds.

etcd.UpdateSuccess

gauge

UpdateSuccess operation count. Emitted every 30 seconds.

etcd.Watchers

gauge

Watchers operation count. Emitted every 30 seconds.

file_server.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

file_server.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

file_server.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

file_server.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

file_server.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

file_server.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

file_server.numCPUS

gauge

Number of CPUs on the machine.

file_server.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

garden_linux.BackingStores

gauge

Number of container backing store files. Emitted every 30 seconds.

garden_linux.DepotDirs

gauge

Number of directories in the Garden depot. Emitted every 30 seconds.

garden_linux.LoopDevices

gauge

Number of attached loop devices. Emitted every 30 seconds.

garden_linux.MetricsReporting

gauge

How long it took to emit the BackingStores, DepotDirs, and LoopDevices metrics. Emitted every 30 seconds.

garden_linux.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

garden_linux.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

garden_linux.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

garden_linux.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

garden_linux.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

garden_linux.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

garden_linux.numCPUS

gauge

Number of CPUs on the machine.

garden_linux.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

gorouter.bad_gateways

cumulative_counter

Number of bad gateway events.

gorouter.responses

cumulative_counter

Number of router responses.

gorouter.total_requests

cumulative_counter

Number of router requests received.

gorouter.total_routes

gauge

Number of registered routes.

nsync_bulker.DesiredLRPSyncDuration

gauge

Time in nanoseconds that the nsync-bulker took to synchronize CF apps and Diego DesiredLRPs. Emitted every 30 seconds.

nsync_bulker.LRPsDesired

gauge

Cumulative number of LRPs desired through the nsync API. Emitted on each request desiring a new LRP, every 30 seconds.

nsync_bulker.LockHeld.v1-locks-nsync_bulker_lock

gauge

Whether an nsync-bulker holds the nsync-bulker lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active nsync-bulker.

nsync_bulker.LockHeldDuration.v1-locks-nsync_bulker_lock

gauge

Time in nanoseconds that the active nsync-bulker has held the convergence lock. Emitted every 30 seconds by the active nsync-bulker.

nsync_bulker.NsyncInvalidDesiredLRPsFound

gauge

Number of invalid DesiredLRPs found during nsync-bulker periodic synchronization. Emitted every 30 seconds.

nsync_bulker.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

nsync_bulker.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

nsync_bulker.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

nsync_bulker.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

nsync_bulker.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

nsync_bulker.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

nsync_bulker.numCPUS

gauge

Number of CPUs on the machine.

nsync_bulker.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

nsync_listener.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

nsync_listener.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

nsync_listener.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

nsync_listener.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

nsync_listener.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

nsync_listener.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

nsync_listener.numCPUS

gauge

Number of CPUs on the machine.

nsync_listener.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

rep.CM

gauge

Emitted every 30 seconds.

rep.CapacityRemainingContainers

gauge

Remaining number of containers this cell can host. Emitted every 60 seconds.

rep.CapacityRemainingDisk

gauge

Amount of disk available to allocate in the cell, in megabytes.

rep.CapacityRemainingMemory

gauge

Amount of memory available to allocate in the cell, in megabytes.

rep.CapacityTotalContainers

gauge

Total number of containers this cell can host. Emitted every 60 seconds.

rep.CapacityTotalDisk

gauge

Total amount of disk in a cell, in megabytes.

rep.CapacityTotalMemory

gauge

Total amount of memory in a cell, in megabytes.
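
The remaining and total capacity gauges can be combined into a cluster-wide percentage, which is one way to judge whether more cells are needed. The Python sketch below assumes one (remaining, total) pair per cell, both in megabytes as reported by the rep.CapacityRemaining* and rep.CapacityTotal* gauges; the function name and sample values are hypothetical.

```python
from typing import Iterable, Tuple


def cluster_percent_remaining(cells: Iterable[Tuple[float, float]]) -> float:
    """Return remaining capacity as a percentage of total across all cells.

    cells: (remaining_mb, total_mb) pairs, one per Diego cell.
    """
    remaining = 0.0
    total = 0.0
    for remaining_mb, total_mb in cells:
        remaining += remaining_mb
        total += total_mb
    if total <= 0:
        raise ValueError("total capacity must be positive")
    return 100.0 * remaining / total


# Hypothetical memory readings for two cells of 16384 MB each.
cells = [(4096, 16384), (8192, 16384)]
print(cluster_percent_remaining(cells))
```

Summing before dividing weights each cell by its size, which differs from averaging per-cell percentages when cells are not uniform.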

rep.ContainerCount

gauge

Number of Diego containers currently running.

rep.GardenContainerCreationDuration

gauge

Time in nanoseconds that the rep Garden backend took to create a container. Emitted after every successful container creation.

rep.LogMessage

gauge

Emitted every 30 seconds.

rep.RepBulkSyncDuration

gauge

Time in nanoseconds that the cell rep took to synchronize the ActualLRPs it has claimed with its actual garden containers. Emitted every 30 seconds by each rep.

rep.UnhealthyCell

gauge

Number of unhealthy Diego cells.

rep.logSenderTotalMessagesRead

cumulative_counter

Count of application log messages sent by Diego Executor. Emitted every 30 seconds.

rep.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

rep.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

rep.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

rep.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

rep.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

rep.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

rep.numCPUS

gauge

Number of CPUs on the machine.

rep.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

route_emitter.LockHeld.v1-locks-route_emitter_lock

gauge

Whether a route-emitter holds the route-emitter lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active route-emitter.

route_emitter.LockHeldDuration.v1-locks-route_emitter_lock

gauge

Time in nanoseconds that the active route-emitter has held the route-emitter lock. Emitted every 30 seconds by the active route-emitter.

route_emitter.MessagesEmitted

cumulative_counter

The cumulative number of registration messages that this process has sent. Emitted every 30 seconds.

route_emitter.RouteEmitterSyncDuration

gauge

Time in nanoseconds that the active route-emitter took to perform its synchronization pass. Emitted every 60 seconds.

route_emitter.RoutesRegistered

cumulative_counter

Cumulative number of route registrations emitted from the route-emitter as it reacts to changes to LRPs. Emitted every 30 seconds.

route_emitter.RoutesSynced

cumulative_counter

Cumulative number of route registrations emitted from the route-emitter during its periodic route-table synchronization. Emitted every 30 seconds.

route_emitter.RoutesTotal

gauge

Number of routes in the route-emitter's routing table. Emitted every 30 seconds.

route_emitter.RoutesUnregistered

cumulative_counter

Cumulative number of route unregistrations emitted from the route-emitter as it reacts to changes to LRPs. Emitted every 30 seconds.

route_emitter.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

route_emitter.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

route_emitter.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

route_emitter.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

route_emitter.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

route_emitter.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

route_emitter.numCPUS

gauge

Number of CPUs on the machine.

route_emitter.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

ssh_proxy.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

ssh_proxy.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

ssh_proxy.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

ssh_proxy.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

ssh_proxy.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

ssh_proxy.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

ssh_proxy.numCPUS

gauge

Number of CPUs on the machine.

ssh_proxy.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

stager.StagingRequestFailedDuration

gauge

Time in nanoseconds that the failed staging task took to run. Emitted each time a staging task fails.

stager.StagingRequestSucceededDuration

gauge

Time in nanoseconds that the successful staging task took to run. Emitted each time a staging task completes successfully.

stager.StagingRequestsFailed

gauge

Cumulative number of failed staging tasks handled by each stager. Emitted every time a staging task fails.

stager.StagingRequestsSucceeded

gauge

Cumulative number of successful staging tasks handled by each stager. Emitted every time a staging task completes successfully.

stager.StagingStartRequestsReceived

gauge

Cumulative number of requests to start a staging task. Emitted by a stager each time it handles a request.

stager.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

stager.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

stager.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

stager.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

stager.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

stager.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

stager.numCPUS

gauge

Number of CPUs on the machine.

stager.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

syslog_drain_binder.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

syslog_drain_binder.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

syslog_drain_binder.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

syslog_drain_binder.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

syslog_drain_binder.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

syslog_drain_binder.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

syslog_drain_binder.numCPUS

gauge

Number of CPUs on the machine.

syslog_drain_binder.numGoRoutines

gauge

Instantaneous number of active goroutines in the Doppler process.

syslog_drain_binder.pollCount

cumulative_counter

Number of times the syslog drain binder has polled the cloud controller for syslog drain bindings. Emitted every 30 seconds.

syslog_drain_binder.totalDrains

gauge

Number of syslog drains returned by cloud controller. Emitted every 30 seconds.

system.cpu.sys

gauge

CPU load consumed by the kernel.

system.cpu.user

gauge

CPU load consumed by userspace.

system.cpu.wait

gauge

Time the CPU spent waiting for I/O.

system.disk.ephemeral.percent

gauge

Percentage of the ephemeral disk used.

system.disk.system.percent

gauge

Percentage of the system disk used.

system.healthy

gauge

Overall status of system health.

system.mem.percent

gauge

Percentage of RAM used.

system.swap.percent

gauge

Percentage of swap space used.

tps_listener.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

tps_listener.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

tps_listener.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

tps_listener.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

tps_listener.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

tps_listener.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

tps_listener.numCPUS

gauge

Number of CPUs on the machine.

tps_listener.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

tps_watcher.LockHeld.v1-locks-tps_watcher_lock

gauge

Whether a tps-watcher holds the tps-watcher lock: 1 means the lock is held, and 0 means the lock was lost. Emitted every 30 seconds by the active tps-watcher.

tps_watcher.LockHeldDuration.v1-locks-tps_watcher_lock

gauge

Time in nanoseconds that the active tps-watcher has held the convergence lock. Emitted every 30 seconds by the active tps-watcher.

tps_watcher.memoryStats.lastGCPauseTimeNS

gauge

Duration in nanoseconds of the last garbage collector pause.

tps_watcher.memoryStats.numBytesAllocated

gauge

Instantaneous count of bytes allocated and still in use.

tps_watcher.memoryStats.numBytesAllocatedHeap

gauge

Instantaneous count of bytes allocated on the main heap and still in use.

tps_watcher.memoryStats.numBytesAllocatedStack

gauge

Instantaneous count of bytes used by the stack allocator.

tps_watcher.memoryStats.numFrees

gauge

Lifetime number of memory deallocations.

tps_watcher.memoryStats.numMallocs

gauge

Lifetime number of memory allocations.

tps_watcher.numCPUS

gauge

Number of CPUs on the machine. Emitted every 30 seconds.

tps_watcher.numGoRoutines

gauge

Instantaneous number of active goroutines in the process.

uaa.audit_service.client_authentication_count

cumulative_counter

Number of client authentication attempts.

uaa.audit_service.client_authentication_failure_count

cumulative_counter

Number of failed client authentication attempts.

uaa.audit_service.principal_authentication_failure_count

cumulative_counter

Number of failed principal authentication attempts.

uaa.audit_service.principal_not_found_count

cumulative_counter

Number of times a non-user was not found.

uaa.audit_service.user_authentication_count

cumulative_counter

Number of times a user has successfully authenticated.

uaa.audit_service.user_authentication_failure_count

cumulative_counter

Number of failed user authentication attempts.

uaa.audit_service.user_not_found_count

cumulative_counter

Number of times a user was not found.

uaa.audit_service.user_password_changes

cumulative_counter

Number of times a user password has changed.

uaa.audit_service.user_password_failures

cumulative_counter

Number of times a user password change has failed.