---
slug: /en/operations/monitoring
sidebar_position: 45
sidebar_label: Monitoring
description: You can monitor the utilization of hardware resources and also ClickHouse server metrics.
keywords: [monitoring, observability, advanced dashboard, dashboard, observability dashboard]
---

# Monitoring
import SelfManaged from '@site/docs/en/_snippets/_self_managed_only_automated.md';

<SelfManaged />

You can monitor:

- Utilization of hardware resources.
- ClickHouse server metrics.

## Built-in advanced observability dashboard

<img width="400" alt="Built-in advanced observability dashboard" src="https://github.com/ClickHouse/ClickHouse/assets/3936029/2bd10011-4a47-4b94-b836-d44557c7fdc1" />

ClickHouse includes a built-in advanced observability dashboard, available at `$HOST:$PORT/dashboard` (a user name and password are required). It shows the following metrics:
- Queries/second
- CPU usage (cores)
- Queries running
- Merges running
- Selected bytes/second
- IO wait
- CPU wait
- OS CPU Usage (userspace)
- OS CPU Usage (kernel)
- Read from disk
- Read from filesystem
- Memory (tracked)
- Inserted rows/second
- Total MergeTree parts
- Max parts for partition

## Resource Utilization {#resource-utilization}

ClickHouse also monitors the state of hardware resources itself, such as:

- Load and temperature on processors.
- Utilization of the storage system, RAM, and network.

This data is collected in the `system.asynchronous_metric_log` table.
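
As a minimal sketch of reading this table, the snippet below builds a query for the recent values of one metric and sends it through the HTTP interface. The `metric`, `value`, and `event_time` column names follow recent server versions and may differ on older ones. The helper names are illustrative, and the metric name in the usage note is only an example.

```python
import re
from urllib.parse import urlencode
from urllib.request import urlopen


def recent_metric_query(metric: str, minutes: int = 10) -> str:
    """Build a SELECT for the last `minutes` of one asynchronous metric."""
    # Allow only letters, digits, underscores and dots so the name can be
    # embedded in the SQL text safely.
    if not re.fullmatch(r"[A-Za-z0-9_.]+", metric):
        raise ValueError(f"unexpected metric name: {metric!r}")
    return (
        "SELECT event_time, value "
        "FROM system.asynchronous_metric_log "
        f"WHERE metric = '{metric}' "
        f"AND event_time > now() - INTERVAL {int(minutes)} MINUTE "
        "ORDER BY event_time "
        "FORMAT TabSeparated"
    )


def fetch(base_url: str, query: str, timeout: float = 5.0) -> str:
    """Run a read-only query over HTTP and return the raw response text."""
    url = f"{base_url}/?{urlencode({'query': query})}"
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

For example, `fetch("http://localhost:8123", recent_metric_query("LoadAverage1"))` would return tab-separated rows, assuming a local server on port 8123 that accepts the default user without a password.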

## ClickHouse Server Metrics {#clickhouse-server-metrics}

The ClickHouse server has built-in instruments for monitoring its own state.

To track server events, use the server logs. See the [logger](../operations/server-configuration-parameters/settings.md#server_configuration_parameters-logger) section of the configuration file.

ClickHouse collects:

- Various metrics on how the server uses computational resources.
- Common statistics on query processing.

You can find metrics in the [system.metrics](../operations/system-tables/metrics.md#system_tables-metrics), [system.events](../operations/system-tables/events.md#system_tables-events), and [system.asynchronous_metrics](../operations/system-tables/asynchronous_metrics.md#system_tables-asynchronous_metrics) tables.
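
Values in `system.events` are cumulative counters, so a per-second rate is derived from two snapshots. The sketch below assumes each snapshot is the tab-separated output of a query such as `SELECT event, value FROM system.events FORMAT TabSeparated`. The function names are illustrative.

```python
from typing import Dict


def parse_counters(tsv: str) -> Dict[str, int]:
    """Turn 'name<TAB>value' lines into a dictionary of counters."""
    counters: Dict[str, int] = {}
    for line in tsv.splitlines():
        if not line:
            continue
        name, value = line.split("\t")
        counters[name] = int(value)
    return counters


def rates(before: Dict[str, int], after: Dict[str, int], seconds: float) -> Dict[str, float]:
    """Compute per-second rates between two snapshots taken `seconds` apart."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # An event missing from the earlier snapshot is treated as a zero count.
    return {
        name: (after[name] - before.get(name, 0)) / seconds
        for name in after
    }
```

Sampling twice with a known interval and passing both snapshots to `rates` gives figures comparable to the per-second charts on the dashboard.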

You can configure ClickHouse to export metrics to [Graphite](https://github.com/graphite-project). See the [Graphite section](../operations/server-configuration-parameters/settings.md#server_configuration_parameters-graphite) in the ClickHouse server configuration file. Before configuring metric export, set up Graphite by following its official [guide](https://graphite.readthedocs.io/en/latest/install.html).

You can configure ClickHouse to export metrics to [Prometheus](https://prometheus.io). See the [Prometheus section](../operations/server-configuration-parameters/settings.md#server_configuration_parameters-prometheus) in the ClickHouse server configuration file. Before configuring metric export, set up Prometheus by following its official [guide](https://prometheus.io/docs/prometheus/latest/installation/).
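
To check an export by hand, it can help to read the scrape output directly. The sketch below parses unlabeled samples from the Prometheus text exposition format. The function name is illustrative, and the endpoint and port to fetch from depend on what is set in the Prometheus section of the server configuration.

```python
from typing import Dict


def parse_exposition(text: str) -> Dict[str, float]:
    """Parse unlabeled samples from Prometheus text exposition format."""
    samples: Dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        name, _, rest = line.partition(" ")
        # Labeled samples are out of scope for this sketch.
        if "{" in name or not rest:
            continue
        # The first field after the name is the value; a timestamp may follow.
        samples[name] = float(rest.split()[0])
    return samples
```

Prometheus performs this parsing itself when it scrapes the server, so a helper like this is useful only for ad hoc checks.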

Additionally, you can monitor server availability through the HTTP API. Send an HTTP `GET` request to `/ping`. If the server is available, it responds with `200 OK`.
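
A liveness probe built on `/ping` can be sketched as follows. The function name and the `opener` parameter, which exists only so the network call can be replaced in tests, are illustrative. The base URL passed in is an assumption about where the HTTP interface listens.

```python
from urllib.request import urlopen


def is_alive(base_url: str, timeout: float = 2.0, opener=urlopen) -> bool:
    """Return True if GET {base_url}/ping answers with status 200."""
    try:
        with opener(f"{base_url}/ping", timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Connection errors, timeouts and HTTP error statuses all derive
        # from OSError in urllib, so any of them counts as "not available".
        return False
```

For example, `is_alive("http://localhost:8123")` checks a local server, assuming the default HTTP port.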

To monitor servers in a cluster configuration, set the [max_replica_delay_for_distributed_queries](../operations/settings/settings.md#max_replica_delay_for_distributed_queries) parameter and use the HTTP resource `/replicas_status`. A request to `/replicas_status` returns `200 OK` if the replica is available and does not lag behind the other replicas. If a replica is delayed, the request returns `503 HTTP_SERVICE_UNAVAILABLE` with information about the gap.
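
The two outcomes of `/replicas_status` can be handled as in the sketch below. The function names are illustrative, and the response body is passed through as-is because its exact layout is not specified here. Note that `urlopen` raises `HTTPError` for a 503 response, so the delay details are read from the exception.

```python
from urllib.error import HTTPError
from urllib.request import urlopen


def replica_state(status: int, body: str) -> str:
    """Map a /replicas_status response to a short description."""
    if status == 200:
        return "ok"
    if status == 503:
        # The body carries the information about the replication gap.
        return f"delayed: {body.strip()}"
    return f"unexpected status {status}"


def check_replica(base_url: str, timeout: float = 2.0) -> str:
    """Query /replicas_status on one server and describe the result."""
    try:
        with urlopen(f"{base_url}/replicas_status", timeout=timeout) as response:
            return replica_state(response.status, response.read().decode("utf-8"))
    except HTTPError as error:
        return replica_state(error.code, error.read().decode("utf-8"))
```

Running `check_replica` against each replica in turn gives a load balancer or alerting script a simple signal for excluding lagging replicas.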