# Monitoring {#monitoring}

You can monitor:

- Utilization of hardware resources.
- ClickHouse server metrics.

## Resource Utilization {#resource-utilization}

ClickHouse does not monitor the state of hardware resources by itself.

It is highly recommended to set up monitoring for:

- Load and temperature on processors.

  You can use [dmesg](https://en.wikipedia.org/wiki/Dmesg), [turbostat](https://www.linux.org/docs/man8/turbostat.html), or other tools.

- Utilization of storage system, RAM and network.

## ClickHouse Server Metrics {#clickhouse-server-metrics}

The ClickHouse server has built-in instruments for monitoring its own state.

To track server events, use the server logs. See the [logger](server_settings/settings.md#server_settings-logger) section of the configuration file.

ClickHouse collects:

- Metrics of how the server uses computational resources.
- Common statistics on query processing.

You can find metrics in the [system.metrics](system_tables.md#system_tables-metrics), [system.events](system_tables.md#system_tables-events), and [system.asynchronous\_metrics](system_tables.md#system_tables-asynchronous_metrics) tables.
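As a minimal sketch, these tables can be read over the HTTP interface and turned into a name-to-value mapping. The example assumes the server accepts unauthenticated `SELECT` queries over HTTP at the base URL you pass in; `fetch_metrics` and `parse_metrics` are hypothetical helper names, not part of ClickHouse.

```python
import urllib.parse
import urllib.request
from typing import Dict


def parse_metrics(tsv: str) -> Dict[str, float]:
    """Parse two-column TabSeparated output (name, value) into a dict."""
    metrics = {}
    for line in tsv.splitlines():
        if not line:
            continue
        name, value = line.split("\t", 1)
        metrics[name] = float(value)
    return metrics


def fetch_metrics(base_url: str, table: str = "system.metrics",
                  timeout: float = 5.0) -> Dict[str, float]:
    """Read one of the metric tables through the HTTP interface.

    Assumption: the name column is `event` in system.events and `metric`
    in system.metrics and system.asynchronous_metrics.
    """
    column = "event" if table == "system.events" else "metric"
    query = f"SELECT {column}, value FROM {table} FORMAT TabSeparated"
    url = f"{base_url}/?{urllib.parse.urlencode({'query': query})}"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_metrics(resp.read().decode("utf-8"))
```

For example, `fetch_metrics("http://localhost:8123", "system.events")` would return the event counters, assuming the HTTP interface listens on port 8123 of the local host.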

You can configure ClickHouse to export metrics to [Graphite](https://github.com/graphite-project). See the [Graphite section](server_settings/settings.md#server_settings-graphite) in the ClickHouse server configuration file. Before configuring the export of metrics, set up Graphite by following its official [guide](https://graphite.readthedocs.io/en/latest/install.html).

Additionally, you can monitor server availability through the HTTP API. Send an HTTP `GET` request to `/ping`. If the server is available, it responds with `200 OK`.
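The availability check can be scripted. The following is a minimal sketch using only the Python standard library; `is_server_available` is a hypothetical helper name, and the base URL in the usage note assumes the HTTP interface listens on port 8123 of the local host.

```python
import http.client
import urllib.error
import urllib.request


def is_server_available(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if GET <base_url>/ping answers with HTTP 200."""
    try:
        with urllib.request.urlopen(f"{base_url}/ping", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError, http.client.HTTPException):
        # Connection refused, timeout, DNS failure, malformed reply,
        # or an HTTP error status: treat all of them as "not available".
        return False
```

A monitoring job could call `is_server_available("http://localhost:8123")` on a schedule and raise an alert when it returns `False`.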

To monitor servers in a cluster configuration, you should set the [max\_replica\_delay\_for\_distributed\_queries](settings/settings.md#settings-max_replica_delay_for_distributed_queries) parameter and use the HTTP resource `/replicas_status`. A request to `/replicas_status` returns `200 OK` if the replica is available and is not delayed behind the other replicas. If a replica is delayed, it returns `503 HTTP_SERVICE_UNAVAILABLE` with information about the gap.
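
A replica check can follow the same pattern. This is a sketch under the assumption that the HTTP interface is reachable at the base URL you pass in; `replica_status` is a hypothetical helper name. It returns whether the replica is healthy together with the response body, which on a `503` carries the information about the gap.

```python
import urllib.error
import urllib.request
from typing import Tuple


def replica_status(base_url: str, timeout: float = 2.0) -> Tuple[bool, str]:
    """Query <base_url>/replicas_status and return (healthy, details)."""
    url = f"{base_url}/replicas_status"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200, resp.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        # A delayed replica answers 503; the body describes the delay.
        return False, err.read().decode("utf-8", "replace")
    except OSError as err:
        # Connection refused, timeout, or DNS failure.
        return False, f"unreachable: {err}"
```

Returning the body alongside the flag lets an alert include the reported delay instead of only a pass or fail result.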