---
slug: /en/engines/table-engines/mergetree-family/custom-partitioning-key
sidebar_position: 30
sidebar_label: Custom Partitioning Key
---

# Custom Partitioning Key

:::note
In most cases you do not need a partition key, and in most of the remaining cases you do not need a partition key more granular than by month.

Never use overly granular partitioning. Do not partition your data by client identifiers or names. Instead, make the client identifier or name the first column in the ORDER BY expression.
:::

Partitioning is available for the [MergeTree](../../../engines/table-engines/mergetree-family/mergetree.md) family tables (including [replicated](../../../engines/table-engines/mergetree-family/replication.md) tables). [Materialized views](../../../engines/table-engines/special/materializedview.md#materializedview) based on MergeTree tables support partitioning, as well.

A partition is a logical grouping of the records in a table by a specified criterion. You can partition by an arbitrary criterion, such as by month, by day, or by event type. Each partition is stored separately, which simplifies manipulating its data. When accessing the data, ClickHouse uses the smallest subset of partitions possible.

The partition is specified in the `PARTITION BY expr` clause when [creating a table](../../../engines/table-engines/mergetree-family/mergetree.md#table_engine-mergetree-creating-a-table). The partition key can be any expression from the table columns. For example, to specify partitioning by month, use the expression `toYYYYMM(date_column)`:

``` sql
CREATE TABLE visits
(
    VisitDate Date,
    Hour UInt8,
    ClientID UUID
)
ENGINE = MergeTree()
PARTITION BY toYYYYMM(VisitDate)
ORDER BY Hour;
```
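
As a rough illustration of how a query can benefit from this layout, the sketch below filters `visits` on `VisitDate`, so only parts belonging to the February 2019 partition should need to be read. The date range is an arbitrary assumption. `EXPLAIN indexes = 1` can be used to inspect which parts are selected; its exact output depends on the server version.

``` sql
-- Only the 201902 partition can contain matching rows.
SELECT count()
FROM visits
WHERE VisitDate >= '2019-02-01' AND VisitDate < '2019-03-01';

-- Show which indexes (including the partition key) were used to filter parts.
EXPLAIN indexes = 1
SELECT count()
FROM visits
WHERE VisitDate >= '2019-02-01' AND VisitDate < '2019-03-01';
```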

The partition key can also be a tuple of expressions (similar to the [primary key](../../../engines/table-engines/mergetree-family/mergetree.md#primary-keys-and-indexes-in-queries)). For example:

``` sql
ENGINE = ReplicatedCollapsingMergeTree('/clickhouse/tables/name', 'replica1', Sign)
PARTITION BY (toMonday(StartDate), EventType)
ORDER BY (CounterID, StartDate, intHash32(UserID));
```

In this example, the data is partitioned by week (each week starting on Monday) and, within each week, by event type.

By default, a floating-point partition key is not supported. To use one, enable the setting [allow_floating_point_partition_key](../../../operations/settings/merge-tree-settings.md#allow_floating_point_partition_key).
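
A minimal sketch of enabling this setting at table creation is shown below. The table `sensor_readings` and its columns are hypothetical and exist only for illustration.

``` sql
-- Hypothetical table: partitioning by a Float32 column is rejected
-- unless the MergeTree setting below is enabled.
CREATE TABLE sensor_readings
(
    Bucket Float32,
    Value Float64
)
ENGINE = MergeTree
PARTITION BY Bucket
ORDER BY tuple()
SETTINGS allow_floating_point_partition_key = 1;
```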

When new data is inserted into a table, it is stored as a separate part (chunk) sorted by the primary key. Within 10-15 minutes after the insert, the parts of the same partition are merged into a single part.

:::info
A merge only works for data parts that have the same value for the partitioning expression. This means **you shouldn’t make overly granular partitions** (more than about a thousand partitions). Otherwise, `SELECT` queries perform poorly because of an unreasonably large number of files in the file system and open file descriptors.
:::
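
One way to check whether a table is approaching this limit is to count its distinct partitions and active parts in `system.parts`. The query below is only a sketch. The aliases `partitions` and `active_parts` are chosen for illustration.

``` sql
-- Count distinct partitions and active parts per table,
-- listing the most heavily partitioned tables first.
SELECT
    database,
    table,
    uniqExact(partition) AS partitions,
    count() AS active_parts
FROM system.parts
WHERE active
GROUP BY database, table
ORDER BY partitions DESC;
```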

Use the [system.parts](../../../operations/system-tables/parts.md#system_tables-parts) table to view the table parts and partitions. For example, let’s assume that we have a `visits` table with partitioning by month. Let’s run a `SELECT` query against the `system.parts` table:

``` sql
SELECT
    partition,
    name,
    active
FROM system.parts
WHERE table = 'visits'
```

``` text
┌─partition─┬─name──────────────┬─active─┐
│ 201901    │ 201901_1_3_1      │      0 │
│ 201901    │ 201901_1_9_2_11   │      1 │
│ 201901    │ 201901_8_8_0      │      0 │
│ 201901    │ 201901_9_9_0      │      0 │
│ 201902    │ 201902_4_6_1_11   │      1 │
│ 201902    │ 201902_10_10_0_11 │      1 │
│ 201902    │ 201902_11_11_0_11 │      1 │
└───────────┴───────────────────┴────────┘
```

The `partition` column contains the names of the partitions. There are two partitions in this example: `201901` and `201902`. You can use this column value to specify the partition name in [ALTER … PARTITION](../../../sql-reference/statements/alter/partition.md) queries.

The `name` column contains the names of the partition data parts. You can use this column to specify the name of the part in the [ALTER ATTACH PART](../../../sql-reference/statements/alter/partition.md#alter_attach-partition) query.

Let’s break down the part name `201901_1_9_2_11`:

-   `201901` is the partition name.
-   `1` is the minimum number of the data block.
-   `9` is the maximum number of the data block.
-   `2` is the chunk level (the depth of the merge tree it is formed from).
-   `11` is the mutation version (present if the part has been mutated).

:::info
The parts of old-type tables have the name: `20190117_20190123_2_2_0` (minimum date - maximum date - minimum block number - maximum block number - level).
:::
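
The components of a part name are also exposed as separate columns in `system.parts`, so they can be read without parsing the name. The following sketch assumes that `data_version` corresponds to the mutation version suffix described above.

``` sql
-- Each column mirrors one component of the part name:
-- <partition>_<min_block_number>_<max_block_number>_<level>[_<data_version>]
SELECT
    name,
    partition,
    min_block_number,
    max_block_number,
    level,
    data_version
FROM system.parts
WHERE table = 'visits' AND active;
```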

The `active` column shows the status of the part. `1` is active; `0` is inactive. The inactive parts are, for example, source parts remaining after merging to a larger part. The corrupted data parts are also indicated as inactive.

As you can see in the example, there are several separate parts of the same partition (for example, `201901_1_3_1` and `201901_1_9_2_11`). This means that these parts have not been merged yet. ClickHouse merges the inserted parts of data periodically, approximately 15 minutes after inserting. In addition, you can perform a non-scheduled merge using the [OPTIMIZE](../../../sql-reference/statements/optimize.md) query. Example:

``` sql
OPTIMIZE TABLE visits PARTITION 201902;
```

``` text
┌─partition─┬─name─────────────┬─active─┐
│ 201901    │ 201901_1_3_1     │      0 │
│ 201901    │ 201901_1_9_2_11  │      1 │
│ 201901    │ 201901_8_8_0     │      0 │
│ 201901    │ 201901_9_9_0     │      0 │
│ 201902    │ 201902_4_6_1     │      0 │
│ 201902    │ 201902_4_11_2_11 │      1 │
│ 201902    │ 201902_10_10_0   │      0 │
│ 201902    │ 201902_11_11_0   │      0 │
└───────────┴──────────────────┴────────┘
```

Inactive parts will be deleted approximately 10 minutes after merging.
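
To see which parts are still waiting for cleanup, you can filter `system.parts` on `active = 0`. This is a sketch that assumes the `remove_time` column reports when a part became inactive.

``` sql
-- Inactive parts that have not been deleted from disk yet.
SELECT
    partition,
    name,
    remove_time
FROM system.parts
WHERE table = 'visits' AND active = 0
ORDER BY remove_time;
```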

Another way to view a set of parts and partitions is to go into the directory of the table: `/var/lib/clickhouse/data/<database>/<table>/`. For example:

``` bash
/var/lib/clickhouse/data/default/visits$ ls -l
total 40
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  1 16:48 201901_1_3_1
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201901_1_9_2_11
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 15:52 201901_8_8_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 15:52 201901_9_9_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201902_10_10_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201902_11_11_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:19 201902_4_11_2_11
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 12:09 201902_4_6_1
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  1 16:48 detached
```

The folders `201901_1_3_1`, `201901_1_9_2_11`, and so on are the directories of the parts. Each part belongs to a corresponding partition and contains data for a single month only (the table in this example is partitioned by month).

The `detached` directory contains parts that were detached from the table using the [DETACH](../../../sql-reference/statements/alter/partition.md#alter_detach-partition) query. The corrupted parts are also moved to this directory, instead of being deleted. The server does not use the parts from the `detached` directory. You can add, delete, or modify the data in this directory at any time – the server will not know about this until you run the [ATTACH](../../../sql-reference/statements/alter/partition.md#alter_attach-partition) query.
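
A minimal sketch of a detach and attach round trip for the `visits` table follows. The partition `201901` is taken from the example above. The `system.detached_parts` table lists what is currently in the `detached` directory.

``` sql
-- Detach a whole partition; its parts move to the `detached` directory.
ALTER TABLE visits DETACH PARTITION 201901;

-- List what currently sits in `detached` for this table.
SELECT
    partition_id,
    name,
    reason
FROM system.detached_parts
WHERE table = 'visits';

-- Make the partition visible to the server again.
ALTER TABLE visits ATTACH PARTITION 201901;
```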

Note that on a running server, you must not manually change the set of parts or their data on the file system, since the server will not know about it. For non-replicated tables, you can do this when the server is stopped, but it isn’t recommended. For replicated tables, the set of parts cannot be changed in any case.

ClickHouse allows you to perform operations with the partitions: delete them, copy from one table to another, or create a backup. See the list of all operations in the section [Manipulations With Partitions and Parts](../../../sql-reference/statements/alter/partition.md#alter_manipulations-with-partitions).
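
The statements below sketch the three operations mentioned above for the `visits` table. The table `visits_copy` is hypothetical. It is created with the same structure and partition key so that partitions can be copied into it.

``` sql
-- Hypothetical target table with the same structure and engine as `visits`.
CREATE TABLE visits_copy AS visits;

-- Copy the 201902 partition from `visits` into `visits_copy`.
ALTER TABLE visits_copy REPLACE PARTITION 201902 FROM visits;

-- Create a local backup of the 201902 partition.
ALTER TABLE visits FREEZE PARTITION 201902;

-- Delete the 201901 partition.
ALTER TABLE visits DROP PARTITION 201901;
```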

## Group By optimisation using partition key

For some combinations of a table's partition key and a query's GROUP BY key, it may be possible to execute the aggregation for each partition independently.
Then the partially aggregated data from all execution threads does not have to be merged at the end,
because each GROUP BY key value is guaranteed not to appear in the working sets of two different threads.

The typical example is:

``` sql
CREATE TABLE session_log
(
    UserID UInt64,
    SessionID UUID
)
ENGINE = MergeTree
PARTITION BY sipHash64(UserID) % 16
ORDER BY tuple();

SELECT
    UserID,
    COUNT()
FROM session_log
GROUP BY UserID;
```

:::note
The performance of such a query depends heavily on the table layout. Because of that, the optimisation is not enabled by default.
:::

The key factors for good performance:

-   the number of partitions involved in the query should be sufficiently large (more than `max_threads / 2`), otherwise the query will underutilize the machine
-   partitions shouldn't be too small, so that batch processing doesn't degenerate into row-by-row processing
-   partitions should be comparable in size, so that all threads do roughly the same amount of work

:::info
It's recommended to apply a hash function to the columns in the `PARTITION BY` clause in order to distribute data evenly between partitions.
:::

Relevant settings are:

-   `allow_aggregate_partitions_independently` - controls whether the optimisation is enabled
-   `force_aggregate_partitions_independently` - forces the optimisation when it is applicable from the correctness standpoint but would otherwise be disabled by the internal logic that estimates its expediency
-   `max_number_of_partitions_for_independent_aggregation` - hard limit on the maximum number of partitions a table can have for the optimisation to be applied
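
As a sketch, the optimisation can be enabled for a single query on the `session_log` table from the example above. The second query is one way to check that the partitions are comparable in size. The alias `total_rows` is chosen for illustration.

``` sql
-- Enable independent per-partition aggregation for this query only.
SELECT
    UserID,
    count()
FROM session_log
GROUP BY UserID
SETTINGS allow_aggregate_partitions_independently = 1;

-- Compare partition sizes; strongly skewed sizes reduce the benefit.
SELECT
    partition,
    sum(rows) AS total_rows
FROM system.parts
WHERE table = 'session_log' AND active
GROUP BY partition
ORDER BY partition;
```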
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												add slugs

											
										
										
											2022-08-28 14:53:34 +00:00
+								slug: /en/engines/table-engines/mergetree-family/custom-partitioning-key
-												Removed /ja folder, cleaned up /ru markdown

											
										
										
											2022-04-09 13:29:05 +00:00
+								sidebar_position: 30
 								sidebar_label: Custom Partitioning Key
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								# Custom Partitioning Key
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												standardize admonitions

											
										
										
											2023-03-27 18:54:05 +00:00
+								:::note
-												add docs

											
										
										
											2023-01-18 18:36:41 +00:00
+								In most cases you do not need a partition key, and in most other cases you do not need a partition key more granular than by months.
-												Removed /ja folder, cleaned up /ru markdown

											
										
										
											2022-04-09 13:29:05 +00:00
 								You should never use too granular of partitioning. Don't partition your data by client identifiers or names. Instead, make a client identifier or name the first column in the ORDER BY expression.
 								:::
-												Update custom-partitioning-key.md
											
										
										
											2021-09-01 17:53:06 +00:00
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
+								Partitioning is available for the [MergeTree](../../../engines/table-engines/mergetree-family/mergetree.md) family tables (including [replicated](../../../engines/table-engines/mergetree-family/replication.md) tables). [Materialized views](../../../engines/table-engines/special/materializedview.md#materializedview) based on MergeTree tables support partitioning, as well.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Minor improvements in docs build and content (#9752)


											
										
										
											2020-03-19 11:51:22 +00:00
+								A partition is a logical combination of records in a table by a specified criterion. You can set a partition by an arbitrary criterion, such as by month, by day, or by event type. Each partition is stored separately to simplify manipulations of this data. When accessing the data, ClickHouse uses the smallest subset of partitions possible.
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
+								The partition is specified in the `PARTITION BY expr` clause when [creating a table](../../../engines/table-engines/mergetree-family/mergetree.md#table_engine-mergetree-creating-a-table). The partition key can be any expression from the table columns. For example, to specify partitioning by month, use the expression `toYYYYMM(date_column)`:
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								CREATE TABLE visits
 								(
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
+								    VisitDate Date,
 								    Hour UInt8,
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								    ClientID UUID
 								)
 								ENGINE = MergeTree()
-												Doc fix: updating sections about the partitioning (en, ru) (#4677)


											
										
										
											2019-03-18 12:48:06 +00:00
+								PARTITION BY toYYYYMM(VisitDate)
 								ORDER BY Hour;
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								```
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
+								The partition key can also be a tuple of expressions (similar to the [primary key](../../../engines/table-engines/mergetree-family/mergetree.md#primary-keys-and-indexes-in-queries)). For example:
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								ENGINE = ReplicatedCollapsingMergeTree('/clickhouse/tables/name', 'replica1', Sign)
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								PARTITION BY (toMonday(StartDate), EventType)
-												Doc fix: updating sections about the partitioning (en, ru) (#4677)


											
										
										
											2019-03-18 12:48:06 +00:00
+								ORDER BY (CounterID, StartDate, intHash32(UserID));
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								```
-												Doc fix: updating sections about the partitioning (en, ru) (#4677)


											
										
										
											2019-03-18 12:48:06 +00:00
+								In this example, we set partitioning by the event types that occurred during the current week.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												en, ru docs

											
										
										
											2021-05-30 09:24:41 +00:00
+								By default, the floating-point partition key is not supported. To use it enable the setting [allow_floating_point_partition_key](../../../operations/settings/merge-tree-settings.md#allow_floating_point_partition_key).
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								When inserting new data to a table, this data is stored as a separate part (chunk) sorted by the primary key. In 10-15 minutes after inserting, the parts of the same partition are merged into the entire part.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								:::info
-												Removed /ja folder, cleaned up /ru markdown

											
										
										
											2022-04-09 13:29:05 +00:00
+								A merge only works for data parts that have the same value for the partitioning expression. This means **you shouldn’t make overly granular partitions** (more than about a thousand partitions). Otherwise, the `SELECT` query performs poorly because of an unreasonably large number of files in the file system and open file descriptors.
 								:::
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
+								Use the [system.parts](../../../operations/system-tables/parts.md#system_tables-parts) table to view the table parts and partitions. For example, let’s assume that we have a `visits` table with partitioning by month. Let’s perform the `SELECT` query for the `system.parts` table:
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
+								SELECT
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								    partition,
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
+								    name,
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								    active
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
+								FROM system.parts
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								WHERE table = 'visits'
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								```
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` text
-												Update custom-partitioning-key.md
											
										
										
											2022-03-10 21:35:46 +00:00
+								┌─partition─┬─name──────────────┬─active─┐
 								│ 201901    │ 201901_1_3_1      │      0 │
 								│ 201901    │ 201901_1_9_2_11   │      1 │
 								│ 201901    │ 201901_8_8_0      │      0 │
 								│ 201901    │ 201901_9_9_0      │      0 │
 								│ 201902    │ 201902_4_6_1_11   │      1 │
 								│ 201902    │ 201902_10_10_0_11 │      1 │
 								│ 201902    │ 201902_11_11_0_11 │      1 │
 								└───────────┴───────────────────┴────────┘
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								```
-												Update some broken links

The links were linking to the page itself. The information probably has been moved to the SQL reference page.
											
										
										
											2021-12-16 13:27:38 +00:00
+								The `partition` column contains the names of the partitions. There are two partitions in this example: `201901` and `201902`. You can use this column value to specify the partition name in [ALTER … PARTITION](../../../sql-reference/statements/alter/partition.md) queries.
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
-												Update some broken links

The links were linking to the page itself. The information probably has been moved to the SQL reference page.
											
										
										
											2021-12-16 13:27:38 +00:00
+								The `name` column contains the names of the partition data parts. You can use this column to specify the name of the part in the [ALTER ATTACH PART](../../../sql-reference/statements/alter/partition.md#alter_attach-partition) query.
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
-												Update custom-partitioning-key.md
											
										
										
											2022-03-10 21:35:46 +00:00
+								Let’s break down the name of the part: `201901_1_9_2_11`:
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												[experimental] add "es" docs language as machine translated draft (#9787)

* replace exit with assert in test_single_page

* improve save_raw_single_page docs option

* More grammar fixes

* "Built from" link in new tab

* fix mistype

* Example of include in docs

* add anchor to meeting form

* Draft of translation helper

* WIP on translation helper

* Replace some fa docs content with machine translation

* add normalize-en-markdown.sh

* normalize some en markdown

* normalize some en markdown

* admonition support

* normalize

* normalize

* normalize

* support wide tables

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* normalize

* lightly edited machine translation of introdpection.md

* lightly edited machhine translation of lazy.md

* WIP on translation utils

* Normalize ru docs

* Normalize other languages

* some fixes

* WIP on normalize/translate tools

* add requirements.txt

* [experimental] add es docs language as machine translated draft

* remove duplicate script

* Back to wider tab-stop (narrow renders not so well)
											
										
										
											2020-03-21 04:11:51 +00:00
+								-   `201901` is the partition name.
 								-   `1` is the minimum number of the data block.
-												Update custom-partitioning-key.md
											
										
										
											2022-03-10 21:35:46 +00:00
+								-   `9` is the maximum number of the data block.
 								-   `2` is the chunk level (the depth of the merge tree it is formed from).
 								-   `11` is the mutation version (if a part mutated)
-												Fixed russian inclusions into english version of the document.

											
										
										
											2018-01-19 14:36:40 +00:00
-												Removed /ja folder, cleaned up /ru markdown

											
										
										
											2022-04-09 13:29:05 +00:00
+								:::info
 								The parts of old-type tables have the name: `20190117_20190123_2_2_0` (minimum date - maximum date - minimum block number - maximum block number - level).
 								:::
-												Fixed russian inclusions into english version of the document.

											
										
										
											2018-01-19 14:36:40 +00:00
-												Doc fix: updating sections about the partitioning (en, ru) (#4677)


											
										
										
											2019-03-18 12:48:06 +00:00
+								The `active` column shows the status of the part. `1` is active; `0` is inactive. The inactive parts are, for example, source parts remaining after merging to a larger part. The corrupted data parts are also indicated as inactive.
-												[docs] split misc statements (#12403)


											
										
										
											2020-07-11 11:05:49 +00:00
+								As you can see in the example, there are several separated parts of the same partition (for example, `201901_1_3_1` and `201901_1_9_2`). This means that these parts are not merged yet. ClickHouse merges the inserted parts of data periodically, approximately 15 minutes after inserting. In addition, you can perform a non-scheduled merge using the [OPTIMIZE](../../../sql-reference/statements/optimize.md) query. Example:
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								OPTIMIZE TABLE visits PARTITION 201902;
 								```
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` text
-												Update custom-partitioning-key.md
											
										
										
											2022-03-10 21:35:46 +00:00
+								┌─partition─┬─name─────────────┬─active─┐
 								│ 201901    │ 201901_1_3_1     │      0 │
 								│ 201901    │ 201901_1_9_2_11  │      1 │
 								│ 201901    │ 201901_8_8_0     │      0 │
 								│ 201901    │ 201901_9_9_0     │      0 │
 								│ 201902    │ 201902_4_6_1     │      0 │
 								│ 201902    │ 201902_4_11_2_11 │      1 │
 								│ 201902    │ 201902_10_10_0   │      0 │
 								│ 201902    │ 201902_11_11_0   │      0 │
 								└───────────┴──────────────────┴────────┘
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
+								```
-												translate docs/zh/operations/table_engines/custom_partitioning_key.md (#5134)


											
										
										
											2019-04-29 15:08:11 +00:00
+								Inactive parts will be deleted approximately 10 minutes after merging.
 								Another way to view a set of parts and partitions is to go into the directory of the table: `/var/lib/clickhouse/data/<database>/<table>/`. For example:
-												Doc fix: actualizing the partitions description (#4288)


											
										
										
											2019-02-06 13:00:14 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` bash
-												DOCAPI-8530: Code blocks markup fix (#7060)

* Typo fix.

* Links fix.

* Fixed links in docs.

* More fixes.

* docs/en: cleaning some files

* docs/en: cleaning data_types

* docs/en: cleaning database_engines

* docs/en: cleaning development

* docs/en: cleaning getting_started

* docs/en: cleaning interfaces

* docs/en: cleaning operations

* docs/en: cleaning query_lamguage

* docs/en: cleaning en

* docs/ru: cleaning data_types

* docs/ru: cleaning index

* docs/ru: cleaning database_engines

* docs/ru: cleaning development

* docs/ru: cleaning general

* docs/ru: cleaning getting_started

* docs/ru: cleaning interfaces

* docs/ru: cleaning operations

* docs/ru: cleaning query_language

* docs: cleaning interfaces/http

* Update docs/en/data_types/array.md

decorated ```

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/getting_started/example_datasets/nyc_taxi.md

fixed typo

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/getting_started/example_datasets/ontime.md

fixed typo

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/interfaces/formats.md

fixed error

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/operations/table_engines/custom_partitioning_key.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/operations/utils/clickhouse-local.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/dicts/external_dicts_dict_sources.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/operations/utils/clickhouse-local.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/functions/json_functions.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/functions/json_functions.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/functions/other_functions.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/functions/other_functions.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/query_language/functions/date_time_functions.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* Update docs/en/operations/table_engines/jdbc.md

Co-Authored-By: BayoNet <da-daos@yandex.ru>

* docs: fixed error

* docs: fixed error

											
										
										
											2019-09-23 15:31:46 +00:00
/var/lib/clickhouse/data/default/visits$ ls -l
total 40
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  1 16:48 201901_1_3_1
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201901_1_9_2_11
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 15:52 201901_8_8_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 15:52 201901_9_9_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201902_10_10_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:17 201902_11_11_0
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 16:19 201902_4_11_2_11
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  5 12:09 201902_4_6_1
drwxr-xr-x 2 clickhouse clickhouse 4096 Feb  1 16:48 detached
```
The folders `201901_1_3_1`, `201901_1_9_2_11` and so on are the directories of the parts. Each part belongs to a corresponding partition and contains data for just one month (the table in this example is partitioned by month).

The `detached` directory contains parts that were detached from the table using the [DETACH](../../../sql-reference/statements/alter/partition.md#alter_detach-partition) query. Corrupted parts are also moved to this directory instead of being deleted. The server does not use the parts from the `detached` directory. You can add, delete, or modify the data in this directory at any time; the server will not know about it until you run the [ATTACH](../../../sql-reference/statements/alter/partition.md#alter_attach-partition) query.

Note that on a running server you cannot manually change the set of parts or their data on the file system, because the server will not know about it. For non-replicated tables, you can do this when the server is stopped, but it is not recommended. For replicated tables, the set of parts cannot be changed in any case.

ClickHouse allows you to perform operations with partitions: delete them, copy them from one table to another, or create a backup. See the list of all operations in the section [Manipulations With Partitions and Parts](../../../sql-reference/statements/alter/partition.md#alter_manipulations-with-partitions).
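As a rough illustration of the `detached` workflow described above, the following sketch detaches one monthly partition of the `visits` table, inspects what the server sees in `detached`, and attaches the partition back. The partition ID `201901` and the `default` database are taken from the directory listing; adjust them to your own table.

``` sql
-- Move every part of the January 2019 partition into the `detached` directory.
ALTER TABLE visits DETACH PARTITION 201901;

-- List the parts currently sitting in `detached` for this table.
SELECT partition_id, name, reason
FROM system.detached_parts
WHERE database = 'default' AND table = 'visits';

-- Attach the partition back so that the server uses its data again.
ALTER TABLE visits ATTACH PARTITION 201901;
```

Between the two `ALTER` queries, a `SELECT` from `visits` should not return rows from the detached month, since the server ignores everything under `detached`.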
## Group By optimisation using partition key

For some combinations of a table's partition key and a query's GROUP BY key, it may be possible to execute aggregation for each partition independently.
In that case there is no need to merge partially aggregated data from all execution threads at the end,
because each GROUP BY key value is guaranteed not to appear in the working sets of two different threads.

The typical example is:
``` sql
CREATE TABLE session_log
(
    UserID UInt64,
    SessionID UUID
)
ENGINE = MergeTree
PARTITION BY sipHash64(UserID) % 16
ORDER BY tuple();

SELECT
    UserID,
    COUNT()
FROM session_log
GROUP BY UserID;
```
:::note
Performance of such a query heavily depends on the table layout. Because of that, the optimisation is not enabled by default.
:::

The key factors for good performance:

-   the number of partitions involved in the query should be sufficiently large (more than `max_threads / 2`), otherwise the query will underutilize the machine
-   partitions shouldn't be too small, so that batch processing doesn't degenerate into row-by-row processing
-   partitions should be comparable in size, so that all threads do roughly the same amount of work

:::info
It's recommended to apply some hash function to the columns in the `PARTITION BY` clause in order to distribute data evenly between partitions.
:::

Relevant settings are:

-   `allow_aggregate_partitions_independently` - controls whether the optimisation is enabled
-   `force_aggregate_partitions_independently` - forces the optimisation when it is applicable from the correctness standpoint but would otherwise be disabled by the internal logic that estimates its expediency
-   `max_number_of_partitions_for_independent_aggregation` - hard limit on the maximum number of partitions the table can have
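A minimal sketch of how these pieces might be used together, assuming the `session_log` table from the example above lives in the current database. The first query checks whether the partitions are numerous enough and comparable in size; the second opts in to the optimisation for a single query. The aliases `total_rows` and `total_size` are illustrative names.

``` sql
-- Inspect the number and size of the active partitions.
SELECT
    partition,
    sum(rows) AS total_rows,
    formatReadableSize(sum(bytes_on_disk)) AS total_size
FROM system.parts
WHERE database = currentDatabase() AND table = 'session_log' AND active
GROUP BY partition
ORDER BY total_rows DESC;

-- Enable independent aggregation of partitions for this query only.
SELECT
    UserID,
    COUNT()
FROM session_log
GROUP BY UserID
SETTINGS allow_aggregate_partitions_independently = 1;
```

Whether the second query actually runs faster depends on the factors listed above, so it is worth comparing timings with and without the setting on your own data.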