Commit Graph

1642 Commits

Author SHA1 Message Date
Robert Schulze
dd484fc312
Simplify what happens in SELECT all, ... [...] ORDER BY ALL
Previously setting `enable_order_by_all` distinguished for ORDER BY ALL
whether we should sort by column 'all' (if given in the SELECT clause)
or by all columns. The actual behavior was not always intuitive.

Now, we throw unconditionally an exception which also simplifies the
handling a bit. Only an edge case is affected and if users really want
to run ORDER BY ALL on a column names 'all', they can alias it.
2024-02-02 09:53:35 +00:00
Julia Kartseva
5b07039ab2 Update settings.md 2024-02-01 22:45:10 +00:00
Shaun Struwig
e081a4f059
Merge branch 'master' into #31363_format_template_configure_in_settings 2024-01-29 21:29:22 +01:00
Blargian
4a8a7208f2 rename of settings, add setting for resultset, extend test, fix documentation and add to SettingsChanges log 2024-01-29 21:25:58 +01:00
Nikolai Kochetov
a51aa7b668
Merge pull request #59238 from kitaisreal/update-insert-deduplication-token-in-dependent-materialized-views
Added setting update_insert_deduplication_token_in_dependent_materialized_views
2024-01-29 12:52:06 +01:00
Maksim Kita
4a10b71252 Updated documentation 2024-01-27 15:57:49 +03:00
Maksim Kita
4ae425461f Added documentation 2024-01-27 15:57:49 +03:00
avogar
5833641fa5 Merge branch 'master' of github.com:ClickHouse/ClickHouse into variant-data-type 2024-01-26 16:54:02 +00:00
avogar
ff15ab12b8 Remove bad implementation of DataTypeVariant::createColumnConst, support use_variant_as_common_type setting in functions array/map 2024-01-25 21:06:50 +00:00
avogar
4b26de9660 Rename setting to use it not only in if/multiIf in future 2024-01-25 13:26:13 +00:00
Mikhail f. Shiryaev
38898575e5
Merge branch 'master' into #31363_format_template_configure_in_settings 2024-01-24 20:49:38 +01:00
Alex Cheng
3e795ccb01
Merge branch 'ClickHouse:master' into master 2024-01-24 21:11:03 +08:00
Alex Cheng
10aaf2cbe4
fix the default number of async_insert_max_data_size 2024-01-24 21:10:48 +08:00
avogar
1fd4b26c3c Better docs 2024-01-23 17:48:50 +00:00
avogar
be3b5dc45f Allow to ignore schema evolution in Iceberg table engine under a setting 2024-01-23 17:45:41 +00:00
jsc0218
6f3124acdb
Merge pull request #58402 from zhangyifan27/add_max_estimated_execution_time
Seperate max_execution_time and max_estimated_execution_time
2024-01-21 19:57:06 -05:00
Blargian
f1749217ee added format_schema_rows_template setting 2024-01-18 21:53:56 +02:00
Kruglov Pavel
5444cde408
Merge branch 'master' into variant-data-type 2024-01-18 18:31:27 +01:00
Kruglov Pavel
6d064512e1
Merge pull request #58614 from Blargian/58363_disable_ansi_pretty_automatically
58363 Automatically disable ANSI escape sequences in Pretty formats if the output is not a terminal
2024-01-17 13:45:41 +01:00
Blargian
72b5cf5993 #58363 - removed switch from PrettyBlockOutputFormat and modified BlockOutputFormats to use color variable. Updated english and russian documentation. Updated test 00405 reference file. 2024-01-12 19:46:03 +02:00
Ilya Yatsishin
1b548d3597
Update query-complexity.md to rerun tests 2024-01-12 20:44:46 +04:00
avogar
bfd7db8cf4 Better docs 2024-01-12 13:06:51 +00:00
avogar
fbfdde60a7 Add settings for better control of indexes type in Arrow dictionary. Use signed integer type for indexes by default 2024-01-12 13:06:51 +00:00
johnnymatthews
ebd7f19670 Example to show settings changed from default. 2024-01-10 12:38:38 -04:00
Robert Schulze
4cacfa6b43
Merge pull request #57509 from jinjunzh/qatzstd_main
Add Intel-QAT-based `ZSTD_QAT` compression codec
2024-01-10 10:31:39 +01:00
Kseniia Sumarokova
1b517105db
Merge pull request #57323 from valbok/mysql-binlog-client
MaterializedMySQL: Introduce MySQL Binlog Client
2024-01-10 09:08:13 +01:00
Robert Schulze
ced9e93ac6
Merge remote-tracking branch 'rschu1ze/master' into qatzstd_main 2024-01-09 20:36:02 +00:00
pufit
6cf55b82f4
Merge pull request #58539 from canhld94/file_custom_compress_level
Allow explicitly set compression level in output format
2024-01-09 13:43:38 -05:00
Duc Canh Le
37a7ce2950
Apply suggestions from code review
Co-authored-by: pufit <pufit@yandex.ru>
2024-01-09 17:17:30 +08:00
Kruglov Pavel
b947609b8e
Merge branch 'master' into variant-data-type 2024-01-08 15:04:51 +01:00
Robert Schulze
0f51e32bb5
Rename from ZSTDQPL to ZSTD_QPL 2024-01-07 17:47:31 +00:00
zhangyifan27
730135f933 Seperate max_execution_time and max_estimated_execution_time 2024-01-07 12:32:42 +08:00
avogar
7e5ba62017 Allow to read Bool values into String in JSON input formats 2024-01-05 20:33:30 +00:00
Duc Canh Le
c3586b2e35 fix docs
Signed-off-by: Duc Canh Le <duccanh.le@ahrefs.com>
2024-01-05 07:23:22 +00:00
Duc Canh Le
934e006213 add document
Signed-off-by: Duc Canh Le <duccanh.le@ahrefs.com>
2024-01-05 07:13:46 +00:00
Alexander Tokmakov
b1688b1953 Merge branch 'master' into rdb_dont_wait_inactive 2024-01-03 17:03:01 +01:00
Val Doroshchuk
5c221d123d MaterializedMySQL: Introduce MySQL Binlog Client
One binlog connection for many databases.

Suggesting to disable this feature by default for now. It should be explicitly enabled by SETTINGS use_binlog_client=1.
But if you would permanently enable it in MaterializedMySQLSettings, it should keep old behavior and all tests should pass too.

1. Introduced `IBinlog` and its impl to read the binlog events from socket - `BinlogFromSocket`, or file - `BinlogFromFile`. Based on prev impl of `EventBase` and the same old binlog parsers. It fully keeps BC with old version. Fixed `./check-mysql-binlog` to test new impl.
2. Introduced `BinlogEventsDispatcher`, it reads the event from the source `IBinlog` and sends it to currently attached `IBinlog` instances.
3. Introduced `BinlogClient`, which is used to group a list of `BinlogEventsDispatcher` by MySQL binlog connection which is defined by `user:password@host:port`. All dispatchers with the same binlog position should be merged to one.
4. Introduced `BinlogClientFactory`, which is a singleton and it is used to track all binlogs created over the instance.
5. Introduced `use_binlog_client` setting to `MaterializedMySQL`, which forces to reuse a `BinlogClient` if it already exists in `BinlogClientCatalog` or create new one. By default, it is disabled.
6. Introduced `max_bytes_in_binlog_queue` setting to define the limit of bytes in binlog's queue of events. If bytes in the queue increases this limit, `BinlogEventsDispatcher` will stop reading new events from source `IBinlog` until the space for new events will be freed.
7. Introduced `max_milliseconds_to_wait_in_binlog_queue` setting to define max ms to wait when the max bytes exceeded.
7. Introduced `max_milliseconds_to_wait_in_binlog_queue` setting to define max ms to wait when the max bytes exceeded.
8. Introduced `max_bytes_in_binlog_dispatcher_buffer` setting to define max bytes in the binlog dispatcher's buffer before it is flushed to attached binlogs.
9. Introduced `max_flush_milliseconds_in_binlog_dispatcher` setting to define max milliseconds in the binlog dispatcher's buffer to wait before it is flushed to attached binlogs.
10. Introduced `system.mysql_binlogs` system table, which shows a list of active binlogs.
11. Introduced `UnparsedRowsEvent` and `MYSQL_UNPARSED_ROWS_EVENT`, which defines that an event is not parsed and should be explicitly parsed later.
12. Fixed bug when not possible to apply DDL since syntax error or unsupported SQL.

@larspars is the author of following:
`GTIDSets::contains()`
`ReplicationHelper`
`shouldReconnectOnException()`
2024-01-03 15:26:09 +01:00
Kruglov Pavel
4d8cf71ba7
Merge branch 'master' into variant-data-type 2024-01-03 15:21:23 +01:00
Alexey Milovidov
7bded0a5e7
Merge pull request #58316 from ClickHouse/reintroduce_is_deleted
Re-introduce `is_deleted` column for ReplacingMergeTree
2023-12-31 00:57:19 +01:00
robot-ch-test-poll4
09ffe8ffcc
Merge pull request #57837 from ClickHouse/pufit/docs-patch-roles
Added information about preconfigured roles in users.xml to the docs
2023-12-30 13:09:50 +01:00
Alexander Tokmakov
bfc10bd234 an option to avoid waiting for inactive Replicated db replicas 2023-12-29 15:19:16 +01:00
Alexander Tokmakov
bdada351c8 Revert "Merge pull request #58274 from ClickHouse/revert-58267"
This reverts commit 583b9637c2, reversing
changes made to 224e937620.
2023-12-28 14:07:59 +01:00
Alexey Milovidov
c7efd2afea Revert #58267 2023-12-28 04:09:33 +01:00
Alexander Tokmakov
01d042c490 Revert "Merge pull request #57932 from ClickHouse/remove-shit-cleanup"
This reverts commit 2d58dc512c, reversing
changes made to 41873dc4a3.
2023-12-27 13:46:06 +01:00
Christoph Wurm
9def89d416 Fix anchors to settings.md 2023-12-20 18:26:36 +00:00
Kruglov Pavel
087946f385
Merge branch 'master' into variant-data-type 2023-12-20 00:40:18 +01:00
Jordi Villar
7c281d9a6f
Allow max_size_to_drop settings in query time (#57452) 2023-12-19 23:16:03 +01:00
avogar
319ae440b6 Implement Variant data type 2023-12-19 16:45:15 +00:00
Robert Schulze
952175ce39
Merge pull request #57875 from zhongyuankai/order_by_all
Support `ORDER BY ALL`
2023-12-19 13:38:07 +01:00
vdimir
252cb8a507
Merge pull request #54602 from fionera/issue-49028
Add setting http_make_head_request
2023-12-19 13:33:59 +01:00
Christoph Wurm
e357c088a0 Update settings.md: allow_experimental_parallel_reading_from_replicas
Document all possible values for `allow_experimental_parallel_reading_from_replicas`.
2023-12-18 16:55:44 +00:00
Robert Schulze
9033c96e29
Some fixups 2023-12-17 09:11:16 +00:00
Alexey Milovidov
ee502ef6b9 Remove trash (OPTIMIZE CLEANUP) 2023-12-15 22:41:58 +01:00
pufit
f49894f644
Added information about preconfigured roles in users.xml to the docs 2023-12-13 23:28:21 -05:00
Kruglov Pavel
6567fb2c08
Merge pull request #56859 from Avogar/csv-infer-numbers-from-strings
Allow to infer numbers from strings in CSV format
2023-12-12 17:09:02 +01:00
avogar
1db000dae5 Add docs 2023-12-11 18:02:05 +00:00
avogar
ee7af95bc0 Merge branch 'master' of github.com:ClickHouse/ClickHouse into schema-inference-union 2023-12-08 20:29:28 +00:00
hdhoang
fc8292182c
fix typo TKSV
correct name is TS-KV
2023-12-07 20:19:00 +07:00
Sema Checherinda
c61cd6aab2
Merge pull request #57335 from sichenzhao/increase_default_replicated_deduplication_window
increase default replicated_deduplication_window from 100 to 1k
2023-12-02 21:23:39 +01:00
Sichen Zhao
1b8b1c0a63 increase default replicated_deduplication_window from 100 to 1k 2023-11-28 14:24:06 -08:00
Han Fei
7bd2ea3add Merge branch 'master' into hanfei/statistic 2023-11-27 23:31:23 +01:00
Robert Schulze
ca63881357
Mention that the settings are dev settings 2023-11-24 14:46:02 +00:00
vdimir
1044e3366b
Update http_make_head_request doc 2023-11-24 09:32:20 +00:00
Tim Windelschmidt
51e3899f73
Add setting http_make_head_request
Clickhouse always does a HEAD request before actually executing the GET request,
this adds a settings to skip that request.
2023-11-24 09:32:16 +00:00
Robert Schulze
a8a3214185
+ documentation 2023-11-23 19:43:40 +00:00
avogar
f537bad469 Merge branch 'master' of github.com:ClickHouse/ClickHouse into schema-inference-union 2023-11-20 14:32:50 +00:00
avogar
872556a5d4 Merge branch 'master' of github.com:ClickHouse/ClickHouse into schema-inference-union 2023-11-20 14:03:36 +00:00
Sema Checherinda
f999337dae
Revert "Revert "s3 adaptive timeouts"" 2023-11-20 14:53:22 +01:00
Alexander Tokmakov
5031f239c3
Revert "s3 adaptive timeouts" 2023-11-20 14:28:59 +01:00
Sema Checherinda
a950595c24
Merge pull request #56314 from CheSema/s3-aggressive-timeouts
s3 adaptive timeouts
2023-11-19 14:12:14 +01:00
Justin de Guzman
96e87322b5
[Docs] Add insert_distributed_sync to Core Settings docs 2023-11-18 18:10:43 -08:00
Alexey Milovidov
0158db3e07
Merge pull request #56092 from ClickHouse/docs-max-block-size
Set correct default `max_block_size` value in docs
2023-11-18 17:23:43 +01:00
Kruglov Pavel
8e88bf1348 Revert "Revert "Add a setting max_execution_time_leaf to limit the execution time on shard for distributed query"" 2023-11-14 11:40:48 +00:00
Sema Checherinda
76d11687a7 adjuct docs 2023-11-14 11:34:12 +01:00
Sema Checherinda
e0edd165da add doc 2023-11-14 11:34:12 +01:00
Robert Schulze
361a3ca11f
Merge pull request #56519 from rschu1ze/qc-ignore-nondeterministic-queries
Query cache: Allow to ignore non-deterministic queries
2023-11-13 21:32:44 +01:00
Alexander Tokmakov
0622379ec7
Revert "Add a setting max_execution_time_leaf to limit the execution time on shard for distributed query" 2023-11-13 21:11:07 +01:00
Robert Schulze
7864df4826
Update docs + Try to stabilize test results, pt. II 2023-11-13 10:57:33 +00:00
Robert Schulze
91f53514d9
Incorporate review feedback 2023-11-12 15:11:38 +00:00
Robert Schulze
52c825db44
Query cache: Allow to ignore non-deterministic queries
Fixes: #56504
2023-11-12 15:11:38 +00:00
Yarik Briukhovetskyi
b8ea31db43
Merge pull request #56309 from yariks5s/preferred_optimize_projection_name
New setting preferred_projection_name
2023-11-08 18:06:03 +01:00
Sema Checherinda
02df5ac59c
Merge pull request #54340 from KevinyhZou/bug_fix_csv_parse_crlf
Fix CSV read while the end of line is CR(\r)
2023-11-07 12:51:05 +01:00
Sema Checherinda
a5e17dc919
Merge pull request #55566 from CheSema/remote-blobs-naming
feature: storage_metadata_write_full_object_key
2023-11-06 16:41:49 +01:00
kevinyhzou
2a50daf5dd Allow cr at end of csv line 2023-11-06 12:21:42 +08:00
kevinyhzou
ef30e6723d bug fix csv read while end of line is not crlf 2023-11-06 12:21:42 +08:00
Robert Schulze
049c811a38
Make type names in SHOW COLUMNS dependent only on connection type 2023-11-05 21:20:20 +00:00
yariks5s
d95262622b init 2023-11-03 16:47:06 +00:00
Denny Crane
dbaebe18f1
Update settings.md 2023-11-03 13:14:56 -03:00
Denny Crane
9dbd229462
Update settings.md 2023-11-03 13:09:51 -03:00
Kruglov Pavel
81b1ca22bb
Merge pull request #51823 from canhld94/ch_add_max_execution_time_leaf
Add a setting max_execution_time_leaf to limit the execution time on shard for distributed query
2023-11-03 13:31:15 +01:00
Duc Canh Le
dd6b02a2fd add document
Signed-off-by: Duc Canh Le <duccanh.le@ahrefs.com>
2023-11-03 03:00:13 +00:00
Sema Checherinda
90b64bcdb9 backward compatibility and implementation feature storage_metadata_write_full_object_key 2023-11-02 15:20:55 +01:00
Alexey Milovidov
1267246785
Merge pull request #55642 from ClickHouse/morton-utils
Indexing by space-filling curves
2023-11-02 12:21:33 +01:00
robot-ch-test-poll1
a5fb7b0a1e
Merge pull request #56213 from ttanay/sort-spill-to-disk-docs
Add `max_bytes_before_external_sort` setting to query complexity doc
2023-11-02 00:50:30 +01:00
Tanay Tummalapalli
6a3b2a04d8 Add max_bytes_before_external_sort setting to query complexity doc
The setting should be mentioned here along with
`max_bytes_before_external_group_by` setting.

Note: The russian translation is courtesy of Google Translate.
2023-11-01 22:33:25 +05:30
Azat Khuzhin
c25d6cd624
Rename directory monitor concept into background INSERT (#55978)
* Limit log frequence for "Skipping send data over distributed table" message

After SYSTEM STOP DISTRIBUTED SENDS it will constantly print this
message.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename directory monitor concept into async INSERT

Rename the following query settings (with preserving backward
compatiblity, by keeping old name as an alias):
- distributed_directory_monitor_sleep_time_ms -> distributed_async_insert_sleep_time_ms
- distributed_directory_monitor_max_sleep_time_ms -> distributed_async_insert_max_sleep_time_ms
- distributed_directory_monitor_batch -> distributed_async_insert_batch_inserts
- distributed_directory_monitor_split_batch_on_failure -> distributed_async_insert_split_batch_on_failure

Rename the following table settings (with preserving backward
compatiblity, by keeping old name as an alias):
- monitor_batch_inserts -> async_insert_batch
- monitor_split_batch_on_failure -> async_insert_split_batch_on_failure
- directory_monitor_sleep_time_ms -> async_insert_sleep_time_ms
- directory_monitor_max_sleep_time_ms -> async_insert_max_sleep_time_ms

And also update all the references:

    $ gg -e directory_monitor_ -e monitor_ tests docs | cut -d: -f1 | sort -u | xargs sed -e 's/distributed_directory_monitor_sleep_time_ms/distributed_async_insert_sleep_time_ms/g' -e 's/distributed_directory_monitor_max_sleep_time_ms/distributed_async_insert_max_sleep_time_ms/g' -e 's/distributed_directory_monitor_batch_inserts/distributed_async_insert_batch/g' -e 's/distributed_directory_monitor_split_batch_on_failure/distributed_async_insert_split_batch_on_failure/g' -e 's/monitor_batch_inserts/async_insert_batch/g' -e 's/monitor_split_batch_on_failure/async_insert_split_batch_on_failure/g' -e 's/monitor_sleep_time_ms/async_insert_sleep_time_ms/g' -e 's/monitor_max_sleep_time_ms/async_insert_max_sleep_time_ms/g' -i

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename async_insert for Distributed into background_insert

This will avoid amigibuity between general async INSERT's and INSERT
into Distributed, which are indeed background, so new term express it
even better.

Mostly done with:

    $ git di HEAD^ --name-only | xargs sed -i -e 's/distributed_async_insert/distributed_background_insert/g' -e 's/async_insert_batch/background_insert_batch/g' -e 's/async_insert_split_batch_on_failure/background_insert_split_batch_on_failure/g' -e 's/async_insert_sleep_time_ms/background_insert_sleep_time_ms/g' -e 's/async_insert_max_sleep_time_ms/background_insert_max_sleep_time_ms/g'

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Mark 02417_opentelemetry_insert_on_distributed_table as long

CI: https://s3.amazonaws.com/clickhouse-test-reports/55978/7a6abb03a0b507e29e999cb7e04f246a119c6f28/stateless_tests_flaky_check__asan_.html
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

---------

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-11-01 15:09:39 +01:00
Alexey Milovidov
016285ea87
Merge branch 'master' into morton-utils 2023-10-31 01:47:16 +01:00
yariks5s
0846714052 added docs, changed tests 2023-10-30 17:17:53 +00:00
Han Fei
19cdc6dfe2 Merge branch 'master' into hanfei/statistic 2023-10-30 04:53:18 +01:00
Justin de Guzman
940d099e84
Set correct max_block_size value in docs 2023-10-27 16:50:34 -07:00