Commit Graph

1158 Commits

Author SHA1 Message Date
Robert Schulze
7f675ddf80
Merge pull request #57878 from Jpnock/master
Apply full-text skipping index when using `hasAny()`
2023-12-17 11:06:29 +01:00
Alexey Milovidov
ee502ef6b9 Remove trash (OPTIMIZE CLEANUP) 2023-12-15 22:41:58 +01:00
James Nock
2c24e438aa Add docs for skip index support of hasXXX() funcs 2023-12-15 17:28:45 +00:00
Johnny
23ba3b5fa5
Merge pull request #57488 from ClickHouse/add/no-cloud-distributed-table-engine
Adds callout that distributed table engine that it is not supported in Cloud.
2023-12-11 17:06:12 -04:00
Alexey Milovidov
05bc8ef1e0
Merge pull request #57548 from ClickHouse/s3queue_production_ready
`S3Queue` is production ready
2023-12-06 22:21:08 +01:00
Alexey Milovidov
b91e0f89ec Address review comments 2023-12-06 12:24:58 +01:00
Alexey Milovidov
0b597eadf8 Clarify the usage of prefer_not_to_merge option 2023-12-06 12:11:03 +01:00
johnnymatthews
e9d9048903 Changes 'cannot run on cloud' message. 2023-12-05 17:14:10 -04:00
Alexey Milovidov
76bbeaa676 S3Queue is production ready 2023-12-05 21:10:19 +01:00
johnnymatthews
c6ca43b341 Moves self-hosted-only box under page title. 2023-12-04 18:05:34 -04:00
johnnymatthews
06a52e1452 Merge branch 'master' into add/no-cloud-distributed-table-engine 2023-12-04 18:00:40 -04:00
johnnymatthews
40062405fb Adds 'not available on cloud' to Distributed Table Engine. 2023-12-04 17:59:11 -04:00
johnnymatthews
e2eb47b2ec Reverts last commit. 2023-12-04 17:58:19 -04:00
johnnymatthews
7ce33b0737 Adds 'not available on cloud' to Distributed Table Engine. 2023-12-04 17:57:33 -04:00
Han Fei
7bd2ea3add Merge branch 'master' into hanfei/statistic 2023-11-27 23:31:23 +01:00
vdimir
bbc202a891
Merge pull request #56816 from AVMusorin/improve-settings-rocksdb
Show total_bytes and total_rows in system tables for RocksDB storage
2023-11-27 11:42:15 +01:00
Kruglov Pavel
b10e46b2bc
Merge pull request #57126 from Avogar/size-virtual-column
Add _size virtual column to s3/file/hdfs/url/azureBlobStorage engines
2023-11-27 10:12:18 +01:00
Nikolay Degterinsky
de0876ed68
Merge pull request #57234 from evillique/fix-docs-anchors
Fix docs
2023-11-27 01:22:43 +01:00
Nikolay Degterinsky
faa3a43803 Fix docs 2023-11-26 19:21:39 +00:00
Kseniia Sumarokova
13187bf4f1
Merge pull request #56959 from melvynator/master
Add experimental flag for MaterializedPostgreSQL table engine.
2023-11-24 16:16:42 +01:00
avogar
6b64efb55a Add docs 2023-11-22 18:21:30 +00:00
Aleksandr Musorin
ee0ae17ce2 Added optimize_trivial_approximate_count_query documentatiosn for RocksDB 2023-11-21 11:31:56 +01:00
robot-ch-test-poll
e71ca7dcef
Merge pull request #56985 from czlee/patch-1
[Docs] MaterializedPostgreSQL: Change DETACH to DETACH PERMANENTLY
2023-11-20 13:33:52 +01:00
Chuan-Zheng Lee
96c603ef97
[Docs] MaterializedPostgreSQL: Change DETACH to DETACH PERMANENTLY
If I'm not mistaken, ClickHouse/ClickHouse#35158 changed the syntax for dynamically removing tables from MaterializedPostgreSQL databases from `DETACH` to `DETACH PERMANENTLY`. Currently when just running `DETACH TABLE postgres_database.table_to_remove`, it shows an error:
```
DETACH TABLE not allowed, use DETACH PERMANENTLY. (NOT_IMPLEMENTED)
```

This adds the keyword `PERMANENTLY` to both places where `DETACH` occurs on the MaterializedPostgreSQL database engine page.
2023-11-20 18:15:14 +13:00
Peignon Melvyn
f6e4c29669
MaterializedMysql doc
Add experimental flag for materializedMysql
2023-11-20 01:49:17 +01:00
robot-ch-test-poll
4fa085d7e2
Merge pull request #56965 from ClickHouse/docs-replacingmergetree-freq-updates
[Docs] Recommend ReplacingMergeTree for frequent updates
2023-11-19 10:56:47 +01:00
Justin de Guzman
f29777db2d
[Docs] More typos 2023-11-18 18:41:20 -08:00
Justin de Guzman
a398e3f51e
[Docs] Fix typo 2023-11-18 18:40:14 -08:00
Justin de Guzman
8062fb578c
[Docs] Recommend ReplacingMergeTree for frequent updates 2023-11-18 18:38:49 -08:00
Justin de Guzman
5e1da38720
[Docs] Add details on why partitions improve query perf 2023-11-18 18:28:29 -08:00
Ubuntu
16e7bd201b Add experimental flag for PostgresSQL table engine. Also add documentation on how to enable it 2023-11-18 18:22:33 +00:00
avogar
58b9bde0fa Better docs for virtual columns in Kafka/RabbitMQ/NATS/FileLog 2023-11-14 21:15:30 +00:00
justindeguzman
f3b0550dd3 [Docs] Add details about sharding_key for distributed table engine 2023-11-12 19:43:43 -08:00
Peignon Melvyn
ba87360114
Mark MaterializedPostreSQL as experimental in docs (#56462) 2023-11-08 20:26:19 +01:00
Azat Khuzhin
c25d6cd624
Rename directory monitor concept into background INSERT (#55978)
* Limit log frequence for "Skipping send data over distributed table" message

After SYSTEM STOP DISTRIBUTED SENDS it will constantly print this
message.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename directory monitor concept into async INSERT

Rename the following query settings (with preserving backward
compatiblity, by keeping old name as an alias):
- distributed_directory_monitor_sleep_time_ms -> distributed_async_insert_sleep_time_ms
- distributed_directory_monitor_max_sleep_time_ms -> distributed_async_insert_max_sleep_time_ms
- distributed_directory_monitor_batch -> distributed_async_insert_batch_inserts
- distributed_directory_monitor_split_batch_on_failure -> distributed_async_insert_split_batch_on_failure

Rename the following table settings (with preserving backward
compatiblity, by keeping old name as an alias):
- monitor_batch_inserts -> async_insert_batch
- monitor_split_batch_on_failure -> async_insert_split_batch_on_failure
- directory_monitor_sleep_time_ms -> async_insert_sleep_time_ms
- directory_monitor_max_sleep_time_ms -> async_insert_max_sleep_time_ms

And also update all the references:

    $ gg -e directory_monitor_ -e monitor_ tests docs | cut -d: -f1 | sort -u | xargs sed -e 's/distributed_directory_monitor_sleep_time_ms/distributed_async_insert_sleep_time_ms/g' -e 's/distributed_directory_monitor_max_sleep_time_ms/distributed_async_insert_max_sleep_time_ms/g' -e 's/distributed_directory_monitor_batch_inserts/distributed_async_insert_batch/g' -e 's/distributed_directory_monitor_split_batch_on_failure/distributed_async_insert_split_batch_on_failure/g' -e 's/monitor_batch_inserts/async_insert_batch/g' -e 's/monitor_split_batch_on_failure/async_insert_split_batch_on_failure/g' -e 's/monitor_sleep_time_ms/async_insert_sleep_time_ms/g' -e 's/monitor_max_sleep_time_ms/async_insert_max_sleep_time_ms/g' -i

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename async_insert for Distributed into background_insert

This will avoid amigibuity between general async INSERT's and INSERT
into Distributed, which are indeed background, so new term express it
even better.

Mostly done with:

    $ git di HEAD^ --name-only | xargs sed -i -e 's/distributed_async_insert/distributed_background_insert/g' -e 's/async_insert_batch/background_insert_batch/g' -e 's/async_insert_split_batch_on_failure/background_insert_split_batch_on_failure/g' -e 's/async_insert_sleep_time_ms/background_insert_sleep_time_ms/g' -e 's/async_insert_max_sleep_time_ms/background_insert_max_sleep_time_ms/g'

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Mark 02417_opentelemetry_insert_on_distributed_table as long

CI: https://s3.amazonaws.com/clickhouse-test-reports/55978/7a6abb03a0b507e29e999cb7e04f246a119c6f28/stateless_tests_flaky_check__asan_.html
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

---------

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-11-01 15:09:39 +01:00
Kruglov Pavel
5533258091
Merge pull request #55477 from Avogar/streaming-errors
Allow to save unparsed records and errors in RabbitMQ, NATS and FileLog engines
2023-11-01 14:35:35 +01:00
Han Fei
19cdc6dfe2 Merge branch 'master' into hanfei/statistic 2023-10-30 04:53:18 +01:00
Han Fei
df1e019268 address comments 2023-10-30 00:39:16 +01:00
Azat Khuzhin
c7f392500e Fix incorrect free space accounting for least_used JBOD policy
Before least_used fails to detect when the disk started to have more
space, it works only when the disk starts to have less space.

The reason for this is that it uses priority_queue, and once the disk
goes at the bottom of the queue, free space will not be updated for it
until it will be selected again.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-29 17:36:31 +01:00
Rory Crispin
b817ab45bd
Fix dead link to Virtual Columns 2023-10-27 12:03:15 +02:00
Kruglov Pavel
14c97fb426
Update docs/en/engines/table-engines/mergetree-family/mergetree.md 2023-10-23 12:48:05 +02:00
Srikanth Chekuri
327386a348 Merge remote-tracking branch 'upstream/master' into issue-55023 2023-10-22 10:22:14 +05:30
Srikanth Chekuri
0f2e0ade99 Update merge tree doc 2023-10-21 21:51:10 +05:30
Kseniia Sumarokova
4e0122a299
Merge pull request #54422 from kssenii/s3-queue-fixes
Fixes for storage `S3Queue`
2023-10-18 21:51:51 +02:00
Kseniia Sumarokova
7cedfeff12
Merge pull request #55145 from kssenii/materialized-pg-allow-unique-identifier
Allow to have several MaterializedPostgreSQL tables following the same Postgres table
2023-10-17 18:47:32 +02:00
Han Fei
4b23142775
Merge branch 'master' into hanfei/statistic 2023-10-17 16:40:31 +02:00
kssenii
fd37e25fbd Fix doc 2023-10-17 15:07:58 +02:00
kssenii
4d2cf5206f Update documentation 2023-10-17 14:12:15 +02:00
Robert Schulze
96a82df75a
Merge pull request #55616 from HarryLeeIBM/hlee-fts-max-rows
Inverted index: Replace setting density with `max_rows_in_postings_list`
2023-10-17 11:36:46 +02:00
HarryLeeIBM
fb8cd298d1 Code refactoring according to code review 2023-10-16 17:52:39 -07:00