Commit Graph

8813 Commits

Author SHA1 Message Date
Robert Schulze
97d1bed114
Merge branch 'master' into improve_week_day 2023-01-21 20:40:33 +01:00
Han Fei
4bbe90f6b4
Merge pull request #45473 from hanfei1991/hanfei/async-inserts-doc
update docs for async insert deduplication
2023-01-20 18:25:19 +01:00
Han Fei
18a397f8c9 address comments 2023-01-20 18:09:42 +01:00
Han Fei
449ace3373
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:07:19 +01:00
Han Fei
9d87bd10ee
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:07:08 +01:00
Han Fei
badfbcb3d8
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:58 +01:00
Han Fei
e9c4cf46cd
Update docs/en/operations/settings/merge-tree-settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:46 +01:00
Han Fei
9d254f7d87
Update docs/en/operations/settings/merge-tree-settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:32 +01:00
robot-ch-test-poll4
2066581d8f
Merge pull request #45451 from evillique/default_granularity
Add default GRANULARITY argument for secondary indexes
2023-01-20 17:46:21 +01:00
Nikolay Degterinsky
02142596fb Add docs 2023-01-20 15:22:13 +00:00
Han Fei
5fc4998f10 update docs for async insert deduplication 2023-01-20 14:42:11 +01:00
Robert Schulze
4ac17d71fa
Merge pull request #45470 from ClickHouse/rs-doc-typos
Fix typos
2023-01-20 14:39:27 +01:00
Robert Schulze
3f2e4c8217
Fix typos 2023-01-20 13:20:25 +00:00
Robert Schulze
687f9c35a7
Merge pull request #45469 from ClickHouse/inv-idx-docs
Docs for inverted index
2023-01-20 14:17:58 +01:00
Robert Schulze
7e6d3163b1
Initial inverted index docs 2023-01-20 12:12:20 +00:00
Maksim Kita
758c8f2776
Merge branch 'master' into dict/remove-preallocate 2023-01-20 13:15:37 +03:00
Antonio Andelic
0ad37ad286
Merge pull request #45320 from stigsb/system_tables_volume_config
Add <storage_policy> config parameter for system logs
2023-01-20 10:27:57 +01:00
Robert Schulze
099f50c0e3
Remove redundancy 2023-01-19 18:35:19 +00:00
Robert Schulze
d8144ffafc
Merge branch 'master' into index-docs 2023-01-19 18:05:27 +01:00
Robert Schulze
c9988f4411
Mini fix 2023-01-19 17:04:17 +00:00
Robert Schulze
7355e3bc77
Docs: Beautify section on secondary index types 2023-01-19 17:00:31 +00:00
robot-clickhouse-ci-1
13d4d40568
Merge pull request #45325 from ClickHouse/download-builds
Improve manual to get binaries from CI
2023-01-19 17:49:52 +01:00
Stig Bakken
420c179b55 Add <storage_policy> config parameter for system logs 2023-01-19 10:25:28 +01:00
DanRoscigno
5d88a2b344 clarify autodetect 2023-01-18 15:51:31 -05:00
Azat Khuzhin
4366f7fb3b Remove PREALLOCATE for HASHED/SPARSE_HASHED dictionaries
It does not give significant benefit, but now, you hashed/sparse_hashed
dictionaries can be filled in parallel (#40003), using sharded
dictionaries, and this should be used instead of PREALLOCATE.

Note, that dictionaries, that had been created with PREALLOCATE will
work, but simply ignore this attribute.

Fixes: #41985 (cc @alexey-milovidov)
Reverts: #23979 (cc @kitaisreal)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-18 20:18:37 +01:00
DanRoscigno
cd3ecb9091 add support_batch_delete to mergetree docs 2023-01-18 12:16:31 -05:00
Maksim Kita
8225d2814c
Merge pull request #40003 from azat/dict-shards
Add ability to load hashed dictionaries using multiple threads
2023-01-18 13:37:10 +03:00
Sergei Trifonov
ca0fb785f4
Merge pull request #45193 from ClickHouse/concurrency-control-docs
Add docs for concurrency control logic
2023-01-17 21:10:17 +01:00
Vladimir C
b80ee8df50
Merge pull request #45256 from DanRoscigno/add-grace-hash-docs 2023-01-16 18:43:19 +01:00
Kruglov Pavel
bdb3517512
Merge pull request #45231 from Avogar/json-tuples
Insert default values in case of missing tuple elements in JSONEachRow
2023-01-16 17:49:50 +01:00
Mikhail f. Shiryaev
08c9d81c90
Apply suggestions from code review
Co-authored-by: Igor Nikonov <954088+devcrafter@users.noreply.github.com>
2023-01-16 16:55:24 +01:00
Mikhail f. Shiryaev
1bd7279d5f
Improve manual to get binaries from CI 2023-01-16 16:45:30 +01:00
Nikolay Degterinsky
70e79de69b
Merge pull request #38252 from bharatnc/ncb/weighted-quantile-approx
add quantileInterpolatedWeighted function
2023-01-16 13:41:13 +01:00
Sema Checherinda
d746a3c4ff
Merge pull request #44480 from wineternity/issue_43333_doc
[DOC] Add support for signed arguments in range() #43333
2023-01-16 10:26:49 +01:00
Ilya Yatsishin
cf5052c77e
Merge pull request #45291 from den-crane/patch-57 2023-01-16 02:33:54 +01:00
Dan Roscigno
adca0b64d3
use markdown file instead of URL to enforce 404 checks 2023-01-15 19:31:58 -05:00
Peignon Melvyn
674a1d1877
Update json.md 2023-01-16 01:27:08 +01:00
Denny Crane
6cf603e05f
Update index.md 2023-01-15 18:40:59 -04:00
Dan Roscigno
f13fdfb431
Merge pull request #45280 from tbragin/patch-5
Update postgresql.md
2023-01-15 11:52:47 -05:00
Robert Schulze
bd41c74ddf
Various test, code and docs fixups 2023-01-15 13:47:34 +00:00
Alexey Milovidov
a5244f8d1d
Merge pull request #45279 from tbragin/patch-4
Update postgresql.md
2023-01-15 12:34:09 +03:00
Ilya Yatsishin
96987b7cd8
Merge pull request #45239 from Avogar/generate-random 2023-01-15 00:37:34 +01:00
Tanya Bragin
d825f50c84
Update postgresql.md
Adding Postgres blog to Postgres DB engine page

cc @DanRoscigno @gingerwizard
2023-01-14 14:39:41 -08:00
Tanya Bragin
b5773caeb2
Update postgresql.md
Adding related Postgres blog here. 

cc @gingerwizard @DanRoscigno
2023-01-14 14:34:33 -08:00
Tanya Bragin
36ec76f53d
Update clickhouse-local.md
cc @DanRoscigno @gingerwizard Adding latest clickhouse-local blog to related content on this page.
2023-01-14 07:50:10 -08:00
DanRoscigno
d0a55f6dc9 doc grace_hash algorithm for join 2023-01-13 13:17:03 -05:00
Rich Raposa
c7aad8e48b
Merge pull request #45207 from ClickHouse/add-maxintersections-to-docs
Add maxIntersections to docs
2023-01-13 10:27:59 -07:00
Robert Schulze
4ea836b87e
Revert "Revert "update function DAYOFWEEK and add new function WEEKDAY for mysql/spark compatiability""
This reverts commit e37f572c34.
2023-01-13 14:00:16 +00:00
Azat Khuzhin
99063b152f Allow to configure queue backlog of the parallel hashed dictionary loader
v2: Decrease default parallel_queue_backlog to 10000 (same speed)
v3: Rename parallel_queue_backlog to per_shard_load_backlog
v3: Rename per_shard_load_backlog to shard_load_queue_backlog
v4: Fix documentation
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:26 +01:00
Azat Khuzhin
345c422e28 Add ability to load hashed dictionaries using multiple threads
Right now dictionaries (here I will talk about only
HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED)
can load data only in one thread, since it uses one hash table that
cannot be filled from multiple threads.

And in case you have very big dictionary (i.e. 10e9 elements), it can
take a awhile to load them, especially for SPARSE_HASHED variants (and
if you have such amount of elements there, you are likely use
SPARSE_HASHED, since it requires less memory), in my env it takes ~4
hours, which is enormous amount of time.

So this patch add support of shards for dictionaries, number of shards
determine how much hash tables will use this dictionary, also, and which
is more important, how much threads it can use to load the data.

And with 16 threads this works 2x faster, not perfect though, see the
follow up patches in this series.

v0: PARTITION BY
v1: SHARDS 1
v2: SHARDS(1)
v3: tried optimized mod - logical and, but it does not gain even 10%
v4: tried squashing more (max_block_size * shards), but it does not gain even 10% either
v5: move SHARDS into layout parameters (unknown simply ignored)
v6: tune params for perf tests (to avoid too long queries)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:25 +01:00