Azat Khuzhin
4366f7fb3b
Remove PREALLOCATE for HASHED/SPARSE_HASHED dictionaries
...
It does not give significant benefit, but now, you hashed/sparse_hashed
dictionaries can be filled in parallel (#40003 ), using sharded
dictionaries, and this should be used instead of PREALLOCATE.
Note, that dictionaries, that had been created with PREALLOCATE will
work, but simply ignore this attribute.
Fixes : #41985 (cc @alexey-milovidov)
Reverts: #23979 (cc @kitaisreal)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-18 20:18:37 +01:00
Maksim Kita
8225d2814c
Merge pull request #40003 from azat/dict-shards
...
Add ability to load hashed dictionaries using multiple threads
2023-01-18 13:37:10 +03:00
Sergei Trifonov
ca0fb785f4
Merge pull request #45193 from ClickHouse/concurrency-control-docs
...
Add docs for concurrency control logic
2023-01-17 21:10:17 +01:00
Vladimir C
b80ee8df50
Merge pull request #45256 from DanRoscigno/add-grace-hash-docs
2023-01-16 18:43:19 +01:00
Kruglov Pavel
bdb3517512
Merge pull request #45231 from Avogar/json-tuples
...
Insert default values in case of missing tuple elements in JSONEachRow
2023-01-16 17:49:50 +01:00
Nikolay Degterinsky
70e79de69b
Merge pull request #38252 from bharatnc/ncb/weighted-quantile-approx
...
add quantileInterpolatedWeighted function
2023-01-16 13:41:13 +01:00
Sema Checherinda
d746a3c4ff
Merge pull request #44480 from wineternity/issue_43333_doc
...
[DOC] Add support for signed arguments in range() #43333
2023-01-16 10:26:49 +01:00
Ilya Yatsishin
cf5052c77e
Merge pull request #45291 from den-crane/patch-57
2023-01-16 02:33:54 +01:00
Dan Roscigno
adca0b64d3
use markdown file instead of URL to enforce 404 checks
2023-01-15 19:31:58 -05:00
Peignon Melvyn
674a1d1877
Update json.md
2023-01-16 01:27:08 +01:00
Denny Crane
6cf603e05f
Update index.md
2023-01-15 18:40:59 -04:00
Dan Roscigno
f13fdfb431
Merge pull request #45280 from tbragin/patch-5
...
Update postgresql.md
2023-01-15 11:52:47 -05:00
Alexey Milovidov
a5244f8d1d
Merge pull request #45279 from tbragin/patch-4
...
Update postgresql.md
2023-01-15 12:34:09 +03:00
Ilya Yatsishin
96987b7cd8
Merge pull request #45239 from Avogar/generate-random
2023-01-15 00:37:34 +01:00
Tanya Bragin
d825f50c84
Update postgresql.md
...
Adding Postgres blog to Postgres DB engine page
cc @DanRoscigno @gingerwizard
2023-01-14 14:39:41 -08:00
Tanya Bragin
b5773caeb2
Update postgresql.md
...
Adding related Postgres blog here.
cc @gingerwizard @DanRoscigno
2023-01-14 14:34:33 -08:00
Tanya Bragin
36ec76f53d
Update clickhouse-local.md
...
cc @DanRoscigno @gingerwizard Adding latest clickhouse-local blog to related content on this page.
2023-01-14 07:50:10 -08:00
DanRoscigno
d0a55f6dc9
doc grace_hash algorithm for join
2023-01-13 13:17:03 -05:00
Rich Raposa
c7aad8e48b
Merge pull request #45207 from ClickHouse/add-maxintersections-to-docs
...
Add maxIntersections to docs
2023-01-13 10:27:59 -07:00
Azat Khuzhin
99063b152f
Allow to configure queue backlog of the parallel hashed dictionary loader
...
v2: Decrease default parallel_queue_backlog to 10000 (same speed)
v3: Rename parallel_queue_backlog to per_shard_load_backlog
v3: Rename per_shard_load_backlog to shard_load_queue_backlog
v4: Fix documentation
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:26 +01:00
Azat Khuzhin
345c422e28
Add ability to load hashed dictionaries using multiple threads
...
Right now dictionaries (here I will talk about only
HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED)
can load data only in one thread, since it uses one hash table that
cannot be filled from multiple threads.
And in case you have very big dictionary (i.e. 10e9 elements), it can
take a awhile to load them, especially for SPARSE_HASHED variants (and
if you have such amount of elements there, you are likely use
SPARSE_HASHED, since it requires less memory), in my env it takes ~4
hours, which is enormous amount of time.
So this patch add support of shards for dictionaries, number of shards
determine how much hash tables will use this dictionary, also, and which
is more important, how much threads it can use to load the data.
And with 16 threads this works 2x faster, not perfect though, see the
follow up patches in this series.
v0: PARTITION BY
v1: SHARDS 1
v2: SHARDS(1)
v3: tried optimized mod - logical and, but it does not gain even 10%
v4: tried squashing more (max_block_size * shards), but it does not gain even 10% either
v5: move SHARDS into layout parameters (unknown simply ignored)
v6: tune params for perf tests (to avoid too long queries)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:25 +01:00
avogar
82ff1fd343
Add tests and docs
2023-01-12 22:29:23 +00:00
Yakov Olkhovskiy
842a34666c
Merge pull request #44594 from arenadata/ADQM-634
...
Add keytab parameter in kerberos section of config.xml
2023-01-12 15:07:45 -05:00
avogar
87b934c472
Insert default values in case of missing tuple elements in JSONEachRow
2023-01-12 16:36:44 +00:00
DanRoscigno
69cdb838c5
temporarily replace mermaid with PNG
2023-01-12 11:03:40 -05:00
Dan Roscigno
8c94ed9597
Update docs/en/sql-reference/aggregate-functions/reference/maxintersections.md
2023-01-12 11:01:03 -05:00
Rich Raposa
759a4c0940
Update docs/en/sql-reference/aggregate-functions/reference/maxintersections.md
...
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-12 08:53:22 -07:00
rfraposa
69a11574d2
Update maxintersections.md
2023-01-12 08:30:54 -07:00
Kseniia Sumarokova
db3e0219fc
Merge pull request #41687 from ClickHouse/40907_Parameterized_views_as_table_functions
...
40907 Parameterized views as table functions
2023-01-12 14:24:32 +01:00
Alexander Tokmakov
e37f572c34
Revert "update function DAYOFWEEK and add new function WEEKDAY for mysql/spark compatiability"
2023-01-12 15:01:36 +03:00
Sergei Trifonov
8c40c4d79f
Update docs/en/development/architecture.md
...
Co-authored-by: Dmitry Novik <n0vik@clickhouse.com>
2023-01-12 02:25:03 +01:00
Sergei Trifonov
e784f65ee9
Update docs/en/development/architecture.md
...
Co-authored-by: Dmitry Novik <n0vik@clickhouse.com>
2023-01-12 02:23:58 +01:00
Sergei Trifonov
78a63e795c
Update docs/en/development/architecture.md
...
Co-authored-by: Dmitry Novik <n0vik@clickhouse.com>
2023-01-12 02:23:15 +01:00
Sergei Trifonov
d41dbdc4a6
Update docs/en/development/architecture.md
...
Co-authored-by: Dmitry Novik <n0vik@clickhouse.com>
2023-01-12 02:22:59 +01:00
Dan Roscigno
7a651d749c
Update docs/en/sql-reference/aggregate-functions/reference/maxintersections.md
2023-01-11 19:20:37 -05:00
rfraposa
2e44ad9d0f
Add maxIntersections to docs
2023-01-11 17:10:51 -07:00
Ilya Yatsishin
3a98f2bc12
Merge pull request #45190 from ClickHouse/add-query-params-to-docs
...
Add query parameters to the docs
2023-01-11 23:47:50 +01:00
Sergei Trifonov
1b94c839d5
Add docs for SYSTEM RELOAD USERS
2023-01-11 21:16:22 +01:00
serxa
b6e14f60d4
fix
2023-01-11 19:34:31 +00:00
serxa
2dae6a1ffe
add docs for concurrency control logic
2023-01-11 19:19:51 +00:00
Rich Raposa
f8ac49bb86
Update syntax.md
2023-01-11 12:09:23 -07:00
Rich Raposa
a389180f42
Update syntax.md
2023-01-11 12:05:35 -07:00
rfraposa
8b9d99e2e2
Update syntax.md
2023-01-11 11:51:53 -07:00
Sergei Trifonov
ec9f10e934
Merge pull request #45174 from ClickHouse/make-queries-copyable-from-docs
...
make more SQL queries copyable from docs in one click
2023-01-11 16:59:52 +01:00
DanRoscigno
7168c217b0
switch text to response for query blocks
2023-01-11 10:08:11 -05:00
Dan Roscigno
bb3c4641a7
Merge pull request #45167 from ClickHouse/add-sparse-serialization-to-docs
...
Add explanation of ratio_of_defaults_for_sparse_serialization setting to docs
2023-01-11 08:57:10 -05:00
Sergei Trifonov
8b724c6d58
Merge pull request #45169 from ClickHouse/moves-table-docs
...
Add docs for `system.moves` table
2023-01-11 14:48:19 +01:00
serxa
8d099a4417
make more SQL queries copyable from docs in one click
2023-01-11 13:43:51 +00:00
DanRoscigno
4767147745
format query response
2023-01-11 08:23:44 -05:00
Dan Roscigno
0ad969171e
Merge pull request #45127 from DanRoscigno/add-deltalake-docs
...
Add deltalake docs
2023-01-11 08:07:42 -05:00