Commit Graph

3290 Commits

Author SHA1 Message Date
Salvatore Mesoraca
15b1f3f6a1
doc: functions: siphash: use correct value in example 2023-01-27 13:00:53 +01:00
Alexey Milovidov
5b257ab806
Merge pull request #45233 from ClickHouse/improve_week_day
Revert "Revert "Improve week day""
2023-01-27 02:44:17 +03:00
Dan Roscigno
76d6e2edf9
Merge pull request #45569 from AVMusorin/docs_optimize_skip_merged_partitions
docs: added optimize_skip_merged_partitions setting
2023-01-26 16:04:27 -05:00
Dan Roscigno
f8bd83a94f
Merge pull request #45595 from den-crane/patch-59
Doc. Update range_hashed description
2023-01-26 16:01:52 -05:00
Dan Roscigno
b4ab361366
Merge branch 'master' into docs_optimize_skip_merged_partitions 2023-01-25 19:45:32 -05:00
Dan Roscigno
55c9b03c47
Update docs/en/sql-reference/statements/optimize.md 2023-01-25 19:45:25 -05:00
rfraposa
a1de808d11 Update mongodb.md 2023-01-25 11:44:13 -07:00
rfraposa
8d44f2aec1 Create mongodb.md 2023-01-25 11:31:07 -07:00
Robert Schulze
d1eb9264ed
Merge remote-tracking branch 'origin/master' into docs-explain-plan 2023-01-25 11:05:46 +00:00
Robert Schulze
da7a657f1f
Docs: fix docs of EXPLAIN PLAN indexes=1 2023-01-25 11:02:50 +00:00
Robert Schulze
0148ad0e0e
Merge pull request #45598 from ClickHouse/typo-fix
Typo: "Granulesis" --> "Granules"
2023-01-25 11:52:11 +01:00
Robert Schulze
fd45295ff4
Typo: "Granulesis" --> "Granules" 2023-01-25 10:49:30 +00:00
Denny Crane
fda47bf4f8
Update external-dicts-dict-layout.md 2023-01-24 21:31:43 -04:00
Dan Roscigno
1cc3708092
Merge branch 'master' into update-settings-url 2023-01-24 19:51:39 -05:00
DanRoscigno
77ae27f26c update for split of format settings 2023-01-24 19:37:55 -05:00
Robert Schulze
87ca5dcdf2
Document start of week in function date_diff() 2023-01-24 20:18:59 +00:00
AVMusorin
5f8658b2d6
docs: added optimize_skip_merged_partitions setting 2023-01-24 14:56:39 +01:00
rfraposa
33770303a0 Add DISTINCT to INTERSECT and EXCEPT 2023-01-23 17:10:30 -07:00
Dan Roscigno
ee31da9e1e
Merge pull request #45366 from gingerwizard/cross-link-docs
link docs to blogs
2023-01-23 14:04:59 -05:00
Robert Schulze
0ab7ae6c17
Merge pull request #43797 from ClickHouse/query-result-cache
Query result cache [experimental]
2023-01-23 19:54:35 +01:00
Sergei Trifonov
e1b20e4314
Merge pull request #45503 from ClickHouse/serxa-patch-3
Provide more clear syntax for `fromUnixTimestamp64*` and `toUnixTimestamp64*` functions
2023-01-23 16:28:32 +01:00
Denny Crane
531ad1661e
Update column.md 2023-01-23 09:57:42 -04:00
Denny Crane
4d9be1ef4f
Update column.md 2023-01-23 09:57:04 -04:00
Denny Crane
2bf59fd48c
Update column.md 2023-01-23 09:55:30 -04:00
Sergei Trifonov
e0fa5b9bac
Update type-conversion-functions.md 2023-01-23 12:26:24 +01:00
Robert Schulze
340f406553
Merge branch 'master' into query-result-cache 2023-01-22 13:21:36 +01:00
Robert Schulze
97d1bed114
Merge branch 'master' into improve_week_day 2023-01-21 20:40:33 +01:00
Robert Schulze
4ac17d71fa
Merge pull request #45470 from ClickHouse/rs-doc-typos
Fix typos
2023-01-20 14:39:27 +01:00
Robert Schulze
3f2e4c8217
Fix typos 2023-01-20 13:20:25 +00:00
Azat Khuzhin
4366f7fb3b Remove PREALLOCATE for HASHED/SPARSE_HASHED dictionaries
It does not give significant benefit, but now, you hashed/sparse_hashed
dictionaries can be filled in parallel (#40003), using sharded
dictionaries, and this should be used instead of PREALLOCATE.

Note, that dictionaries, that had been created with PREALLOCATE will
work, but simply ignore this attribute.

Fixes: #41985 (cc @alexey-milovidov)
Reverts: #23979 (cc @kitaisreal)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-18 20:18:37 +01:00
Maksim Kita
8225d2814c
Merge pull request #40003 from azat/dict-shards
Add ability to load hashed dictionaries using multiple threads
2023-01-18 13:37:10 +03:00
Robert Schulze
4f90824347
Merge remote-tracking branch 'origin/master' into query-result-cache 2023-01-17 22:49:53 +00:00
Dale Mcdiarmid
9c4d3e0ffb link docs to blogs 2023-01-17 15:38:10 +00:00
Nikolay Degterinsky
70e79de69b
Merge pull request #38252 from bharatnc/ncb/weighted-quantile-approx
add quantileInterpolatedWeighted function
2023-01-16 13:41:13 +01:00
Sema Checherinda
d746a3c4ff
Merge pull request #44480 from wineternity/issue_43333_doc
[DOC] Add support for signed arguments in range() #43333
2023-01-16 10:26:49 +01:00
Robert Schulze
099e30ef2a
Merge remote-tracking branch 'origin/master' into query-result-cache 2023-01-16 08:04:49 +00:00
Ilya Yatsishin
cf5052c77e
Merge pull request #45291 from den-crane/patch-57 2023-01-16 02:33:54 +01:00
Dan Roscigno
adca0b64d3
use markdown file instead of URL to enforce 404 checks 2023-01-15 19:31:58 -05:00
Peignon Melvyn
674a1d1877
Update json.md 2023-01-16 01:27:08 +01:00
Denny Crane
6cf603e05f
Update index.md 2023-01-15 18:40:59 -04:00
Robert Schulze
bd41c74ddf
Various test, code and docs fixups 2023-01-15 13:47:34 +00:00
Ilya Yatsishin
96987b7cd8
Merge pull request #45239 from Avogar/generate-random 2023-01-15 00:37:34 +01:00
Rich Raposa
c7aad8e48b
Merge pull request #45207 from ClickHouse/add-maxintersections-to-docs
Add maxIntersections to docs
2023-01-13 10:27:59 -07:00
Robert Schulze
4ea836b87e
Revert "Revert "update function DAYOFWEEK and add new function WEEKDAY for mysql/spark compatiability""
This reverts commit e37f572c34.
2023-01-13 14:00:16 +00:00
Azat Khuzhin
99063b152f Allow to configure queue backlog of the parallel hashed dictionary loader
v2: Decrease default parallel_queue_backlog to 10000 (same speed)
v3: Rename parallel_queue_backlog to per_shard_load_backlog
v3: Rename per_shard_load_backlog to shard_load_queue_backlog
v4: Fix documentation
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:26 +01:00
Azat Khuzhin
345c422e28 Add ability to load hashed dictionaries using multiple threads
Right now dictionaries (here I will talk about only
HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED)
can load data only in one thread, since it uses one hash table that
cannot be filled from multiple threads.

And in case you have very big dictionary (i.e. 10e9 elements), it can
take a awhile to load them, especially for SPARSE_HASHED variants (and
if you have such amount of elements there, you are likely use
SPARSE_HASHED, since it requires less memory), in my env it takes ~4
hours, which is enormous amount of time.

So this patch add support of shards for dictionaries, number of shards
determine how much hash tables will use this dictionary, also, and which
is more important, how much threads it can use to load the data.

And with 16 threads this works 2x faster, not perfect though, see the
follow up patches in this series.

v0: PARTITION BY
v1: SHARDS 1
v2: SHARDS(1)
v3: tried optimized mod - logical and, but it does not gain even 10%
v4: tried squashing more (max_block_size * shards), but it does not gain even 10% either
v5: move SHARDS into layout parameters (unknown simply ignored)
v6: tune params for perf tests (to avoid too long queries)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:25 +01:00
avogar
82ff1fd343 Add tests and docs 2023-01-12 22:29:23 +00:00
Dan Roscigno
8c94ed9597
Update docs/en/sql-reference/aggregate-functions/reference/maxintersections.md 2023-01-12 11:01:03 -05:00
Rich Raposa
759a4c0940
Update docs/en/sql-reference/aggregate-functions/reference/maxintersections.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-12 08:53:22 -07:00
rfraposa
69a11574d2 Update maxintersections.md 2023-01-12 08:30:54 -07:00