Commit Graph

9146 Commits

Author SHA1 Message Date
Robert Schulze
fdd6c77480
Merge pull request #45382 from bigo-sg/improve_position
Add fastpath for function position when needle is empty
2023-01-31 11:22:31 +01:00
Robert Schulze
325c6bdf3d
Renaming: "Query Result Cache" --> "Query Cache"
Reasons:

- The cache will at some point store intermediate results as opposed to
  only query results. We should change the terminology now without
  having to worry about backward compat.

- Equivalent caches in MySQL (1) and Starrocks (2) are called "query
  cache".

- The new name is ca. 13.8% more catchy.

(1) https://dev.mysql.com/doc/refman/5.6/en/query-cache.html
(2) https://docs.starrocks.io/en-us/2.5/using_starrocks/query_cache
2023-01-31 09:54:34 +00:00
Robert Schulze
659a64a1d9
Merge pull request #45679 from ClickHouse/untangle-query-result-cache-settings
Untangle configuration of query result cache
2023-01-31 10:44:14 +01:00
taiyang-li
4cf11d9a84 change as request 2023-01-31 10:48:02 +08:00
Ilya Yatsishin
8d6df7ed58
Merge pull request #45785 from Algunenano/generate_random_types 2023-01-30 23:03:12 +01:00
Dan Roscigno
7752e7f8df
Update backup.md
fix link
2023-01-30 14:51:51 -05:00
Igor Nikonov
741c3ba23c Fix non-intentional change 2023-01-30 18:43:01 +01:00
Igor Nikonov
01b225f57e
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-30 18:30:59 +01:00
Igor Nikonov
211449d916
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-30 18:30:29 +01:00
Igor Nikonov
b095dab141
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-30 18:29:32 +01:00
Igor Nikonov
0c4bc489da
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-30 18:28:53 +01:00
Igor Nikonov
30c949ba27
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-30 18:28:34 +01:00
Nikita Taranov
581f31ad3d better 2023-01-30 17:11:56 +00:00
Nikita Taranov
d8c191f53b add docs 2023-01-30 17:11:56 +00:00
Alexander Gololobov
65d39a616d Added ON CLUSTER 2023-01-30 17:02:29 +01:00
Igor Nikonov
c30a55b7ee Some polishing 2023-01-30 16:53:37 +01:00
Igor Nikonov
d8ba469b97
Fixes 2023-01-30 16:24:02 +01:00
Igor Nikonov
65caa77a0e
Update settings.md 2023-01-30 16:13:08 +01:00
Arthur Passos
7b1ceaa2c5
Merge branch 'master' into fix_cares_crash 2023-01-30 12:05:56 -03:00
Raúl Marín
997fad4e8d GenerateRandom: Add support for maps 2023-01-30 15:23:22 +01:00
Robert Schulze
27b978381b
Use more speaking setting names 2023-01-30 14:20:43 +00:00
Dan Roscigno
a33465cf2f
Merge branch 'master' into patch-1 2023-01-30 07:33:20 -05:00
Dan Roscigno
f91f2a3e24
Merge pull request #45771 from ClickHouse/troubleshoot-rpm
Add troubleshooting for wrong RPM repos signature
2023-01-30 07:28:46 -05:00
Ilya Yatsishin
ebce26b50e
Merge pull request #45775 from DerekChia/patch-1 2023-01-30 13:20:56 +01:00
Robert Schulze
9124e56e2a
Docs: Remove non existing function 2023-01-30 11:47:58 +00:00
Dan Roscigno
86acb61849
Merge branch 'master' into patch-1 2023-01-30 06:18:23 -05:00
Derek Chia
692b7ab211
Update dictionary.md
Remove "statement:" from a query
2023-01-30 17:56:17 +08:00
Mikhail f. Shiryaev
b1bc3b6b43
Add troubleshooting for wrong RPM repos signature 2023-01-30 10:27:50 +01:00
taiyang-li
d25740da83 change as request 2023-01-30 16:13:12 +08:00
Roman Heinrich
bcc933098b
Fix broken examples for inverted indexes 2023-01-30 00:12:32 +01:00
Robert Schulze
15ae2d1de5
Merge branch 'master' into aiven-sal/siphash_pr 2023-01-29 21:06:52 +01:00
Robert Schulze
6195456d5b
Fix formatting 2023-01-29 19:58:59 +00:00
Robert Schulze
0a8dee7000
Docs: Fix link 2023-01-29 11:26:49 +00:00
Alexey Milovidov
5f4726eb2a
Merge pull request #45717 from Performica/topKWeighted-better-example
improve explanatory of topKWeighted example
2023-01-29 06:24:32 +03:00
Alexey Milovidov
0f7f490302
Merge pull request #45716 from Performica/maxMap-intuitive-example
More intuitive maxMap example
2023-01-29 06:23:54 +03:00
freedomDR
876a92d4d1 add doc about system sync file cache. 2023-01-28 04:27:10 +00:00
robot-ch-test-poll
cb803a887a
Merge pull request #45660 from abidino/patch-2
Update sum.md
2023-01-28 01:52:06 +01:00
Alexey Milovidov
bc2f454522
Merge branch 'master' into block-non-float-gorilla-v2 2023-01-28 03:30:12 +03:00
Alexey Milovidov
0ffc8f048b
Merge pull request #45464 from lesandie/patch-1
Update skipping-index.md
2023-01-28 03:27:44 +03:00
Alexey Milovidov
a62ca0f016
Merge pull request #45688 from jus1096/master
Add info about connect to remote Clickhouse
2023-01-28 02:21:28 +03:00
ivan-klass
b83e60b1d7
improve explanatory of topKWeighted example 2023-01-28 00:20:15 +01:00
ivan-klass
8a6c4bec13
More intuitive maxMap example
Different types used to easier distinguish maxMap keys vs values
2023-01-27 23:41:55 +01:00
Kruglov Pavel
96700abbe1
Merge pull request #45678 from azat/formats/json-parse-tupels
Add ability to ignore unknown keys in JSON object for named tuples
2023-01-27 21:11:05 +01:00
Kseniia Sumarokova
7d178e5ee6
Merge pull request #45680 from DanRoscigno/docs-mergetree-s3-cache
doc updates related to s3 cache change
2023-01-27 17:54:31 +01:00
DanRoscigno
a88c222f44 add missing heading 2023-01-27 08:56:25 -05:00
DanRoscigno
80b3b882c3 review from ksenii 2023-01-27 08:48:29 -05:00
Robert Schulze
4abd395153
Docs: Fix typo 2023-01-27 13:39:50 +00:00
Robert Schulze
5edf321927
Some docs fixes 2023-01-27 13:00:54 +01:00
Salvatore Mesoraca
464ecf50ef
doc: functions: hash: add sipHash{64,128}Keyed 2023-01-27 13:00:54 +01:00
Salvatore Mesoraca
dfd6dfc155
doc: functions: siphash: fix broken links 2023-01-27 13:00:53 +01:00
Salvatore Mesoraca
15b1f3f6a1
doc: functions: siphash: use correct value in example 2023-01-27 13:00:53 +01:00
Robert Schulze
e30c38cab3
Fix up some leftovers 2023-01-27 10:43:43 +00:00
Azat Khuzhin
1a8437f2c9 Add ability to ignore unknown keys in JSON object for named tuples
This can be useful in case your input JSON is complex, while you need
only few fields in it.

This behaviour is controlled by the
input_format_json_ignore_unknown_keys_in_named_tuple setting name, that
is turned OFF by default.

This will, almost, allow to parse gharchive dataset without jq. "almost"
because of two things:
- Tuple cannot be Nullable, so such keys with Tuple type in ClickHouse
  cannot be `null` in JSON
- You cannot use dot.dot notation to extract columns for file() engine,
  only tupleElement()

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-27 10:01:08 +01:00
Jus
9ce4bf1baf
Update named-collections.md
small fix name host on the remote host
2023-01-27 12:42:53 +04:00
Alexey Milovidov
5b257ab806
Merge pull request #45233 from ClickHouse/improve_week_day
Revert "Revert "Improve week day""
2023-01-27 02:44:17 +03:00
DanRoscigno
15b4c712df updates related to s3 cache change 2023-01-26 16:32:39 -05:00
Robert Schulze
7d97fab2dc
Untangle query result cache settings
This PR modularizes the configuration options of the query result cache
(QRC).

1. Add a feature toggle "allow_experimental_query_result_cache = false"
   which controls whether the experimental QRC can be used at all. This
   configuration option will be removed once the QRC becomes
   non-experimental.

2. Modularize the existing two configuration settings
   "enable_experimental_query_result_cache" and
   "enable_experimental_query_result_cache_passive_usage" into three
   orthogonal settings:

   - "use_query_result_cache = false" controls whether a query (or all
     queries of the session) utilize the query result cache

   - settings "use_query_result_cache_passive_usage = true" and
     "use_query_result_cache_active_usage = true" control the way a
     query (or all queries of the session) utilize the query result
     cache: "passive" use is reading, "active" use is writing.
2023-01-26 21:17:48 +00:00
Dan Roscigno
76d6e2edf9
Merge pull request #45569 from AVMusorin/docs_optimize_skip_merged_partitions
docs: added optimize_skip_merged_partitions setting
2023-01-26 16:04:27 -05:00
Dan Roscigno
f8bd83a94f
Merge pull request #45595 from den-crane/patch-59
Doc. Update range_hashed description
2023-01-26 16:01:52 -05:00
Dan Roscigno
d1a81b2776
Update docs/en/engines/table-engines/mergetree-family/invertedindexes.md 2023-01-26 14:07:13 -05:00
Dan Roscigno
00b5354a93
Update invertedindexes.md 2023-01-26 14:05:26 -05:00
Dan Roscigno
f71b68bc46
Merge pull request #45651 from mrcrypster/patch-5
Added markdown format docs
2023-01-26 11:42:22 -05:00
Diego Nieto
836e14e52b
Update skipping-index.md
Deleted the confusing comment and added implemented as a mutation, like MATERIALIZE INDEX.
2023-01-26 17:23:39 +01:00
Dan Roscigno
777ddf80ab
Update docs/en/interfaces/formats.md 2023-01-26 11:18:18 -05:00
Diego Nieto
ec378fc4d6
Merge branch 'ClickHouse:master' into patch-1 2023-01-26 17:17:35 +01:00
Dan Roscigno
8549af4853
Update docs/en/engines/table-engines/mergetree-family/invertedindexes.md 2023-01-26 10:51:08 -05:00
DanRoscigno
710384e69a remove sidebar position 2023-01-26 10:13:06 -05:00
DanRoscigno
88076b6db0 add keywords 2023-01-26 10:06:06 -05:00
abidino
cba266c965
Update sum.md 2023-01-26 17:33:30 +03:00
DanRoscigno
c130050572 capitalize sidebar 2023-01-26 09:05:56 -05:00
Dan Roscigno
03c80d2794
Merge branch 'master' into docs-inverted-index-edits 2023-01-26 09:03:50 -05:00
DanRoscigno
310797d9f0 edits, add frontmatter 2023-01-26 08:58:10 -05:00
Dan Roscigno
efe94b727c
Merge pull request #45574 from FArthur-cmd/laion_doc
Add documentation for laion dataset
2023-01-26 08:43:28 -05:00
FArthur-cmd
71f7274f1f small fixes 2023-01-26 15:24:41 +03:00
Igor Nikonov
ed01f76c6a
Merge pull request #45592 from ClickHouse/update-insert-delay-doc-2
Update INSERT delay doc with example
2023-01-26 12:55:35 +01:00
Denys Golotiuk
0eadb7112d
Added markdown format docs 2023-01-26 13:33:14 +02:00
Arthur Passos
433eda70fd add reverseDNSQuery docs 2023-01-26 07:57:43 -03:00
Dan Roscigno
b4ab361366
Merge branch 'master' into docs_optimize_skip_merged_partitions 2023-01-25 19:45:32 -05:00
Dan Roscigno
55c9b03c47
Update docs/en/sql-reference/statements/optimize.md 2023-01-25 19:45:25 -05:00
sichenzhao
243ac52259
Added two metrics about memory usage in cgroup to asynchronous metrics (#45301) 2023-01-25 20:32:17 +01:00
rfraposa
a1de808d11 Update mongodb.md 2023-01-25 11:44:13 -07:00
rfraposa
8d44f2aec1 Create mongodb.md 2023-01-25 11:31:07 -07:00
DanRoscigno
fe1d99a2d3 add PARTITION BY to file and url docs 2023-01-25 09:18:55 -05:00
DanRoscigno
917e3d4244 add PARTITION BY to s3 and hdfs docs 2023-01-25 09:09:28 -05:00
FArthur-cmd
304d1cc53e fix path in docs 2023-01-25 14:39:41 +03:00
Robert Schulze
d1eb9264ed
Merge remote-tracking branch 'origin/master' into docs-explain-plan 2023-01-25 11:05:46 +00:00
Robert Schulze
da7a657f1f
Docs: fix docs of EXPLAIN PLAN indexes=1 2023-01-25 11:02:50 +00:00
Robert Schulze
0148ad0e0e
Merge pull request #45598 from ClickHouse/typo-fix
Typo: "Granulesis" --> "Granules"
2023-01-25 11:52:11 +01:00
Robert Schulze
fd45295ff4
Typo: "Granulesis" --> "Granules" 2023-01-25 10:49:30 +00:00
Denny Crane
fda47bf4f8
Update external-dicts-dict-layout.md 2023-01-24 21:31:43 -04:00
Dan Roscigno
1cc3708092
Merge branch 'master' into update-settings-url 2023-01-24 19:51:39 -05:00
DanRoscigno
77ae27f26c update for split of format settings 2023-01-24 19:37:55 -05:00
Dan Roscigno
e520bd8052
Apply suggestions from code review 2023-01-24 19:10:30 -05:00
Igor Nikonov
29ef6b9574
Update INSERT delay doc properly 2023-01-25 00:26:32 +01:00
Dan Roscigno
dbb83dc9a4
Merge branch 'master' into docs-for-s3-settings 2023-01-24 17:55:31 -05:00
DanRoscigno
fe56967af7 fix note formatting 2023-01-24 17:44:10 -05:00
DanRoscigno
9775987ee9 add new settings for s3 and hdfs 2023-01-24 17:40:37 -05:00
DanRoscigno
f95bc55923 split Format settings out 2023-01-24 17:19:50 -05:00
rfraposa
6f7755ebcd Added optimize using constraints settings 2023-01-24 15:05:34 -07:00
Robert Schulze
d6ab376b02
Merge pull request #45586 from ClickHouse/document-datediff-week-start
Document start of week in function date_diff()
2023-01-24 21:38:00 +01:00
Robert Schulze
87ca5dcdf2
Document start of week in function date_diff() 2023-01-24 20:18:59 +00:00
Robert Schulze
5edea1b461
Docs: mini semicolon fix 2023-01-24 19:25:37 +00:00
robot-ch-test-poll1
0dab6f0208
Merge pull request #45573 from ClickHouse/update-docs-for-insert-delay
Update formula for INSERT delay
2023-01-24 17:07:46 +01:00
Igor Nikonov
e585a1228f
Update docs/en/operations/settings/merge-tree-settings.md
Co-authored-by: Alexander Tokmakov <tavplubix@clickhouse.com>
2023-01-24 16:52:32 +01:00
Kruglov Pavel
23c12ac8ee
Merge branch 'master' into parquet-fixed-binary 2023-01-24 16:51:05 +01:00
Denys Golotiuk
c1de756d8a
DELETE + WHERE in TTL
ClickHouse actually supports only `DELETE` action with `WHERE` clause of a `TTL` statement
2023-01-24 17:50:29 +02:00
FArthur-cmd
b1107eaff4 small improvement 2023-01-24 17:58:31 +03:00
FArthur-cmd
a7def60de8 resolve conflicts 2023-01-24 17:52:20 +03:00
FArthur-cmd
a8ed15650f add laion documentation 2023-01-24 17:47:04 +03:00
Igor Nikonov
21b009fdec
Update merge-tree-settings.md 2023-01-24 15:45:21 +01:00
Kseniia Sumarokova
dd18dd2953
Merge pull request #45341 from ClibMouse/feature/non-throwing-hastoken
Introduce non-throwing variants of hasToken
2023-01-24 15:37:36 +01:00
Kruglov Pavel
4bd3f0e5ef
Merge pull request #44953 from Avogar/tsv-csv-detect-header
Detect header in CSV/TSV/CustomSeparated files automatically
2023-01-24 15:13:52 +01:00
AVMusorin
5f8658b2d6
docs: added optimize_skip_merged_partitions setting 2023-01-24 14:56:39 +01:00
DanRoscigno
72b1a65984 WIP 2023-01-24 07:45:59 -05:00
Robert Schulze
574cab5d7e
Remove transitory parameter 2023-01-24 11:05:29 +00:00
Kseniia Sumarokova
ac1e7264fd
Merge pull request #45528 from ClickHouse/add-distinct-to-intersect
Add DISTINCT to INTERSECT and EXCEPT docs
2023-01-24 11:14:32 +01:00
Robert Schulze
aaf7653108
Merge remote-tracking branch 'origin/master' into block-non-float-gorilla-v2 2023-01-24 10:14:10 +00:00
rfraposa
33770303a0 Add DISTINCT to INTERSECT and EXCEPT 2023-01-23 17:10:30 -07:00
Rich Raposa
429e93965c
Update formats.md
Google has a new website for Protocol Buffers. The old link expires on Jan 31, 2023
2023-01-23 15:42:35 -07:00
ltrk2
9710b8643c Refine the solution 2023-01-23 14:27:48 -08:00
DanRoscigno
dedb4e4de2 add docs for PR 33302 2023-01-23 16:14:39 -05:00
Kruglov Pavel
478a552a0a
Merge branch 'master' into tsv-csv-detect-header 2023-01-23 21:47:17 +01:00
Diego Nieto
c4266588c6
Merge branch 'master' into patch-1 2023-01-23 20:50:37 +01:00
Dan Roscigno
ee31da9e1e
Merge pull request #45366 from gingerwizard/cross-link-docs
link docs to blogs
2023-01-23 14:04:59 -05:00
Robert Schulze
0ab7ae6c17
Merge pull request #43797 from ClickHouse/query-result-cache
Query result cache [experimental]
2023-01-23 19:54:35 +01:00
Dale Mcdiarmid
227b772dd6 remove JSON 2023-01-23 16:51:11 +00:00
Sergei Trifonov
e1b20e4314
Merge pull request #45503 from ClickHouse/serxa-patch-3
Provide more clear syntax for `fromUnixTimestamp64*` and `toUnixTimestamp64*` functions
2023-01-23 16:28:32 +01:00
Kruglov Pavel
01ddf326ac
Merge branch 'master' into parquet-fixed-binary 2023-01-23 15:31:45 +01:00
Denny Crane
531ad1661e
Update column.md 2023-01-23 09:57:42 -04:00
Denny Crane
4d9be1ef4f
Update column.md 2023-01-23 09:57:04 -04:00
Denny Crane
2bf59fd48c
Update column.md 2023-01-23 09:55:30 -04:00
Sergei Trifonov
e0fa5b9bac
Update type-conversion-functions.md 2023-01-23 12:26:24 +01:00
Robert Schulze
ed84021aa5
Formatting fixup 2023-01-22 16:45:26 +01:00
Robert Schulze
0ffffdd442
Docs: Fix weird formatting 2023-01-22 15:38:11 +00:00
Robert Schulze
340f406553
Merge branch 'master' into query-result-cache 2023-01-22 13:21:36 +01:00
Robert Schulze
97d1bed114
Merge branch 'master' into improve_week_day 2023-01-21 20:40:33 +01:00
Robert Schulze
e6167d6b36
Deprecate Gorilla compression of non-float columns
Reasons:

1. The original Gorilla paper proposed a compression schema for pairs of
   time stamps and double-precision FP values. ClickHouse's Gorilla
   codec only implements compression of the latter and it does not
   impose any data type restrictions.
   - Data types != Float* or (U)Int* (e.g. Decimal, Point etc.) are
     definitely not supposed to be used with Gorilla.
   - (U)Int* types are debatable. The paper only considers
     integers-stored-as-FP-values, a practical use case for which
     Gorilla works well. Standalone integers are not considered which
     makes them at least suspicious.

2. Achieve consistency with FPC, another specialized floating-point
   timeseries codec, which rejects non-float data.

3. On practical datasets, ZSTD is often "good enough" (**) so it should
   be okay to disincentive non-ZSTD codecs a little bit. If needed,
   Delta and DoubleDelta codecs are viable alternative for slowly
   changing (time-series-like) integer sequences.

Since on-prem and hosted users may still have Gorilla-compressed
non-float data, this combination is only deprecated for now. No warning
or error will be emitted. Users are encouraged to migrate
Gorilla-compressed non-float data to an alternative codec. It is planned
to treat Gorilla-compressed non-float columns as "suspicious" six months
after this commit (i.e. in v23.6). Even then, it will still be possible
to set "allow_suspicious_codecs = true" and read and write
Gorilla-compressed non-float data.

(*) Sec. 4.1.2, "Gorilla restricts the value element in its tuple to a
    double floating point type.", https://doi.org/10.14778/2824032.2824078

(**) https://clickhouse.com/blog/optimize-clickhouse-codecs-compression-schema
2023-01-20 17:31:16 +00:00
Han Fei
4bbe90f6b4
Merge pull request #45473 from hanfei1991/hanfei/async-inserts-doc
update docs for async insert deduplication
2023-01-20 18:25:19 +01:00
Han Fei
18a397f8c9 address comments 2023-01-20 18:09:42 +01:00
Han Fei
449ace3373
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:07:19 +01:00
Han Fei
9d87bd10ee
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:07:08 +01:00
Han Fei
badfbcb3d8
Update docs/en/operations/settings/settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:58 +01:00
Han Fei
e9c4cf46cd
Update docs/en/operations/settings/merge-tree-settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:46 +01:00
Han Fei
9d254f7d87
Update docs/en/operations/settings/merge-tree-settings.md
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-01-20 18:06:32 +01:00
robot-ch-test-poll4
2066581d8f
Merge pull request #45451 from evillique/default_granularity
Add default GRANULARITY argument for secondary indexes
2023-01-20 17:46:21 +01:00
Nikolay Degterinsky
02142596fb Add docs 2023-01-20 15:22:13 +00:00
Kruglov Pavel
28ddcc2432
Merge branch 'master' into tsv-csv-detect-header 2023-01-20 15:08:38 +01:00
Han Fei
5fc4998f10 update docs for async insert deduplication 2023-01-20 14:42:11 +01:00
Robert Schulze
4ac17d71fa
Merge pull request #45470 from ClickHouse/rs-doc-typos
Fix typos
2023-01-20 14:39:27 +01:00
Robert Schulze
3f2e4c8217
Fix typos 2023-01-20 13:20:25 +00:00
Robert Schulze
687f9c35a7
Merge pull request #45469 from ClickHouse/inv-idx-docs
Docs for inverted index
2023-01-20 14:17:58 +01:00
Robert Schulze
7e6d3163b1
Initial inverted index docs 2023-01-20 12:12:20 +00:00
Diego Nieto
a2b342bcbe
Update skipping-index.md
ADD and DROP are not such lightweight command they generate mutations to deal with the changes and they will take time to complete depending on how much data the table has.
2023-01-20 12:32:50 +01:00
Maksim Kita
758c8f2776
Merge branch 'master' into dict/remove-preallocate 2023-01-20 13:15:37 +03:00
Antonio Andelic
0ad37ad286
Merge pull request #45320 from stigsb/system_tables_volume_config
Add <storage_policy> config parameter for system logs
2023-01-20 10:27:57 +01:00
Robert Schulze
099f50c0e3
Remove redundancy 2023-01-19 18:35:19 +00:00
Robert Schulze
d8144ffafc
Merge branch 'master' into index-docs 2023-01-19 18:05:27 +01:00
Robert Schulze
c9988f4411
Mini fix 2023-01-19 17:04:17 +00:00
Robert Schulze
7355e3bc77
Docs: Beautify section on secondary index types 2023-01-19 17:00:31 +00:00
robot-clickhouse-ci-1
13d4d40568
Merge pull request #45325 from ClickHouse/download-builds
Improve manual to get binaries from CI
2023-01-19 17:49:52 +01:00
Robert Schulze
bfcb5e7915
Fix typos 2023-01-19 16:08:21 +00:00
Kruglov Pavel
9820beae68
Apply suggestions from code review
Co-authored-by: Antonio Andelic <antonio2368@users.noreply.github.com>
2023-01-19 16:11:13 +01:00
Stig Bakken
420c179b55 Add <storage_policy> config parameter for system logs 2023-01-19 10:25:28 +01:00
DanRoscigno
5d88a2b344 clarify autodetect 2023-01-18 15:51:31 -05:00
Azat Khuzhin
4366f7fb3b Remove PREALLOCATE for HASHED/SPARSE_HASHED dictionaries
It does not give significant benefit, but now, you hashed/sparse_hashed
dictionaries can be filled in parallel (#40003), using sharded
dictionaries, and this should be used instead of PREALLOCATE.

Note, that dictionaries, that had been created with PREALLOCATE will
work, but simply ignore this attribute.

Fixes: #41985 (cc @alexey-milovidov)
Reverts: #23979 (cc @kitaisreal)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-18 20:18:37 +01:00
DanRoscigno
cd3ecb9091 add support_batch_delete to mergetree docs 2023-01-18 12:16:31 -05:00
Maksim Kita
8225d2814c
Merge pull request #40003 from azat/dict-shards
Add ability to load hashed dictionaries using multiple threads
2023-01-18 13:37:10 +03:00
Robert Schulze
4f90824347
Merge remote-tracking branch 'origin/master' into query-result-cache 2023-01-17 22:49:53 +00:00
Robert Schulze
fd7f5dad1c
Docs improvements 2023-01-17 22:03:48 +00:00
Sergei Trifonov
ca0fb785f4
Merge pull request #45193 from ClickHouse/concurrency-control-docs
Add docs for concurrency control logic
2023-01-17 21:10:17 +01:00
Dale Mcdiarmid
9c4d3e0ffb link docs to blogs 2023-01-17 15:38:10 +00:00
Kruglov Pavel
96bb99f864
Merge branch 'master' into tsv-csv-detect-header 2023-01-17 15:33:02 +01:00
avogar
5bf4704e7a Support FixedSizeBinary type in Parquet/Arrow 2023-01-16 21:01:31 +00:00
Vladimir C
b80ee8df50
Merge pull request #45256 from DanRoscigno/add-grace-hash-docs 2023-01-16 18:43:19 +01:00
Kruglov Pavel
e9d6590926
Merge branch 'master' into tsv-csv-detect-header 2023-01-16 17:50:24 +01:00
Kruglov Pavel
bdb3517512
Merge pull request #45231 from Avogar/json-tuples
Insert default values in case of missing tuple elements in JSONEachRow
2023-01-16 17:49:50 +01:00
avogar
1c0941d72a Add docs and examples 2023-01-16 16:46:41 +00:00
Mikhail f. Shiryaev
08c9d81c90
Apply suggestions from code review
Co-authored-by: Igor Nikonov <954088+devcrafter@users.noreply.github.com>
2023-01-16 16:55:24 +01:00
Mikhail f. Shiryaev
1bd7279d5f
Improve manual to get binaries from CI 2023-01-16 16:45:30 +01:00
Nikolay Degterinsky
70e79de69b
Merge pull request #38252 from bharatnc/ncb/weighted-quantile-approx
add quantileInterpolatedWeighted function
2023-01-16 13:41:13 +01:00
Sema Checherinda
d746a3c4ff
Merge pull request #44480 from wineternity/issue_43333_doc
[DOC] Add support for signed arguments in range() #43333
2023-01-16 10:26:49 +01:00
Robert Schulze
099e30ef2a
Merge remote-tracking branch 'origin/master' into query-result-cache 2023-01-16 08:04:49 +00:00
Ilya Yatsishin
cf5052c77e
Merge pull request #45291 from den-crane/patch-57 2023-01-16 02:33:54 +01:00
Dan Roscigno
adca0b64d3
use markdown file instead of URL to enforce 404 checks 2023-01-15 19:31:58 -05:00
Peignon Melvyn
674a1d1877
Update json.md 2023-01-16 01:27:08 +01:00
Denny Crane
6cf603e05f
Update index.md 2023-01-15 18:40:59 -04:00
Dan Roscigno
f13fdfb431
Merge pull request #45280 from tbragin/patch-5
Update postgresql.md
2023-01-15 11:52:47 -05:00
Robert Schulze
27fe7ebd93
Cosmetics 2023-01-15 16:12:48 +00:00
Robert Schulze
bd41c74ddf
Various test, code and docs fixups 2023-01-15 13:47:34 +00:00
Alexey Milovidov
a5244f8d1d
Merge pull request #45279 from tbragin/patch-4
Update postgresql.md
2023-01-15 12:34:09 +03:00
Ilya Yatsishin
96987b7cd8
Merge pull request #45239 from Avogar/generate-random 2023-01-15 00:37:34 +01:00
Tanya Bragin
d825f50c84
Update postgresql.md
Adding Postgres blog to Postgres DB engine page

cc @DanRoscigno @gingerwizard
2023-01-14 14:39:41 -08:00
Tanya Bragin
b5773caeb2
Update postgresql.md
Adding related Postgres blog here. 

cc @gingerwizard @DanRoscigno
2023-01-14 14:34:33 -08:00
Tanya Bragin
36ec76f53d
Update clickhouse-local.md
cc @DanRoscigno @gingerwizard Adding latest clickhouse-local blog to related content on this page.
2023-01-14 07:50:10 -08:00
DanRoscigno
d0a55f6dc9 doc grace_hash algorithm for join 2023-01-13 13:17:03 -05:00
Rich Raposa
c7aad8e48b
Merge pull request #45207 from ClickHouse/add-maxintersections-to-docs
Add maxIntersections to docs
2023-01-13 10:27:59 -07:00
Robert Schulze
4ea836b87e
Revert "Revert "update function DAYOFWEEK and add new function WEEKDAY for mysql/spark compatiability""
This reverts commit e37f572c34.
2023-01-13 14:00:16 +00:00
Azat Khuzhin
99063b152f Allow to configure queue backlog of the parallel hashed dictionary loader
v2: Decrease default parallel_queue_backlog to 10000 (same speed)
v3: Rename parallel_queue_backlog to per_shard_load_backlog
v3: Rename per_shard_load_backlog to shard_load_queue_backlog
v4: Fix documentation
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:26 +01:00
Azat Khuzhin
345c422e28 Add ability to load hashed dictionaries using multiple threads
Right now dictionaries (here I will talk about only
HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED)
can load data only in one thread, since it uses one hash table that
cannot be filled from multiple threads.

And in case you have very big dictionary (i.e. 10e9 elements), it can
take a awhile to load them, especially for SPARSE_HASHED variants (and
if you have such amount of elements there, you are likely use
SPARSE_HASHED, since it requires less memory), in my env it takes ~4
hours, which is enormous amount of time.

So this patch add support of shards for dictionaries, number of shards
determine how much hash tables will use this dictionary, also, and which
is more important, how much threads it can use to load the data.

And with 16 threads this works 2x faster, not perfect though, see the
follow up patches in this series.

v0: PARTITION BY
v1: SHARDS 1
v2: SHARDS(1)
v3: tried optimized mod - logical and, but it does not gain even 10%
v4: tried squashing more (max_block_size * shards), but it does not gain even 10% either
v5: move SHARDS into layout parameters (unknown simply ignored)
v6: tune params for perf tests (to avoid too long queries)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-13 13:39:25 +01:00
Robert Schulze
15e11741cb
Cosmetics 2023-01-13 00:00:23 +00:00