Commit Graph

14794 Commits

Author SHA1 Message Date
Dmitry Kardymon
c032dee3b4 Merge remote-tracking branch 'origin/master' into ADQM-867 2023-06-04 18:45:47 +00:00
Alexey Milovidov
054b908b9c
Merge pull request #50416 from ClickHouse/revert-49656-rs/entropy-learned-hashing
Revert "Add SQL functions for Entropy Learned Hashing"
2023-06-03 06:50:48 +03:00
Alexey Milovidov
129fe9158f
Merge pull request #50486 from Algunenano/system_on_cluster
Mention ON CLUSTER options for SYSTEM queries
2023-06-03 06:48:54 +03:00
Nikita Mikhaylov
362055ce60
Merge branch 'master' into 46229-repl-clickhouse-keeper 2023-06-02 23:58:07 +02:00
DanRoscigno
87eaaa0f7b address review comments 2023-06-02 16:30:18 -04:00
pufit
a3ed86a528 Documentation 2023-06-02 14:36:44 -04:00
Dan Roscigno
a84eba6ccf
Merge pull request #50490 from DanRoscigno/docs-svg-funxtion
add svg function docs
2023-06-02 13:20:52 -04:00
DanRoscigno
9b89751948 add docs for parallel view processing 2023-06-02 12:52:06 -04:00
Kruglov Pavel
f87f98fb80
Merge pull request #47409 from Avogar/random-structure
Add new function generateRandomStructure
2023-06-02 18:50:35 +02:00
DanRoscigno
71c5b1d9c6 add svg function docs 2023-06-02 12:42:35 -04:00
Dan Roscigno
fb11f7eb6f
Merge pull request #50478 from DanRoscigno/user-settings-docs
move from server to user settings
2023-06-02 10:56:28 -04:00
Raúl Marín
dcd5579851 Mention ON CLUSTER options for SYSTEM queries 2023-06-02 16:51:04 +02:00
Robert Schulze
f0bfd44e13
Merge pull request #50466 from rschu1ze/update-function-docs
Update function docs
2023-06-02 16:20:36 +02:00
DanRoscigno
17cca6ed75 add direct join docs 2023-06-02 10:08:48 -04:00
Robert Schulze
2a4f1c8275
Fix typos 2023-06-02 14:00:57 +00:00
robot-ch-test-poll3
3a2cba89ad
Merge pull request #50479 from Klavionik/fix-json-import-example
Docs: remove needless minus sign
2023-06-02 15:54:36 +02:00
Robert Schulze
bfcd9dde61
Merge remote-tracking branch 'rschu1ze/master' into update-function-docs 2023-06-02 13:41:01 +00:00
Robert Schulze
c37a071004
Merge remote-tracking branch 'rschu1ze/master' into update-function-docs 2023-06-02 13:39:15 +00:00
Robert Schulze
12993890a8
Merge branch 'master' into revert-49656-rs/entropy-learned-hashing 2023-06-02 15:32:33 +02:00
Robert Schulze
54872f9e7e
Typos: Follow-up to #50476 2023-06-02 13:28:09 +00:00
DanRoscigno
079008058a move from server to user settings 2023-06-02 08:49:26 -04:00
Roman Vlasenko
c3a888a47b Remove needless minus sign 2023-06-02 15:41:43 +03:00
Robert Schulze
a22bb07fbd
Merge remote-tracking branch 'rschu1ze/master' into fix-typo-check-on-nested-docs 2023-06-02 12:33:16 +00:00
Robert Schulze
65cc92a78d
CI: Fix aspell on nested docs 2023-06-02 12:24:41 +00:00
Kruglov Pavel
9922053419
Merge pull request #49948 from Avogar/handshake-timeout
Add separate handshake_timeout for receiving Hello packet from replica
2023-06-02 14:07:27 +02:00
Dmitry Kardymon
fa5f890a7a Added ru function descriptions (docs) 2023-06-02 12:03:50 +00:00
Robert Schulze
572f15b2cd
Fix typo 2023-06-02 11:14:52 +00:00
Robert Schulze
f6da66cbb8
Fix some typos 2023-06-02 09:41:52 +00:00
DanRoscigno
6b2c906dfd add docs for boundingRatio 2023-06-01 19:03:35 -04:00
Dan Roscigno
bd047ed9e0
Update order-by.md
add a note that ORDER BY sorting is case sensitive.  

closes https://github.com/ClickHouse/clickhouse-docs/issues/81
2023-06-01 14:16:49 -04:00
Dan Roscigno
d302eae85e
Merge pull request #50433 from DanRoscigno/add-explain-video-to-docs
add explain video to docs
2023-06-01 10:31:38 -04:00
Dan Roscigno
c70aa9592b
Merge pull request #50419 from ClickHouse/reddit-fixes
Reddit dataset fixes
2023-06-01 10:30:56 -04:00
Dmitry Kardymon
54d526c75c Add cast type supprt to DateTimeTransformImpl 2023-06-01 14:22:57 +00:00
Robert Schulze
45b1fcdeda
Merge pull request #50315 from arenadata/ADQM-810
Add new function toLastDayOfWeek()
2023-06-01 16:19:14 +02:00
DanRoscigno
97e120ee3c add explain video to docs 2023-06-01 09:31:05 -04:00
Alexander Tokmakov
9a2b3afb7e
Revert "Fix msan issue in keyed siphash" 2023-06-01 15:02:32 +03:00
rfraposa
86e97f5f5c Update reddit-comments.md 2023-06-01 03:19:23 -06:00
Robert Schulze
08d98329b0
Revert "Add SQL functions for Entropy Learned Hashing" 2023-06-01 10:12:09 +02:00
rfraposa
bed7443181 Fixes 2023-05-31 09:31:46 -06:00
Kruglov Pavel
4d1b5cc89f
Merge branch 'master' into renamefile 2023-05-31 13:25:11 +02:00
Robert Schulze
c5343679df
Merge branch 'master' into msan-siphash-keyed 2023-05-31 09:05:24 +02:00
Robert Schulze
f171ad93bc
Merge pull request #49656 from ClickHouse/rs/entropy-learned-hashing
Add SQL functions for Entropy Learned Hashing
2023-05-31 08:26:57 +02:00
Alexey Gerasimchuk
44ba35d2c1
Merge branch 'master' into ADQM-830 2023-05-31 15:07:37 +10:00
Dan Roscigno
a01e8644c5
Settings update (#50350)
* sync system.server_settings with doc

* typos

* update DNS cache setting

* add server settings from system tbl

* catch up

* catch up

* Trigger Build

---------

Co-authored-by: Nikita Mikhaylov <mikhaylovnikitka@gmail.com>
2023-05-31 03:09:56 +02:00
robot-clickhouse
bf80f8d8f7
Merge pull request #50360 from rschu1ze/remove-obsolete-install-scripts
Remove legacy install scripts superseded by universal.sh
2023-05-31 02:57:41 +02:00
Dan Roscigno
3bc4d11b46
Merge pull request #50359 from rschu1ze/docs-advanced-install
Docs: Explain non-production / advanced install options in more detail
2023-05-30 20:32:39 -04:00
Robert Schulze
fc157b3dad
Merge branch 'master' into msan-siphash-keyed 2023-05-30 21:30:33 +02:00
Robert Schulze
d7002f8b94
Incorporate Dan's feedback 2023-05-30 17:46:21 +00:00
Robert Schulze
e0759eff97
Remove install scripts superseded by universal.sh 2023-05-30 17:28:48 +00:00
Robert Schulze
8d6244ed99
Docs: Explain non-production / advanced install options in more detail
Fixes: #49878
2023-05-30 17:13:43 +00:00
Dan Roscigno
cfa3fbf5b8
add .md extensions 2023-05-30 12:06:27 -04:00
Denny Crane
060d0118d7
Update merge-tree-settings.md 2023-05-30 12:08:24 -03:00
rfraposa
308db6784c Update environmental-sensors.md 2023-05-30 08:50:58 -05:00
Kruglov Pavel
f863dee8e7
Merge branch 'master' into renamefile 2023-05-30 12:26:40 +02:00
sslouis
a278d13043
Update first_value.md:remove redundant 's'
remove redundant 's'
2023-05-30 10:09:12 +08:00
Victor Krasnov
9a8086a377 Reorder toLastDayOf* functions in settings.md 2023-05-29 22:12:44 +00:00
Robert Schulze
aefa9df1ce
Merge branch 'master' into rs/entropy-learned-hashing 2023-05-29 22:20:50 +02:00
Alexey Milovidov
32d4221097
Merge pull request #50270 from ClickHouse/evillique-patch-1
Docs: Add PostgreSQL and ODBC related settings
2023-05-29 20:52:44 +03:00
Robert Schulze
516fa1c375
Merge branch 'master' into rs/entropy-learned-hashing 2023-05-29 17:40:14 +02:00
alekseygolub
63225b37a2
Merge branch 'ClickHouse:master' into renamefile 2023-05-29 16:52:41 +03:00
alesapin
6b7e2a0041
Merge pull request #49961 from CheSema/mem-consumption-s3-write
Introduce a limitation of async calls inside s3 write buffer, make a test that memory is bounded even when s3 is slow
2023-05-29 13:29:01 +02:00
Victor Krasnov
0ad5b9f598 Merge branch 'master' of github.com:ClickHouse/ClickHouse into ADQM-810-dev 2023-05-29 08:26:26 +00:00
Robert Schulze
47daaad172
Merge pull request #49778 from helifu/master2
Support custom 'order by' expression for system log tables
2023-05-29 10:05:14 +02:00
robot-ch-test-poll
d2f4b4d395
Merge pull request #50250 from johanngan/regexptree-doc-tweaks
Clean up regexp tree dictionary documentation
2023-05-29 02:08:47 +02:00
helifu
6e8136e6f4 Add 'settings' expression for system log tables 2023-05-28 12:59:34 +08:00
helifu
59a815e4e6 Update according to review comments 2023-05-28 12:59:34 +08:00
helifu
57e5a4aa63 Add the 'order_by' description for system log tables 2023-05-28 12:59:34 +08:00
Nikita Mikhaylov
5de6dc87ec
Merge branch 'master' into usernam3-sample-clause-links-fix 2023-05-27 14:38:34 +02:00
Alexey Gerasimchuk
12105c3dfa
Merge branch 'master' into ADQM-830 2023-05-27 07:35:05 +10:00
Sema Checherinda
7c9d2ab6d0 working with review comments 2023-05-26 19:00:13 +02:00
Sema Checherinda
b0eff95388 update doc, add profile event WriteBufferFromS3WaitInflightLimitMicroseconds 2023-05-26 18:59:21 +02:00
Robert Schulze
ac1dc7237e
Update docs 2023-05-26 15:37:33 +00:00
Robert Schulze
f49160ef4e
Build partial key positions from entire training data 2023-05-26 15:27:56 +00:00
Nikolay Degterinsky
c57fa9d496
Docs: Add PostgreSQL and ODBC related settings 2023-05-26 16:50:43 +02:00
Denny Crane
b9ef5151ad
Update settings.md 2023-05-26 11:18:25 -03:00
Denny Crane
349c92ccaf
Merge branch 'master' into patch-26 2023-05-26 10:54:10 -03:00
Ilya Yatsishin
3635a7bf5a
Merge pull request #50261 from ClickHouse/fix-doc-with-fill-by-sorting-prefix
Fixes for WITH FILL grouped by sorting prefix
2023-05-26 14:03:52 +02:00
Nikolay Degterinsky
5be47338f1
Merge pull request #50258 from den-crane/patch-27
Doc. fix typos
2023-05-26 13:17:15 +02:00
Robert Schulze
2298eeb2b2
Merge branch 'master' into rs/entropy-learned-hashing 2023-05-26 12:04:49 +02:00
Mohammad Arab Anvari
0d0e53ecc0
Update distributed.md
Fix broken link in `**See Also**` section.
2023-05-26 13:07:37 +03:30
Robert Schulze
e2a5acbeb9
Merge branch 'master' into space 2023-05-26 10:40:02 +02:00
Igor Nikonov
c6c4ded5e1
Fixes for WITH FILL grouped by sorting prefix 2023-05-26 10:12:34 +02:00
Denny Crane
08051917c7
Update named-collections.md 2023-05-25 21:35:51 -03:00
robot-ch-test-poll3
09d7512f45
Merge pull request #50248 from rschu1ze/document-system-build_options
Document system.build_options
2023-05-26 01:06:12 +02:00
Denny Crane
6b98e02857
Update settings.md 2023-05-25 19:21:32 -03:00
Robert Schulze
bc869eac7b
Merge branch 'master' into msan-siphash-keyed 2023-05-26 00:18:07 +02:00
Alexey Gerasimchuk
8d7cb7fc3b
Merge branch 'master' into ADQM-830 2023-05-26 07:49:51 +10:00
alekseygolub
36e8f13242 Added docs for feature 2023-05-25 20:10:02 +00:00
johanngan
de3b08aa5b Clean up regexp tree dictionary documentation
dictGetOrNull() relies on IDictionary::hasKeys(), which
RegExpTreeDictionary doesn't implement, so this probably never worked.
If you try to use it, an exception is thrown. The docs shouldn't
indicate that this is supported.

Also fix a markdown hyperlink in the docs.
2023-05-25 14:35:24 -05:00
Stanislav Dobrovolschii
8c4f0c763e Update SAMPLE clause documentation links 2023-05-25 18:52:38 +00:00
Robert Schulze
231d52d0e6
Document system.build_options 2023-05-25 18:29:15 +00:00
Robert Schulze
c4f91a1c45
Merge branch 'master' into space 2023-05-25 19:56:20 +02:00
Robert Schulze
ad4a21034f
Fix msan issue in keyed siphash
Issue:
https://s3.amazonaws.com/clickhouse-test-reports/0/ffdd91669471f4934704f98f0191524496b4e85b/fuzzer_astfuzzermsan/report.html

Repro:
SELECT hex(sipHash128ReferenceKeyed((toUInt64(2), toUInt64(-9223372036854775807)))) GROUP BY (toUInt64(506097522914230528), toUInt64(now64(2, NULL + NULL), 1084818905618843912)), toUInt64(2), NULL + NULL, char(-2147483649, 1)

Minimal repro:
SELECT sipHash64Keyed((2::UInt64, toUInt64(2)), 4) GROUP BY toUInt64(2)
2023-05-25 17:52:03 +00:00
Sema Checherinda
23f894b995
Merge pull request #49777 from helifu/master1
Add 'initial_query_id' field for system.processors_profile_log
2023-05-25 12:55:32 +02:00
Sema Checherinda
3329a8428d
Merge pull request #49779 from helifu/master3
Add 'partitions' field for system.query_log
2023-05-25 12:51:40 +02:00
Igor Nikonov
1c0b02c3c4
Merge pull request #49503 from ClickHouse/fill_with_by_sorting_prefix_2
WITH FILL by sorting prefix
2023-05-25 12:37:40 +02:00
Robert Schulze
4f5cf656eb
Minor adjustment of clickhouse-client/local parameter docs
Related to #50210
2023-05-25 09:36:03 +00:00
Alexey Gerasimchuck
4eb944fef1 minor changes in documentation 2023-05-25 08:05:11 +00:00
Alexey Gerasimchuck
75791d7a63 Added input_format_csv_trim_whitespaces parameter 2023-05-25 07:51:32 +00:00
helifu
515d30caea Add test case. 2023-05-25 09:37:02 +08:00
Sorck
034a8bd6b3
Fix: typo in last_value.md (#50148)
Correct typo in page title

Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-05-24 22:14:01 +02:00
Dan Roscigno
0219f78630
Testing algolia index 2023-05-24 12:40:12 -04:00
DanRoscigno
d4927e4fe5 add docs 2023-05-24 10:51:43 -04:00
Dan Roscigno
9b014eb1d2
Merge pull request #49512 from DanRoscigno/polygon-docs
Add docs for polygon functions
2023-05-24 10:37:01 -04:00
DanRoscigno
9f349e2102 add params 2023-05-24 10:35:20 -04:00
Dan Roscigno
c5b521498a
Merge pull request #50191 from den-crane/patch-69
Doc. Remove excessive tuple from ip_trie description
2023-05-24 10:15:28 -04:00
Denny Crane
8a00be69b3
Update index.md 2023-05-24 10:40:33 -03:00
Dan Roscigno
9082e7feec
Merge branch 'master' into url-function-docs 2023-05-24 09:03:01 -04:00
Dan Roscigno
c2f6999aca
Merge pull request #50157 from den-crane/Doc/greatest_least
Doc. Move least/greatest to conditional-functions
2023-05-24 09:01:58 -04:00
Dan Roscigno
c53bdc6927
Merge branch 'master' into polygon-docs 2023-05-24 08:56:46 -04:00
helifu
4724745b4c Add the 'partitions' field description in docs 2023-05-24 20:42:31 +08:00
Igor Nikonov
2f5ed81e0d
Merge branch 'master' into fill_with_by_sorting_prefix_2 2023-05-24 14:40:44 +02:00
Victor Krasnov
99a7967f11 Amend the dicumentation with regard to toLastDayOfWeek 2023-05-24 11:58:39 +00:00
Victor Krasnov
3bce2bbc37 Fix settings.md in regard to enable_extended_results_for_datetime_functions 2023-05-24 11:40:41 +00:00
Kruglov Pavel
9545100c9e
Merge pull request #45427 from attack204/urlCluster
Add urlCluster table function and refactor all *Cluster table functions
2023-05-24 13:32:56 +02:00
DanRoscigno
48080468ce add description 2023-05-23 19:13:11 -04:00
DanRoscigno
3edffa3ced add examples 2023-05-23 19:09:32 -04:00
Robert Schulze
889489b02e
Merge branch 'master' into space 2023-05-23 23:18:19 +02:00
Igor Nikonov
e9c86527b0
Merge branch 'master' into fill_with_by_sorting_prefix_2 2023-05-23 22:58:21 +02:00
Igor Nikonov
fbeba9b591 Documentation 2023-05-23 22:55:58 +02:00
Denny Crane
4a9b269cc4 move least/greatest to conditional-functions 2023-05-23 16:09:54 -03:00
Denny Crane
483a3cc5b1
Update comparison-functions.md 2023-05-23 15:59:08 -03:00
DanRoscigno
fcd6cb1f80 add descriptions 2023-05-23 12:53:53 -04:00
zy-kkk
90ee85d33a
Errata build osx zh doc (#50054) 2023-05-23 17:53:27 +02:00
DanRoscigno
f9f98ac503 note option 2023-05-23 09:58:59 -04:00
Dan Roscigno
fd62a1473c
Update build instructions
I am testing the build on Ubuntu 22.04 and found that I needed to add some prereqs.
2023-05-23 09:35:23 -04:00
Dan Roscigno
0c3392662c
Merge branch 'master' into url-function-docs 2023-05-23 08:42:13 -04:00
Robert Schulze
91bc0fad1b
Merge pull request #50138 from rschu1ze/docs-clickhouse-local
Docs: Update clickhouse-local arguments
2023-05-23 14:19:09 +02:00
Robert Schulze
b1a3b8536f
Docs: Update clickhouse-local arguments
Follow-up to #49870
2023-05-23 11:55:27 +00:00
avogar
3c1aeaaa79 Change default value of handshake_timeout to 10 sec, fix possible use-after-free 2023-05-23 11:39:40 +00:00
Kruglov Pavel
136c3caf03
Merge branch 'master' into handshake-timeout 2023-05-23 11:53:54 +02:00
Alexey Gerasimchuk
30f3b3ba04
Merge branch 'master' into ADQM-808 2023-05-23 17:03:54 +10:00
Robert Schulze
f4c73e94d2
Merge pull request #49989 from arenadata/ADQM-811
Add support of Date|Date32 arguments to the toUnixTimestamp() function
2023-05-23 08:55:56 +02:00
Alexey Gerasimchuk
df751f1bca
Merge branch 'master' into ADQM-808 2023-05-23 13:43:18 +10:00
Timur Solodovnikov
1f0250f730 Updating url table functions docs 2023-05-22 13:00:38 -07:00
avogar
4f85d6a1bb Merge branch 'master' of github.com:ClickHouse/ClickHouse into random-structure 2023-05-22 19:43:24 +00:00
avogar
88e4c93abc Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-22 19:19:57 +00:00
Dan Roscigno
bb91e3ac2e
Merge pull request #50106 from ClickHouse/DanRoscigno-patch-5
Dan roscigno patch 5
2023-05-22 13:41:03 -04:00
DanRoscigno
eb7b5a5bb7 add more info to NLP docs 2023-05-22 13:02:39 -04:00
Dan Roscigno
ebff47ad3a
Update nlp-functions.md 2023-05-22 12:14:23 -04:00
Nikolay Degterinsky
d4b89cb643
Merge pull request #49356 from Ziy1-Tan/vcol
Support for `_path` and `_file` virtual columns for table function `url`.
2023-05-22 18:10:32 +02:00
Robert Schulze
df436b2cd4
Spark compatibility: Add new function space() 2023-05-22 14:52:51 +00:00
Kruglov Pavel
b5cad024e0
Merge branch 'master' into urlCluster 2023-05-22 14:59:34 +02:00
Victor Krasnov
a7b641134a Follow the reviewer's recommendations regarding the documentation 2023-05-22 08:28:11 +00:00
lgbo-ustc
cc93f9ab52 update tests for flaky test 2023-05-22 10:17:41 +08:00
Han Fei
2625696591
Merge branch 'master' into hanfei/regexp-doc 2023-05-21 23:42:01 +02:00
Robert Schulze
2a9ff30a7f
Merge pull request #49380 from azat/dict/hashed-memory
Improve memory usage and speed of SPARSE_HASHED/HASHED dictionaries
2023-05-21 15:46:41 +02:00
Robert Schulze
491cf8b6e1
Fix minor mistakes 2023-05-21 13:43:05 +00:00
Robert Schulze
9d9d4e3d62
Some fixups 2023-05-21 13:40:52 +00:00
Robert Schulze
312f751503
Uppercase remaining SQL keywords 2023-05-21 13:08:55 +00:00
Yusuke Tanaka
2444e93ad7
Fix broken link to async_insert_deduplicate in doc 2023-05-21 14:42:17 +09:00
Alexey Milovidov
2323542e47
Merge pull request #50022 from ClickHouse/geo-types-production-ready
Geo types are production ready
2023-05-20 02:02:23 +03:00
Dan Roscigno
1ded6d6409
Update grant.md 2023-05-19 16:53:26 -04:00
Dan Roscigno
ceb9b8cbc8
Merge branch 'master' into allow_experimental_parallel_reading_from_replicas 2023-05-19 14:25:07 -04:00
Dan Roscigno
1ef93e85aa
Merge branch 'master' into hanfei/regexp-doc 2023-05-19 14:23:23 -04:00
Dan Roscigno
8745b8196d
Merge pull request #49987 from den-crane/patch-67
Doc. Fix in partition description, for consistency with other documentation.
2023-05-19 14:18:33 -04:00
Dan Roscigno
75727bf072
Merge pull request #49967 from DanRoscigno/docs-ddl-named-collections
Docs ddl named collections
2023-05-19 14:16:22 -04:00
DanRoscigno
4260e86b1a fix typo 2023-05-19 13:00:18 -04:00
Sergei Trifonov
90872c2671
Update query_log.md
Fix links to data types
2023-05-19 17:22:37 +02:00
DanRoscigno
ff63a73cf5 review 2023-05-19 11:03:53 -04:00
DanRoscigno
10a382bd12 add ALTER and DROP 2023-05-19 10:49:43 -04:00
mateng915
5237dd0245
New system table zookeeper connection (#45245)
* Feature: Support new system table to show which zookeeper node be connected

Description:
============
Currently we have no place to check which zk node be connected otherwise using
lsof command. It not convenient

Solution:
=========
Implemented a new system table, system.zookeeper_host when CK Server has zk
this table will show the zk node dir which connected by current CK server

Noted: This table can support multi-zookeeper cluster scenario.

* fixed review comments

* added test case

* update test cases

* remove unused code

* fixed review comments and removed unused code

* updated test cases for print host, port and is_expired

* modify the code comments

* fixed CI Failed

* fixed code style check failure

* updated test cases by added Tags

* update test reference

* update test cases

* added system.zookeeper_connection doc

* Update docs/en/operations/system-tables/zookeeper_connection.md

* Update docs/en/operations/system-tables/zookeeper_connection.md

* Update docs/en/operations/system-tables/zookeeper_connection.md

---------

Co-authored-by: Alexander Tokmakov <tavplubix@gmail.com>
2023-05-19 17:06:43 +03:00
Alexey Milovidov
f5506210d6 Geo types are production ready 2023-05-19 12:43:55 +02:00
Azat Khuzhin
2b240d3721 Improve documentation for HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Azat Khuzhin
2996b38606 Add ability to configure maximum load factor for the HASHED/SPARSE_HASHED layout
As it turns out, HashMap/PackedHashMap works great even with max load
factor of 0.99. By "great" I mean it least it works faster then
google sparsehash, and not to mention it's friendliness to the memory
allocator (it has zero fragmentation since it works with a continuious
memory region, in comparison to the sparsehash that doing lots of
realloc, which jemalloc does not like, due to it's slabs).

Here is a table of different setups:

settings                         | load (sec) | read (sec) | read (million rows/s) | bytes_allocated | RSS
-                                | -          | -          | -                     | -               | -
HASHED upstream                  | -          | -          | -                     | -               | 35GiB
SPARSE_HASHED upstream           | -          | -          | -                     | -               | 26GiB
-                                | -          | -          | -                     | -               | -
sparse_hash_map glibc hashbench  | -          | -          | -                     | -               | 17.5GiB
sparse_hash_map packed allocator | 101.878    | 231.48     | 4.32                  | -               | 17.7GiB
PackedHashMap 0.5                | 15.514     | 42.35      | 23.61                 | 20GiB           | 22GiB
hashed 0.95                      | 34.903     | 115.615    | 8.65                  | 16GiB           | 18.7GiB
**PackedHashMap 0.95**           | **93.6**   | **19.883** | **10.68**             | **10GiB**       | **12.8GiB**
PackedHashMap 0.99               | 26.113     | 83.6       | 11.96                 | 10GiB           | 12.3GiB

As it shows, PackedHashMap with 0.95 max_load_factor, eats 2.6x less
memory then SPARSE_HASHED in upstream, and it also 2x faster for read!

v2: fix grower
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Denny Crane
94fe224935
Update partition.md 2023-05-18 10:06:59 -03:00
Victor Krasnov
3c68d63213 Amend the toUnixTimestamp function documentation 2023-05-18 09:11:48 +00:00
libin
d294ecbc16
Update grant.md
docs: Modifying grant example
2023-05-18 15:50:19 +08:00
Alexey Gerasimchuk
1fb9e36b81
Merge branch 'master' into ADQM-808 2023-05-18 07:59:02 +10:00
Rich Raposa
03b5bfe218
Merge pull request #49968 from ClickHouse/reddit
Add Reddit comments to datasets
2023-05-17 15:26:29 -06:00
rfraposa
6a136897e3 Create reddit-comments.md 2023-05-17 13:23:53 -06:00
Han Fei
549af4d351 address comments 2023-05-17 21:23:32 +02:00
DanRoscigno
a1fc96953f reorder 2023-05-17 14:48:16 -04:00
Dan Roscigno
addc0c0ece
Merge branch 'master' into allow_experimental_parallel_reading_from_replicas 2023-05-17 13:20:14 -04:00
avogar
846804fed0 Add separate handshake_timeout for receiving Hello packet from replica 2023-05-17 11:39:04 +00:00
Alexey Gerasimchuck
253fadb676 Updated documentation for --multiquery 2023-05-17 03:45:09 +00:00
Thom O'Connor
15cb627688
Update postgresql.md
The type cannot be optional, so we need to be clear in the docs that [type1] [type2] cannot be in square brackets, so removed them
2023-05-16 16:35:41 -06:00
DanRoscigno
64bf477ff2 add DDL 2023-05-16 14:58:04 -04:00
Kruglov Pavel
4530f38fdf
Merge branch 'master' into urlCluster 2023-05-16 16:21:23 +02:00
Han Fei
7df0e9d933 fix broken link 2023-05-16 15:33:08 +02:00
Han Fei
a40d86b921
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:42 +02:00
Han Fei
ed5906f15d
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:31 +02:00
Han Fei
31b8e3c489
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:24 +02:00
Han Fei
e4e473ef30
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:14 +02:00
Han Fei
29aa960377 refine docs for regexp tree dictionary 2023-05-16 09:07:35 +02:00
Robert Schulze
59bc3e25be
Merge pull request #49824 from AVMusorin/allow-alias-column-kafka
KafkaEngine: Allow usage of Alias column type
2023-05-15 23:40:03 +02:00
robot-ch-test-poll4
e58d1821eb
Merge pull request #49788 from lesandie/master
Reworked Local File Cache documentation
2023-05-15 21:38:43 +02:00
Vitaly Baranov
bf3336a84e
Merge pull request #47640 from ilejn/row_policy_template
Row policy for database
2023-05-15 20:05:15 +02:00
avogar
f9e0eb47d7 Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-15 16:35:24 +00:00
Diego Nieto
bf201a09b7
Update docs/en/operations/storing-data.md
@kssenii comments

Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>
2023-05-15 15:43:39 +02:00
AVMusorin
418a61a68c
Allow using Alias column type for KafkaEngine
```
create table kafka
(
 a UInt32,
 a_str String Alias toString(a)
) engine = Kafka;

create table data
(
  a UInt32;
  a_str String
) engine = MergeTree
order by tuple();

create materialized view data_mv to data
(
  a UInt32,
  a_str String
) as
select a, a_str from kafka;
```
Alias type works as expected in comparison with MATERIALIZED/EPHEMERAL
or column with default expression.

Ref: https://github.com/ClickHouse/ClickHouse/pull/47138

Co-authored-by: Azat Khuzhin <a3at.mail@gmail.com>
2023-05-15 15:39:58 +02:00
avogar
eef0a433e5 Add note about possible huge output 2023-05-15 11:24:18 +00:00
avogar
aa7ab1f23b Fix comments 2023-05-15 11:20:03 +00:00
Diego Nieto
52fdc49ef3
Merge branch 'master' into master 2023-05-15 10:28:31 +02:00
robot-clickhouse
bec4edad65
Merge pull request #49856 from darkkeks/fix-date-time-functions-toc
[docs] Remove "example" section from date-time-functions page toc
2023-05-14 23:43:00 +02:00
Ilya Golshtein
9ef610040f row_policy_template - table * is respected 2023-05-13 22:30:02 +00:00
Ilya Golshtein
7fa6ea4ccc row policy template: tests and code cleanup, code review changes 2023-05-13 22:30:02 +00:00
Nikolay Degterinsky
a2e7c77fe2
Merge pull request #49860 from ucasfl/doc
update doc
2023-05-13 23:26:40 +02:00
flynn
f775a466f7 update doc 2023-05-13 16:12:51 +00:00
Robert Schulze
c4f7c3daa1
Merge branch 'master' into rs/entropy-learned-hashing 2023-05-13 17:33:12 +02:00