robot-ch-test-poll
d2f4b4d395
Merge pull request #50250 from johanngan/regexptree-doc-tweaks
...
Clean up regexp tree dictionary documentation
2023-05-29 02:08:47 +02:00
Nikita Mikhaylov
5de6dc87ec
Merge branch 'master' into usernam3-sample-clause-links-fix
2023-05-27 14:38:34 +02:00
Ilya Yatsishin
3635a7bf5a
Merge pull request #50261 from ClickHouse/fix-doc-with-fill-by-sorting-prefix
...
Fixes for WITH FILL grouped by sorting prefix
2023-05-26 14:03:52 +02:00
Igor Nikonov
c6c4ded5e1
Fixes for WITH FILL grouped by sorting prefix
2023-05-26 10:12:34 +02:00
johanngan
de3b08aa5b
Clean up regexp tree dictionary documentation
...
dictGetOrNull() relies on IDictionary::hasKeys(), which
RegExpTreeDictionary doesn't implement, so this probably never worked.
If you try to use it, an exception is thrown. The docs shouldn't
indicate that this is supported.
Also fix a markdown hyperlink in the docs.
2023-05-25 14:35:24 -05:00
Stanislav Dobrovolschii
8c4f0c763e
Update SAMPLE clause documentation links
2023-05-25 18:52:38 +00:00
Robert Schulze
c4f91a1c45
Merge branch 'master' into space
2023-05-25 19:56:20 +02:00
Igor Nikonov
1c0b02c3c4
Merge pull request #49503 from ClickHouse/fill_with_by_sorting_prefix_2
...
WITH FILL by sorting prefix
2023-05-25 12:37:40 +02:00
Sorck
034a8bd6b3
Fix: typo in last_value.md ( #50148 )
...
Correct typo in page title
Co-authored-by: Dan Roscigno <dan@roscigno.com>
2023-05-24 22:14:01 +02:00
Dan Roscigno
0219f78630
Testing algolia index
2023-05-24 12:40:12 -04:00
Dan Roscigno
9b014eb1d2
Merge pull request #49512 from DanRoscigno/polygon-docs
...
Add docs for polygon functions
2023-05-24 10:37:01 -04:00
DanRoscigno
9f349e2102
add params
2023-05-24 10:35:20 -04:00
Dan Roscigno
c5b521498a
Merge pull request #50191 from den-crane/patch-69
...
Doc. Remove excessive tuple from ip_trie description
2023-05-24 10:15:28 -04:00
Denny Crane
8a00be69b3
Update index.md
2023-05-24 10:40:33 -03:00
Dan Roscigno
9082e7feec
Merge branch 'master' into url-function-docs
2023-05-24 09:03:01 -04:00
Dan Roscigno
c2f6999aca
Merge pull request #50157 from den-crane/Doc/greatest_least
...
Doc. Move least/greatest to conditional-functions
2023-05-24 09:01:58 -04:00
Dan Roscigno
c53bdc6927
Merge branch 'master' into polygon-docs
2023-05-24 08:56:46 -04:00
Igor Nikonov
2f5ed81e0d
Merge branch 'master' into fill_with_by_sorting_prefix_2
2023-05-24 14:40:44 +02:00
Kruglov Pavel
9545100c9e
Merge pull request #45427 from attack204/urlCluster
...
Add urlCluster table function and refactor all *Cluster table functions
2023-05-24 13:32:56 +02:00
DanRoscigno
48080468ce
add description
2023-05-23 19:13:11 -04:00
DanRoscigno
3edffa3ced
add examples
2023-05-23 19:09:32 -04:00
Robert Schulze
889489b02e
Merge branch 'master' into space
2023-05-23 23:18:19 +02:00
Igor Nikonov
e9c86527b0
Merge branch 'master' into fill_with_by_sorting_prefix_2
2023-05-23 22:58:21 +02:00
Igor Nikonov
fbeba9b591
Documentation
2023-05-23 22:55:58 +02:00
Denny Crane
4a9b269cc4
move least/greatest to conditional-functions
2023-05-23 16:09:54 -03:00
Denny Crane
483a3cc5b1
Update comparison-functions.md
2023-05-23 15:59:08 -03:00
DanRoscigno
fcd6cb1f80
add descriptions
2023-05-23 12:53:53 -04:00
Dan Roscigno
0c3392662c
Merge branch 'master' into url-function-docs
2023-05-23 08:42:13 -04:00
Robert Schulze
f4c73e94d2
Merge pull request #49989 from arenadata/ADQM-811
...
Add support of Date|Date32 arguments to the toUnixTimestamp() function
2023-05-23 08:55:56 +02:00
Timur Solodovnikov
1f0250f730
Updating url table functions docs
2023-05-22 13:00:38 -07:00
avogar
88e4c93abc
Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster
2023-05-22 19:19:57 +00:00
Dan Roscigno
bb91e3ac2e
Merge pull request #50106 from ClickHouse/DanRoscigno-patch-5
...
Dan roscigno patch 5
2023-05-22 13:41:03 -04:00
DanRoscigno
eb7b5a5bb7
add more info to NLP docs
2023-05-22 13:02:39 -04:00
Dan Roscigno
ebff47ad3a
Update nlp-functions.md
2023-05-22 12:14:23 -04:00
Nikolay Degterinsky
d4b89cb643
Merge pull request #49356 from Ziy1-Tan/vcol
...
Support for `_path` and `_file` virtual columns for table function `url`.
2023-05-22 18:10:32 +02:00
Robert Schulze
df436b2cd4
Spark compatibility: Add new function space()
2023-05-22 14:52:51 +00:00
Kruglov Pavel
b5cad024e0
Merge branch 'master' into urlCluster
2023-05-22 14:59:34 +02:00
Victor Krasnov
a7b641134a
Follow the reviewer's recommendations regarding the documentation
2023-05-22 08:28:11 +00:00
Han Fei
2625696591
Merge branch 'master' into hanfei/regexp-doc
2023-05-21 23:42:01 +02:00
Robert Schulze
2a9ff30a7f
Merge pull request #49380 from azat/dict/hashed-memory
...
Improve memory usage and speed of SPARSE_HASHED/HASHED dictionaries
2023-05-21 15:46:41 +02:00
Robert Schulze
491cf8b6e1
Fix minor mistakes
2023-05-21 13:43:05 +00:00
Robert Schulze
9d9d4e3d62
Some fixups
2023-05-21 13:40:52 +00:00
Robert Schulze
312f751503
Uppercase remaining SQL keywords
2023-05-21 13:08:55 +00:00
Alexey Milovidov
2323542e47
Merge pull request #50022 from ClickHouse/geo-types-production-ready
...
Geo types are production ready
2023-05-20 02:02:23 +03:00
Dan Roscigno
1ded6d6409
Update grant.md
2023-05-19 16:53:26 -04:00
Dan Roscigno
1ef93e85aa
Merge branch 'master' into hanfei/regexp-doc
2023-05-19 14:23:23 -04:00
Alexey Milovidov
f5506210d6
Geo types are production ready
2023-05-19 12:43:55 +02:00
Azat Khuzhin
2b240d3721
Improve documentation for HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED
...
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Azat Khuzhin
2996b38606
Add ability to configure maximum load factor for the HASHED/SPARSE_HASHED layout
...
As it turns out, HashMap/PackedHashMap works great even with max load
factor of 0.99. By "great" I mean it least it works faster then
google sparsehash, and not to mention it's friendliness to the memory
allocator (it has zero fragmentation since it works with a continuious
memory region, in comparison to the sparsehash that doing lots of
realloc, which jemalloc does not like, due to it's slabs).
Here is a table of different setups:
settings | load (sec) | read (sec) | read (million rows/s) | bytes_allocated | RSS
- | - | - | - | - | -
HASHED upstream | - | - | - | - | 35GiB
SPARSE_HASHED upstream | - | - | - | - | 26GiB
- | - | - | - | - | -
sparse_hash_map glibc hashbench | - | - | - | - | 17.5GiB
sparse_hash_map packed allocator | 101.878 | 231.48 | 4.32 | - | 17.7GiB
PackedHashMap 0.5 | 15.514 | 42.35 | 23.61 | 20GiB | 22GiB
hashed 0.95 | 34.903 | 115.615 | 8.65 | 16GiB | 18.7GiB
**PackedHashMap 0.95** | **93.6** | **19.883** | **10.68** | **10GiB** | **12.8GiB**
PackedHashMap 0.99 | 26.113 | 83.6 | 11.96 | 10GiB | 12.3GiB
As it shows, PackedHashMap with 0.95 max_load_factor, eats 2.6x less
memory then SPARSE_HASHED in upstream, and it also 2x faster for read!
v2: fix grower
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Denny Crane
94fe224935
Update partition.md
2023-05-18 10:06:59 -03:00