Commit Graph

3942 Commits

Author SHA1 Message Date
Dan Roscigno
bb91e3ac2e
Merge pull request #50106 from ClickHouse/DanRoscigno-patch-5
Dan roscigno patch 5
2023-05-22 13:41:03 -04:00
DanRoscigno
eb7b5a5bb7 add more info to NLP docs 2023-05-22 13:02:39 -04:00
Dan Roscigno
ebff47ad3a
Update nlp-functions.md 2023-05-22 12:14:23 -04:00
Nikolay Degterinsky
d4b89cb643
Merge pull request #49356 from Ziy1-Tan/vcol
Support for `_path` and `_file` virtual columns for table function `url`.
2023-05-22 18:10:32 +02:00
Robert Schulze
df436b2cd4
Spark compatibility: Add new function space() 2023-05-22 14:52:51 +00:00
Kruglov Pavel
b5cad024e0
Merge branch 'master' into urlCluster 2023-05-22 14:59:34 +02:00
Victor Krasnov
a7b641134a Follow the reviewer's recommendations regarding the documentation 2023-05-22 08:28:11 +00:00
Han Fei
2625696591
Merge branch 'master' into hanfei/regexp-doc 2023-05-21 23:42:01 +02:00
Robert Schulze
2a9ff30a7f
Merge pull request #49380 from azat/dict/hashed-memory
Improve memory usage and speed of SPARSE_HASHED/HASHED dictionaries
2023-05-21 15:46:41 +02:00
Robert Schulze
491cf8b6e1
Fix minor mistakes 2023-05-21 13:43:05 +00:00
Robert Schulze
9d9d4e3d62
Some fixups 2023-05-21 13:40:52 +00:00
Robert Schulze
312f751503
Uppercase remaining SQL keywords 2023-05-21 13:08:55 +00:00
Alexey Milovidov
2323542e47
Merge pull request #50022 from ClickHouse/geo-types-production-ready
Geo types are production ready
2023-05-20 02:02:23 +03:00
Dan Roscigno
1ded6d6409
Update grant.md 2023-05-19 16:53:26 -04:00
Dan Roscigno
1ef93e85aa
Merge branch 'master' into hanfei/regexp-doc 2023-05-19 14:23:23 -04:00
Alexey Milovidov
f5506210d6 Geo types are production ready 2023-05-19 12:43:55 +02:00
Azat Khuzhin
2b240d3721 Improve documentation for HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Azat Khuzhin
2996b38606 Add ability to configure maximum load factor for the HASHED/SPARSE_HASHED layout
As it turns out, HashMap/PackedHashMap works great even with max load
factor of 0.99. By "great" I mean it least it works faster then
google sparsehash, and not to mention it's friendliness to the memory
allocator (it has zero fragmentation since it works with a continuious
memory region, in comparison to the sparsehash that doing lots of
realloc, which jemalloc does not like, due to it's slabs).

Here is a table of different setups:

settings                         | load (sec) | read (sec) | read (million rows/s) | bytes_allocated | RSS
-                                | -          | -          | -                     | -               | -
HASHED upstream                  | -          | -          | -                     | -               | 35GiB
SPARSE_HASHED upstream           | -          | -          | -                     | -               | 26GiB
-                                | -          | -          | -                     | -               | -
sparse_hash_map glibc hashbench  | -          | -          | -                     | -               | 17.5GiB
sparse_hash_map packed allocator | 101.878    | 231.48     | 4.32                  | -               | 17.7GiB
PackedHashMap 0.5                | 15.514     | 42.35      | 23.61                 | 20GiB           | 22GiB
hashed 0.95                      | 34.903     | 115.615    | 8.65                  | 16GiB           | 18.7GiB
**PackedHashMap 0.95**           | **93.6**   | **19.883** | **10.68**             | **10GiB**       | **12.8GiB**
PackedHashMap 0.99               | 26.113     | 83.6       | 11.96                 | 10GiB           | 12.3GiB

As it shows, PackedHashMap with 0.95 max_load_factor, eats 2.6x less
memory then SPARSE_HASHED in upstream, and it also 2x faster for read!

v2: fix grower
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Denny Crane
94fe224935
Update partition.md 2023-05-18 10:06:59 -03:00
Victor Krasnov
3c68d63213 Amend the toUnixTimestamp function documentation 2023-05-18 09:11:48 +00:00
Han Fei
549af4d351 address comments 2023-05-17 21:23:32 +02:00
Kruglov Pavel
4530f38fdf
Merge branch 'master' into urlCluster 2023-05-16 16:21:23 +02:00
Han Fei
7df0e9d933 fix broken link 2023-05-16 15:33:08 +02:00
Han Fei
a40d86b921
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:42 +02:00
Han Fei
ed5906f15d
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:31 +02:00
Han Fei
31b8e3c489
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:24 +02:00
Han Fei
e4e473ef30
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:14 +02:00
Han Fei
29aa960377 refine docs for regexp tree dictionary 2023-05-16 09:07:35 +02:00
Vitaly Baranov
bf3336a84e
Merge pull request #47640 from ilejn/row_policy_template
Row policy for database
2023-05-15 20:05:15 +02:00
avogar
f9e0eb47d7 Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-15 16:35:24 +00:00
robot-clickhouse
bec4edad65
Merge pull request #49856 from darkkeks/fix-date-time-functions-toc
[docs] Remove "example" section from date-time-functions page toc
2023-05-14 23:43:00 +02:00
Ilya Golshtein
7fa6ea4ccc row policy template: tests and code cleanup, code review changes 2023-05-13 22:30:02 +00:00
Nikolay Degterinsky
a2e7c77fe2
Merge pull request #49860 from ucasfl/doc
update doc
2023-05-13 23:26:40 +02:00
flynn
f775a466f7 update doc 2023-05-13 16:12:51 +00:00
Robert Schulze
c4f7c3daa1
Merge branch 'master' into rs/entropy-learned-hashing 2023-05-13 17:33:12 +02:00
Robert Schulze
0d9a6b5545
Update QPL docs 2023-05-13 13:26:47 +00:00
darkkeks
2b793e3a14 [docs] Remove "example" section from date-time-functions page toc 2023-05-13 16:22:17 +03:00
Alexey Milovidov
6df457dcaf
Merge pull request #49811 from jinjunzh/iaadeflate_swpath_compat
qpl_deflate codec lower the minimum SIMD version to SSE 4.2
2023-05-13 04:21:54 +03:00
Robert Schulze
f011421f32
Merge pull request #49825 from den-crane/patch-66
Doc. Note about why Google CityHash <> cityHash64
2023-05-12 20:01:58 +02:00
Yakko Majuri
eb6cbe63ee
Minor correction to example in COMMENT COLUMN 2023-05-12 11:54:53 -03:00
Robert Schulze
5133597661
+m 2023-05-12 16:51:47 +02:00
Robert Schulze
9d34233e65
Merge branch 'master' into iaadeflate_swpath_compat 2023-05-12 16:39:17 +02:00
Robert Schulze
5235f52de3
Small rewrite, removed note about farm hash (it seemed fragile: https://github.com/ClickHouse/ClickHouse/issues/8354#issuecomment-974808920) 2023-05-12 16:38:32 +02:00
jinjunzh
e72a6fe973 qpl-deflate codec lower the minimum SIMD version to SSE 4.2 2023-05-12 09:56:01 -04:00
DanRoscigno
0b9e575588 uplevel UDFs and link 2023-05-12 08:05:38 -04:00
Denny Crane
051456ba92
Update hash-functions.md 2023-05-12 08:33:27 -03:00
Denny Crane
47b27ca5fb
Update hash-functions.md 2023-05-12 08:23:02 -03:00
Dan Roscigno
56a563f2bd
Merge pull request #49810 from DanRoscigno/doc-greatest-least
add greatest() and least() docs
2023-05-12 05:52:01 -04:00
Han Fei
07341a04fd
Merge pull request #49666 from hanfei1991/hanfei/regexp-dict-read
Support `dictionary` table function for `RegExpTreeDictionary`
2023-05-12 11:40:46 +02:00
DanRoscigno
11f78fed2b fix markdown spaces 2023-05-11 21:40:47 -04:00
DanRoscigno
a14ad025ee add greatest and least docs 2023-05-11 21:33:36 -04:00
Han Fei
ef74e64336 address comments 2023-05-11 22:18:08 +02:00
Robert Schulze
37c1b1aa58
Some fixups 2023-05-11 18:49:05 +00:00
avogar
3ee8de792c Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-11 12:46:20 +00:00
vdimir
1b7f54e886
Merge pull request #46467 from bigo-sg/first_value 2023-05-10 14:56:14 +02:00
Roman Vasin
632c0eeefc
Add support of temporary tables in FROM part of ATTACH|REPLACE PARTITION FROM (#49436)
* Add temp tables after FROM

* Resolve works fine with temp and non-temp tables

* Add usage of temporary table in  00626_replace_partition_from_table test

* Update documentation

* Add tests for ReplicatedMergeTree

* Move tests into 02731_replace_partition_from_temporary_table
2023-05-10 15:25:12 +03:00
Robert Schulze
bf3eacf697
Merge pull request #49730 from ClickHouse/rs/docs-fix-sidebar
Docs: Fix sidebar label for dictionary table function
2023-05-10 13:54:29 +02:00
Robert Schulze
405e371aca
Docs: Fix sidebar label for dictionary table function 2023-05-10 11:18:15 +00:00
alekar
605991d4d5
Merge branch 'master' into 48716 2023-05-09 13:38:11 -07:00
larryluogit
2341c5e6a3
Merge branch 'master' into saveIntervals 2023-05-08 12:49:21 -04:00
Robert Schulze
93a2b44702
Add docs 2023-05-08 13:11:21 +00:00
Robert Schulze
d8d2b0af76
Merge pull request #49466 from ucasfl/str_to_map
add alias str_to_map and mapFromString for extractKeyValuePairs
2023-05-08 10:11:06 +02:00
Dan Roscigno
0830153a52
Merge pull request #49501 from ivan-klass/patch-1
Note for multiple arrayJoin of same array expression
2023-05-07 20:17:09 -04:00
Robert Schulze
3cadcaeaf0
Add new aliases to docs 2023-05-07 19:24:49 +00:00
Robert Schulze
d5f08ba87b
Move extractKeyValuePairs*() docs into maps docs 2023-05-07 19:22:29 +00:00
Robert Schulze
d614bde1c7
Fixup docs of extractKeyValuePairs*() 2023-05-07 19:20:16 +00:00
Robert Schulze
c893302a08
Implement a MySQL-compatible variant of makeDate()
Fixes #49143
2023-05-06 20:11:36 +00:00
Robert Schulze
2986c28761
Small fixes 2023-05-06 18:12:10 +00:00
Robert Schulze
7137c8811a
Merge pull request #49158 from ClickHouse/rs/show-indexes
Implement SHOW INDEX
2023-05-06 15:30:29 +02:00
Alexey Milovidov
965956ad55
Merge pull request #47815 from jkaflik/add-gcs-table-function
Add Google Cloud Storage S3 compatible table function
2023-05-05 23:18:58 +03:00
Robert Schulze
b2f0108150
Merge pull request #49292 from exmy/allow_int_type_for_bit_agg
Allow Int arguments for aggregation function groupBitAnd/Or/Xor
2023-05-05 21:23:13 +02:00
xmy
180bb9f13d fix 2023-05-05 18:05:09 +08:00
DanRoscigno
9815b65849 add description 2023-05-04 12:47:39 -04:00
Ivan Takarlikov
8873856ce5 Fix some grammar mistakes in documentation, code and tests 2023-05-04 13:35:18 -03:00
Kuba Kaflik
c6953f4452 Merge branch 'master' of github.com:clickhouse/ClickHouse into add-gcs-table-function 2023-05-04 16:03:16 +02:00
DanRoscigno
6670cc2154 add description 2023-05-04 09:43:01 -04:00
DanRoscigno
b9afc70583 WIP 2023-05-04 09:36:27 -04:00
ivan-klass
1da30e2830
Better text 2023-05-04 13:06:07 +02:00
ivan-klass
a788d0b124
Note for multiple arrayJoin of same array expression 2023-05-04 13:03:27 +02:00
alesapin
89caf40840
Merge pull request #46528 from myrrc/feature/show-secrets-option
Show secrets in `SHOW` and `SELECT` queries
2023-05-04 11:21:30 +02:00
alekar
af81f056b4
Merge branch 'master' into 48716 2023-05-03 18:35:54 -07:00
candiduslynx
d5ccc99b1c add map to list of unsupported nullable types 2023-05-03 18:29:55 +03:00
Alexander Tokmakov
e399903030
Merge pull request #48548 from ClickHouse/clusters_is_active_column
Add some columns to system.clusters
2023-05-03 17:42:40 +03:00
Nikolay Degterinsky
4c038ef538
Merge branch 'master' into bcrypt 2023-05-02 21:52:21 +02:00
Ziy1-Tan
2c159061ed Support _path and _file virtual columns for table function url. 2023-05-01 21:40:30 +08:00
larryluogit
e18395b265
Merge branch 'master' into saveIntervals 2023-05-01 08:15:47 -04:00
Raqbit
2bce8a1b44
Fix typo "nullbale" in data-types reference doc 2023-05-01 10:14:46 +02:00
Nikolay Degterinsky
f64ef88197 Merge remote-tracking branch 'upstream/master' into bcrypt 2023-04-30 21:39:16 +00:00
Kseniia Sumarokova
7d867d7632
Merge pull request #49225 from MikhailBurdukov/mongo_dict_tls
Connection options for MongoDB dictionaries
2023-04-29 11:37:45 +02:00
alekar
29d96afae8
Merge branch 'master' into 48716 2023-04-28 14:24:46 -07:00
Mike Kot
2d20658885
Merge branch 'master' into feature/show-secrets-option 2023-04-28 20:19:35 +03:00
Nikolay Degterinsky
69d8d41197 Add docs, better tests 2023-04-28 15:12:48 +00:00
Mike Kot
f1d0ed367d separate name for format setting 2023-04-28 13:39:38 +00:00
Mike Kot
7bffa89e96 add all 3 settings 2023-04-28 13:22:36 +00:00
Alexey Milovidov
6dcfdbfaf2
Merge branch 'master' into clusters_is_active_column 2023-04-28 16:16:25 +03:00
xmy
2da33b96eb Allow Int* type argument for groupBitAnd/GroupBitOr/groupBitXor 2023-04-28 11:03:50 +08:00
Mike Kot
e428af0b63 replace format settings with server setting 2023-04-27 21:27:18 +00:00
Mike Kot
357cde95e9 Merge branch 'master' into feature/show-secrets-option 2023-04-27 20:19:40 +00:00
Nikita Mikhaylov
da72eb630e Done 2023-04-27 18:14:46 +02:00
Nikita Mikhaylov
fe9cc7ad90 Better 2023-04-27 17:38:15 +02:00