Commit Graph

14517 Commits

Author SHA1 Message Date
Alexey Milovidov
f5506210d6 Geo types are production ready 2023-05-19 12:43:55 +02:00
Azat Khuzhin
2b240d3721 Improve documentation for HASHED/SPARSE_HASHED/COMPLEX_KEY_HASHED/COMPLEX_KEY_SPARSE_HASHED
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Azat Khuzhin
2996b38606 Add ability to configure maximum load factor for the HASHED/SPARSE_HASHED layout
As it turns out, HashMap/PackedHashMap works great even with max load
factor of 0.99. By "great" I mean it least it works faster then
google sparsehash, and not to mention it's friendliness to the memory
allocator (it has zero fragmentation since it works with a continuious
memory region, in comparison to the sparsehash that doing lots of
realloc, which jemalloc does not like, due to it's slabs).

Here is a table of different setups:

settings                         | load (sec) | read (sec) | read (million rows/s) | bytes_allocated | RSS
-                                | -          | -          | -                     | -               | -
HASHED upstream                  | -          | -          | -                     | -               | 35GiB
SPARSE_HASHED upstream           | -          | -          | -                     | -               | 26GiB
-                                | -          | -          | -                     | -               | -
sparse_hash_map glibc hashbench  | -          | -          | -                     | -               | 17.5GiB
sparse_hash_map packed allocator | 101.878    | 231.48     | 4.32                  | -               | 17.7GiB
PackedHashMap 0.5                | 15.514     | 42.35      | 23.61                 | 20GiB           | 22GiB
hashed 0.95                      | 34.903     | 115.615    | 8.65                  | 16GiB           | 18.7GiB
**PackedHashMap 0.95**           | **93.6**   | **19.883** | **10.68**             | **10GiB**       | **12.8GiB**
PackedHashMap 0.99               | 26.113     | 83.6       | 11.96                 | 10GiB           | 12.3GiB

As it shows, PackedHashMap with 0.95 max_load_factor, eats 2.6x less
memory then SPARSE_HASHED in upstream, and it also 2x faster for read!

v2: fix grower
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-19 06:07:21 +02:00
Denny Crane
94fe224935
Update partition.md 2023-05-18 10:06:59 -03:00
Victor Krasnov
3c68d63213 Amend the toUnixTimestamp function documentation 2023-05-18 09:11:48 +00:00
libin
d294ecbc16
Update grant.md
docs: Modifying grant example
2023-05-18 15:50:19 +08:00
Alexey Gerasimchuk
1fb9e36b81
Merge branch 'master' into ADQM-808 2023-05-18 07:59:02 +10:00
Rich Raposa
03b5bfe218
Merge pull request #49968 from ClickHouse/reddit
Add Reddit comments to datasets
2023-05-17 15:26:29 -06:00
rfraposa
6a136897e3 Create reddit-comments.md 2023-05-17 13:23:53 -06:00
Han Fei
549af4d351 address comments 2023-05-17 21:23:32 +02:00
DanRoscigno
a1fc96953f reorder 2023-05-17 14:48:16 -04:00
Dan Roscigno
addc0c0ece
Merge branch 'master' into allow_experimental_parallel_reading_from_replicas 2023-05-17 13:20:14 -04:00
Alexey Gerasimchuck
253fadb676 Updated documentation for --multiquery 2023-05-17 03:45:09 +00:00
Thom O'Connor
15cb627688
Update postgresql.md
The type cannot be optional, so we need to be clear in the docs that [type1] [type2] cannot be in square brackets, so removed them
2023-05-16 16:35:41 -06:00
DanRoscigno
64bf477ff2 add DDL 2023-05-16 14:58:04 -04:00
Kruglov Pavel
4530f38fdf
Merge branch 'master' into urlCluster 2023-05-16 16:21:23 +02:00
Han Fei
7df0e9d933 fix broken link 2023-05-16 15:33:08 +02:00
Han Fei
a40d86b921
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:42 +02:00
Han Fei
ed5906f15d
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:31 +02:00
Han Fei
31b8e3c489
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:24 +02:00
Han Fei
e4e473ef30
Update docs/en/sql-reference/dictionaries/index.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2023-05-16 11:22:14 +02:00
Han Fei
29aa960377 refine docs for regexp tree dictionary 2023-05-16 09:07:35 +02:00
Robert Schulze
59bc3e25be
Merge pull request #49824 from AVMusorin/allow-alias-column-kafka
KafkaEngine: Allow usage of Alias column type
2023-05-15 23:40:03 +02:00
robot-ch-test-poll4
e58d1821eb
Merge pull request #49788 from lesandie/master
Reworked Local File Cache documentation
2023-05-15 21:38:43 +02:00
Vitaly Baranov
bf3336a84e
Merge pull request #47640 from ilejn/row_policy_template
Row policy for database
2023-05-15 20:05:15 +02:00
avogar
f9e0eb47d7 Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-15 16:35:24 +00:00
Diego Nieto
bf201a09b7
Update docs/en/operations/storing-data.md
@kssenii comments

Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>
2023-05-15 15:43:39 +02:00
AVMusorin
418a61a68c
Allow using Alias column type for KafkaEngine
```
create table kafka
(
 a UInt32,
 a_str String Alias toString(a)
) engine = Kafka;

create table data
(
  a UInt32;
  a_str String
) engine = MergeTree
order by tuple();

create materialized view data_mv to data
(
  a UInt32,
  a_str String
) as
select a, a_str from kafka;
```
Alias type works as expected in comparison with MATERIALIZED/EPHEMERAL
or column with default expression.

Ref: https://github.com/ClickHouse/ClickHouse/pull/47138

Co-authored-by: Azat Khuzhin <a3at.mail@gmail.com>
2023-05-15 15:39:58 +02:00
Diego Nieto
52fdc49ef3
Merge branch 'master' into master 2023-05-15 10:28:31 +02:00
robot-clickhouse
bec4edad65
Merge pull request #49856 from darkkeks/fix-date-time-functions-toc
[docs] Remove "example" section from date-time-functions page toc
2023-05-14 23:43:00 +02:00
Ilya Golshtein
9ef610040f row_policy_template - table * is respected 2023-05-13 22:30:02 +00:00
Ilya Golshtein
7fa6ea4ccc row policy template: tests and code cleanup, code review changes 2023-05-13 22:30:02 +00:00
Nikolay Degterinsky
a2e7c77fe2
Merge pull request #49860 from ucasfl/doc
update doc
2023-05-13 23:26:40 +02:00
flynn
f775a466f7 update doc 2023-05-13 16:12:51 +00:00
Robert Schulze
0d9a6b5545
Update QPL docs 2023-05-13 13:26:47 +00:00
darkkeks
2b793e3a14 [docs] Remove "example" section from date-time-functions page toc 2023-05-13 16:22:17 +03:00
Alexey Milovidov
6df457dcaf
Merge pull request #49811 from jinjunzh/iaadeflate_swpath_compat
qpl_deflate codec lower the minimum SIMD version to SSE 4.2
2023-05-13 04:21:54 +03:00
Rich Raposa
944100c4b1
Merge pull request #49841 from nellicus/dedicate_keepers_prod
[docs] strongly recommend dedicated keeper in prod
2023-05-12 14:08:47 -06:00
Robert Schulze
f011421f32
Merge pull request #49825 from den-crane/patch-66
Doc. Note about why Google CityHash <> cityHash64
2023-05-12 20:01:58 +02:00
nellicus
5366da8bcf fix links 2023-05-12 18:07:38 +02:00
nellicus
2960a0420b strongly recommend ded keeper in prod 2023-05-12 17:58:41 +02:00
robot-ch-test-poll1
6314813e7b
Merge pull request #49840 from yakkomajuri/patch-3
Minor correction to example in COMMENT COLUMN docs
2023-05-12 17:31:58 +02:00
Yakko Majuri
eb6cbe63ee
Minor correction to example in COMMENT COLUMN 2023-05-12 11:54:53 -03:00
Robert Schulze
5133597661
+m 2023-05-12 16:51:47 +02:00
Robert Schulze
9d34233e65
Merge branch 'master' into iaadeflate_swpath_compat 2023-05-12 16:39:17 +02:00
Robert Schulze
5235f52de3
Small rewrite, removed note about farm hash (it seemed fragile: https://github.com/ClickHouse/ClickHouse/issues/8354#issuecomment-974808920) 2023-05-12 16:38:32 +02:00
DanRoscigno
b8b4723aa6 add max_rows_in_set_to_optimize_join docs 2023-05-12 10:19:30 -04:00
Diego Nieto
78127d8a2c
Merge branch 'ClickHouse:master' into master 2023-05-12 16:08:48 +02:00
jinjunzh
e72a6fe973 qpl-deflate codec lower the minimum SIMD version to SSE 4.2 2023-05-12 09:56:01 -04:00
DanRoscigno
0f7c0859b0 add allow_experimental_parallel_reading_from_replicas docs 2023-05-12 09:22:29 -04:00
Dan Roscigno
596e50ebc0
Merge pull request #49830 from DanRoscigno/docs-udfs
uplevel UDFs and link
2023-05-12 09:05:39 -04:00
DanRoscigno
0b9e575588 uplevel UDFs and link 2023-05-12 08:05:38 -04:00
Robert Schulze
b9c185af44
Merge pull request #49678 from azat/build/llvm-16
Switch to LLVM/clang 16 (16.0.3)
2023-05-12 13:47:36 +02:00
Denny Crane
051456ba92
Update hash-functions.md 2023-05-12 08:33:27 -03:00
Diego Nieto
ba44b23889
Merge branch 'ClickHouse:master' into master 2023-05-12 13:31:12 +02:00
Denny Crane
47b27ca5fb
Update hash-functions.md 2023-05-12 08:23:02 -03:00
Dan Roscigno
56a563f2bd
Merge pull request #49810 from DanRoscigno/doc-greatest-least
add greatest() and least() docs
2023-05-12 05:52:01 -04:00
Diego Nieto
7dcaa0a25c
Merge branch 'master' into master 2023-05-12 11:46:37 +02:00
Diego Nieto (lesandie)
8fc0083a26 Rewrite following conversation/comments 2023-05-12 11:45:38 +02:00
Han Fei
07341a04fd
Merge pull request #49666 from hanfei1991/hanfei/regexp-dict-read
Support `dictionary` table function for `RegExpTreeDictionary`
2023-05-12 11:40:46 +02:00
Diego Nieto
4b69a203ea
Merge branch 'ClickHouse:master' into master 2023-05-12 10:07:18 +02:00
DanRoscigno
11f78fed2b fix markdown spaces 2023-05-11 21:40:47 -04:00
DanRoscigno
a14ad025ee add greatest and least docs 2023-05-11 21:33:36 -04:00
Alexey Milovidov
cf2b158072
Update build-cross-riscv.md 2023-05-12 03:18:43 +03:00
robot-clickhouse
6f19a374f2
Merge pull request #49791 from den-crane/patch-65
Update settings.md
2023-05-12 01:02:58 +02:00
robot-ch-test-poll2
91fd3a3d16
Merge pull request #49745 from jrdi/improve-local-cache-documentation
Improve local cache documentation
2023-05-12 00:30:19 +02:00
Han Fei
ef74e64336 address comments 2023-05-11 22:18:08 +02:00
Denny Crane
fcf1dd52bc
Update settings.md 2023-05-11 14:05:30 -03:00
Vitaly Baranov
ec011b9113
Merge branch 'master' into user-grants-from-config 2023-05-11 17:08:06 +02:00
Rich Raposa
2f8a530bf3
Merge pull request #49741 from ClickHouse/rfraposa-patch-2
Update async_insert explanation in docs
2023-05-11 08:36:53 -06:00
Diego Nieto (lesandie)
10e0c1d832 Reworked documentation using local cache section 2023-05-11 16:06:14 +02:00
Diego Nieto (lesandie)
26743b5439 Fix Local Cache documentation explanations 2023-05-11 15:36:47 +02:00
avogar
3ee8de792c Merge branch 'master' of github.com:ClickHouse/ClickHouse into urlCluster 2023-05-11 12:46:20 +00:00
Jordi Villar
3780a07e46 Improve local cache documentation 2023-05-11 12:40:48 +02:00
Alexey Milovidov
91ffe8be73
Merge branch 'master' into build/llvm-16 2023-05-11 05:38:40 +03:00
Azat Khuzhin
d56e1423ef
Merge branch 'master' into build/llvm-16 2023-05-10 21:23:26 +02:00
Dan Roscigno
8f573aba2d
docs for connect_timeout_with_failover_secure_ms
closes #47267
2023-05-10 14:00:05 -04:00
Rich Raposa
4be0372eb4
Update settings.md
Removed the mention that async_insert is  only useful for HTTP
2023-05-10 09:48:33 -06:00
Alexey Milovidov
33a58aacea
Merge pull request #49719 from ClickHouse/make-projections-non-experimental
Make projections production-ready
2023-05-10 18:07:02 +03:00
Azat Khuzhin
a0cc5b7e47 Merge remote-tracking branch 'u/master' into build/llvm-16 2023-05-10 16:28:36 +02:00
vdimir
1b7f54e886
Merge pull request #46467 from bigo-sg/first_value 2023-05-10 14:56:14 +02:00
Roman Vasin
632c0eeefc
Add support of temporary tables in FROM part of ATTACH|REPLACE PARTITION FROM (#49436)
* Add temp tables after FROM

* Resolve works fine with temp and non-temp tables

* Add usage of temporary table in  00626_replace_partition_from_table test

* Update documentation

* Add tests for ReplicatedMergeTree

* Move tests into 02731_replace_partition_from_temporary_table
2023-05-10 15:25:12 +03:00
Robert Schulze
bf3eacf697
Merge pull request #49730 from ClickHouse/rs/docs-fix-sidebar
Docs: Fix sidebar label for dictionary table function
2023-05-10 13:54:29 +02:00
Robert Schulze
405e371aca
Docs: Fix sidebar label for dictionary table function 2023-05-10 11:18:15 +00:00
Alexey Milovidov
8a6e07f0ea Make projections production-ready 2023-05-10 03:35:13 +02:00
alekar
67ad681e41
Merge branch 'master' into 48716 2023-05-09 14:34:32 -07:00
Dan Roscigno
915903b42d
Merge pull request #49704 from pkit/pkit/docs_is_deleted
docs: fix the is_deleted option docs
2023-05-09 16:59:25 -04:00
alekar
605991d4d5
Merge branch 'master' into 48716 2023-05-09 13:38:11 -07:00
Azat Khuzhin
be39e8e566 Merge remote-tracking branch 'u/master' into build/llvm-16
Conflicts:
	docker/test/codebrowser/Dockerfile
2023-05-09 20:40:43 +02:00
Azat Khuzhin
432250a85d Update instructions for building under OSX to clang-16
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-09 20:28:38 +02:00
DanRoscigno
0a83f2aee8 fix broken note rendering 2023-05-09 13:29:59 -04:00
Dan Roscigno
c75e056d98
Update docs/en/engines/table-engines/mergetree-family/replacingmergetree.md 2023-05-09 13:18:42 -04:00
Dan Roscigno
199838e7fb
Update docs/en/engines/table-engines/mergetree-family/replacingmergetree.md 2023-05-09 13:18:35 -04:00
Dan Roscigno
cd62df4887
Update docs/en/engines/table-engines/mergetree-family/replacingmergetree.md 2023-05-09 13:18:26 -04:00
Dan Roscigno
582767e9ab
Update docs/en/engines/table-engines/mergetree-family/replacingmergetree.md 2023-05-09 13:18:18 -04:00
Dan Roscigno
2e20603b01
Update docs/en/engines/table-engines/mergetree-family/replacingmergetree.md 2023-05-09 13:18:09 -04:00
Constantine Peresypkin
cef9862628 docs: fix the is_deleted option docs
- `is_deleted` type __must__ be `UInt8` and not `Int8`
- strange wording on how `clean_deleted_rows` works is fixed
2023-05-09 12:40:33 -04:00
Duyet Le
4c7849a9c8
Fix incorrect docs for use_query_cache 2023-05-09 17:08:49 +07:00
vdimir
6bb552377f
Merge pull request #49085 from ClibMouse/saveIntervals 2023-05-09 11:33:05 +02:00
Alexey Milovidov
512db049e0
Merge pull request #49652 from vrfn/documentation-update-backup-all
Documentation: reflect that `BACKUP ALL` is supported
2023-05-09 08:05:30 +03:00