Commit Graph

120789 Commits

Author SHA1 Message Date
Azat Khuzhin
68aed0d16e RFC: Fix filtering by virtual columns with OR expression
Virtual columns did not supports queries with OR, for example query like
this (here `m` is the `Merge` table, see the test):

    select key from m where (value = 10 and _table = 'v1') or (value = 20 and _table = 'v1');

Will always leads to:

    Cannot find column `value` in source stream, there are only columns ...

The reason for this is that it actually executes the following queries:

    SELECT key, value FROM default.d1 WHERE ((value = 10) AND ('v1' = 'v1')) OR ((value = 20) AND ('v1' = 'v1'));
    SELECT key FROM default.d2 WHERE 0;

And this kind of filtering is used not only for `Merge` table but also:
- `_table` for `Merge` (already mentioned)
- `_file` for `File`
- `_idx` for `S3`
- and as well as filtering `system.*` tables by `database`/`table`/...

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-07-27 16:35:17 +02:00
Azat Khuzhin
5611b2fff4 Add a note about not working _table filter for Merge with analyzer
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-07-27 16:35:17 +02:00
Han Fei
9488567bf6
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:28:08 +02:00
Han Fei
f6ca013c53
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:28:00 +02:00
Han Fei
0d44d527ef
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp
Co-authored-by: Alexander Tokmakov <tavplubix@gmail.com>
2023-07-27 16:27:04 +02:00
Han Fei
f5dfb70f5c
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:12:20 +02:00
Han Fei
ce38d3c5ea address comment 2023-07-27 16:11:08 +02:00
robot-clickhouse
28c49e3f20 Update version_date.tsv and changelogs after v23.7.1.2470-stable 2023-07-27 13:47:11 +00:00
Alexey Milovidov
8d0a201468
Merge pull request #52661 from ClickHouse/23.8-prepare
Update version after release
2023-07-27 16:35:10 +03:00
Alexey Milovidov
671128140d Update autogenerated version to 23.8.1.1 and contributors 2023-07-27 15:34:05 +02:00
Jiebin Sun
78f3a575f9
Convert hashSets in parallel before merge (#50748)
* Convert hashSets in parallel before merge

Before merge, if one of the lhs and rhs is singleLevelSet and the other is twoLevelSet,
then the SingleLevelSet will call convertToTwoLevel(). The convert process is not in parallel
and it will cost lots of cycle if it cosume all the singleLevelSet.

The idea of the patch is to convert all the singleLevelSets to twoLevelSets in parallel if
the hashsets are not all singleLevel or not all twoLevel.

I have tested the patch on Intel 2 x 112 vCPUs SPR server with clickbench and latest upstream
ClickHouse.
Q5 has got a big 264% performance improvement and 24 queries have got at least 5% performance
gain. The overall geomean of 43 queries has gained 7.4% more than the base code.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* add resize() for the data_vec in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Add the performance test prepare_hash_before_merge.xml

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Fit the CI to rename the data set from hits_v1 to test.hits.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* remove the redundant branch in UniqExactSet

Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>

* Remove the empty methods and add throw exception in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

---------

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2023-07-27 15:06:34 +02:00
Alexey Milovidov
33300a978e
Update CHANGELOG (#52655) 2023-07-27 14:38:53 +02:00
Yakov Olkhovskiy
8bba7baeaa fix style 2023-07-27 12:34:22 +00:00
Alexander Tokmakov
a70127baec
Merge pull request #52657 from ClickHouse/revert-52518-add-refcount-to-system-table
Revert "Added field `refcount` to `system.remote_data_paths` table"
2023-07-27 15:26:13 +03:00
Alexander Tokmakov
f3dc6dd061
Revert "Added field refcount to system.remote_data_paths table" 2023-07-27 15:23:57 +03:00
Yakov Olkhovskiy
9d73be6fca test is added 2023-07-27 12:14:46 +00:00
Robert Schulze
f7bd3a2a23
Merge branch 'master' into follow-up-to-50986 2023-07-27 13:14:47 +02:00
Alexander Tokmakov
228de12d94
Update ReplicatedMergeTreeQueue.cpp (#52648) 2023-07-27 13:59:23 +03:00
Alexey Milovidov
d7a2a76438
Merge pull request #52644 from ClickHouse/changelog-23.7
Changelog for 23.7
2023-07-27 13:00:18 +03:00
Robert Schulze
043ad45ec3
Incorporate review feedback, pt. II 2023-07-27 09:18:24 +00:00
Alexey Milovidov
17b647f6b1
Merge pull request #52518 from CurtizJ/add-refcount-to-system-table
Added field `refcount` to `system.remote_data_paths` table
2023-07-27 12:13:24 +03:00
Alexey Milovidov
4cf8de9e36
Merge branch 'master' into changelog-23.7 2023-07-27 12:10:44 +03:00
Alexander Tokmakov
16d5c7ae59
Merge pull request #52647 from ClickHouse/fix-style
Fix style
2023-07-27 12:09:23 +03:00
Robert Schulze
2b18872e86
Incorporate review feedback 2023-07-27 08:55:19 +00:00
Antonio Andelic
f61f36800c Fix style 2023-07-27 08:48:23 +00:00
János Benjamin Antal
ac51ade45c Do not run the test without Rust libraries 2023-07-27 08:45:53 +00:00
Robert Schulze
c04fb61487
Merge branch 'master' into follow-up-to-50986 2023-07-27 10:28:59 +02:00
robot-ch-test-poll1
abc48a0b73
Merge pull request #52627 from ClickHouse/fix_build_with_clang15
Fix build with clang-15
2023-07-27 09:49:29 +02:00
robot-clickhouse
ddab6590b2
Merge pull request #52626 from ClickHouse/fix_deadlock_in_persistent_table_functions
Fix deadlocks in StorageTableFunctionProxy
2023-07-27 09:38:48 +02:00
Alexey Milovidov
d2d7139da3 Changelog for 23.7 2023-07-27 08:58:23 +02:00
robot-ch-test-poll1
a5e073ff7d
Merge pull request #52602 from CheSema/finalize-gindexstore
GinIndexStore: fix a bug when files are finalizated after first write,
2023-07-27 08:31:24 +02:00
Alexey Milovidov
65ffe91bf2 Fix double whitespace 2023-07-27 07:13:26 +02:00
Yakov Olkhovskiy
d35c87c1bd allow positional options for clickhouse-local and populate global udf settings 2023-07-27 05:06:16 +00:00
Kruglov Pavel
fab77783f1
Merge pull request #49367 from ClickHouse/enc
Partially reimplement Parquet encoder to make it faster and parallelizable
2023-07-27 00:48:54 +02:00
robot-ch-test-poll2
7bcef0a6c0
Merge pull request #52520 from zvonand/revert-52450-remove-to-decimal-string
Fix `toDecimalString` function
2023-07-27 00:18:36 +02:00
Alexey Milovidov
3387b02ede
Merge pull request #52557 from azat/tests/fix-network_receive_time_metric_insert
tests: increase throttling for 01923_network_receive_time_metric_insert
2023-07-27 01:11:18 +03:00
robot-ch-test-poll3
5db88e677b
Merge pull request #52328 from ClickHouse/Fix_parameterized_view_with_cte_multiple_usage
Fix parameterized view with cte and multiple usage
2023-07-27 00:06:10 +02:00
robot-clickhouse-ci-1
b6bcc32acb
Merge pull request #52606 from nickitat/port_use_prewhere_cols
Port one more optimisation to `MergeTreePrefetchedReadPool`
2023-07-27 00:03:47 +02:00
Alexander Tokmakov
954a1d3ede
Improve logging macros (#52519)
* improve logging macros

* fix

* Update logger_useful.h

* fix

* fix
2023-07-26 23:38:14 +03:00
Alexey Milovidov
4586290e89
Merge pull request #52509 from ucasfl/operator
Replace with three way comparison
2023-07-26 23:30:08 +03:00
Alexey Milovidov
07193904b6
Merge pull request #52598 from ClickHouse/remove-peak-memory-usage-from-client
Remove peak memory usage from the final message in the client
2023-07-26 23:26:58 +03:00
Arthur Passos
5aeeaebd00 init and destroy ares channel on demand.. 2023-07-26 17:09:23 -03:00
Nikolay Degterinsky
d89e2e6a27
Add SYSTEM STOP LISTEN query (#51016)
Co-authored-by: Nikita Mikhaylov <nikitamikhaylov@clickhouse.com>
Co-authored-by: Nikita Mikhaylov <mikhaylovnikitka@gmail.com>
2023-07-26 19:58:41 +02:00
Nikita Taranov
0a838dc6d1
Revert "Remove mmap/mremap/munmap from Allocator.h" (#52589) 2023-07-26 18:30:18 +02:00
Yakov Olkhovskiy
9a1c59a2f1
Merge branch 'master' into fix-ip-dict 2023-07-26 12:08:49 -04:00
János Benjamin Antal
3a6aaa29c9 Do not load suggestions in case not ClickHouse dialects 2023-07-26 15:26:21 +00:00
János Benjamin Antal
b8cac9499d Add tests to reproduce the problem 2023-07-26 15:26:21 +00:00
Alexander Tokmakov
74f3e76b18 fix build with clang-15 2023-07-26 17:15:28 +02:00
Alexander Tokmakov
2479f1352a fix deadlocks in StorageTableFunctionProxy 2023-07-26 17:11:28 +02:00
xiedeyantu
89f2e8cdea Fix S3 table function does not work for pre-signed URL 2023-07-26 23:02:57 +08:00