Commit Graph

121004 Commits

Author SHA1 Message Date
Yakov Olkhovskiy
565204f491
Merge branch 'master' into fix-nullable-ipv4-intdiv 2023-07-27 17:34:53 -04:00
chen
0c3118c9af
Merge branch 'master' into s3-url 2023-07-28 05:26:10 +08:00
Dmitry Novik
18c1fd6f08 Refactor InDepthQueryTreeVisitorWithContext 2023-07-27 21:24:39 +00:00
robot-clickhouse
7d8dc92ed0 Automatic style fix 2023-07-27 21:07:54 +00:00
Sema Checherinda
4629ab1df1 add test with broken pipe 2023-07-28 00:24:05 +04:00
Robert Schulze
0b0ad7d45d
Merge branch 'master' into follow-up-to-50986 2023-07-27 22:08:26 +02:00
Yakov Olkhovskiy
9448d42aea
Merge pull request #51756 from ClickHouse/fix-ip-dict
Support IPv4 and IPv6 as dictionary attributes
2023-07-27 15:59:21 -04:00
Jai Jhala
dba8b445bd
Update default output_format_arrow_compression.md
Updates the default parameter of output_format_arrow_compression_method from "none" to "lz4_frame".
2023-07-27 12:32:53 -07:00
Alexey Milovidov
f7d8930906
Merge pull request #52664 from ClickHouse/auto/v23.7.1.2470-stable
Update version_date.tsv and changelogs after v23.7.1.2470-stable
2023-07-27 22:14:36 +03:00
Nikolay Degterinsky
eb74e658b7 Fix build 2023-07-27 19:12:06 +00:00
Yakov Olkhovskiy
5942c80fae fix test: '--option' now is allowed after terminating '--' 2023-07-27 19:11:44 +00:00
Han Fei
4f9075e819
Merge pull request #52623 from hanfei1991/hanfei/MergeTreeDeduplicationLog
Fix stress test: check if storage shutdown before we operate MergeTreeDeduplicationLog
2023-07-27 20:50:17 +02:00
Robert Schulze
9340f02d26
Silence spell check 2023-07-27 18:33:07 +00:00
Yakov Olkhovskiy
b95745d916 fix: check positional options 2023-07-27 17:52:44 +00:00
Anton Kozlov
2717be7c47 [minor][bugfix] fix connected_zk_index column of system.zookeeper_connection table 2023-07-27 17:34:53 +00:00
Azat Khuzhin
a8a48af7cb Fix 02417_opentelemetry_insert_on_distributed_table flakiness
Looks like everything is OK with opentelemetry, and the reason of the
flakiness is this:

    $ gg opentelemetry_start_trace_probability tests/**.xml
    tests/config/users.d/opentelemetry.xml:            <opentelemetry_start_trace_probability>0.1</opentelemetry_start_trace_probability>

So let's simply disable it.

And also let's stop the distributed sends to increase the failure rate
if there is some problem left.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-07-27 19:19:24 +02:00
Smita Kulkarni
380da31512 Improvements to backup restore disallow_concurrency test 2023-07-27 19:03:44 +02:00
Dmitry Novik
9e1e005bcf Merge remote-tracking branch 'origin/master' into query-tree-visitor 2023-07-27 16:44:08 +00:00
Dmitry Novik
03f025ddd1 Fix typo 2023-07-27 16:38:02 +00:00
Vitaly Baranov
faca843ac0 Add utility to parse a backup metadata file and print information about the backup. 2023-07-27 18:25:21 +02:00
Anton Popov
dfc06d2714 fix reading of unneded column in case of multistage prewhere 2023-07-27 16:15:23 +00:00
Robert Schulze
c2ee774350
Add query cache metrics to system.asynchronous_metrics
Cf. https://github.com/ClickHouse/ClickHouse/pull/52384#issuecomment-1653241216
2023-07-27 15:19:01 +00:00
Robert Schulze
d48b1cadb0
Merge pull request #52384 from rschu1ze/query_cache_usage_in_query_log
Include query cache usage in SYSTEM.QUERY_LOG
2023-07-27 17:16:54 +02:00
Azat Khuzhin
68aed0d16e RFC: Fix filtering by virtual columns with OR expression
Virtual columns did not supports queries with OR, for example query like
this (here `m` is the `Merge` table, see the test):

    select key from m where (value = 10 and _table = 'v1') or (value = 20 and _table = 'v1');

Will always leads to:

    Cannot find column `value` in source stream, there are only columns ...

The reason for this is that it actually executes the following queries:

    SELECT key, value FROM default.d1 WHERE ((value = 10) AND ('v1' = 'v1')) OR ((value = 20) AND ('v1' = 'v1'));
    SELECT key FROM default.d2 WHERE 0;

And this kind of filtering is used not only for `Merge` table but also:
- `_table` for `Merge` (already mentioned)
- `_file` for `File`
- `_idx` for `S3`
- and as well as filtering `system.*` tables by `database`/`table`/...

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-07-27 16:35:17 +02:00
Azat Khuzhin
5611b2fff4 Add a note about not working _table filter for Merge with analyzer
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-07-27 16:35:17 +02:00
Han Fei
9488567bf6
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:28:08 +02:00
Han Fei
f6ca013c53
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:28:00 +02:00
Han Fei
0d44d527ef
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp
Co-authored-by: Alexander Tokmakov <tavplubix@gmail.com>
2023-07-27 16:27:04 +02:00
Nikolay Degterinsky
ee9bad7a31 Fix style 2023-07-27 14:18:46 +00:00
Nikolay Degterinsky
b5160c8072 Fix bugs and better test for SYSTEM STOP LISTEN 2023-07-27 14:16:19 +00:00
Han Fei
f5dfb70f5c
Update src/Storages/MergeTree/MergeTreeDeduplicationLog.cpp 2023-07-27 16:12:20 +02:00
Han Fei
ce38d3c5ea address comment 2023-07-27 16:11:08 +02:00
robot-clickhouse
28c49e3f20 Update version_date.tsv and changelogs after v23.7.1.2470-stable 2023-07-27 13:47:11 +00:00
Alexey Milovidov
8d0a201468
Merge pull request #52661 from ClickHouse/23.8-prepare
Update version after release
2023-07-27 16:35:10 +03:00
Alexey Milovidov
671128140d Update autogenerated version to 23.8.1.1 and contributors 2023-07-27 15:34:05 +02:00
Jiebin Sun
78f3a575f9
Convert hashSets in parallel before merge (#50748)
* Convert hashSets in parallel before merge

Before merge, if one of the lhs and rhs is singleLevelSet and the other is twoLevelSet,
then the SingleLevelSet will call convertToTwoLevel(). The convert process is not in parallel
and it will cost lots of cycle if it cosume all the singleLevelSet.

The idea of the patch is to convert all the singleLevelSets to twoLevelSets in parallel if
the hashsets are not all singleLevel or not all twoLevel.

I have tested the patch on Intel 2 x 112 vCPUs SPR server with clickbench and latest upstream
ClickHouse.
Q5 has got a big 264% performance improvement and 24 queries have got at least 5% performance
gain. The overall geomean of 43 queries has gained 7.4% more than the base code.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* add resize() for the data_vec in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Add the performance test prepare_hash_before_merge.xml

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Fit the CI to rename the data set from hits_v1 to test.hits.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* remove the redundant branch in UniqExactSet

Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>

* Remove the empty methods and add throw exception in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

---------

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2023-07-27 15:06:34 +02:00
Alexander Gololobov
006453b102
Merge branch 'master' into return_zxid 2023-07-27 14:57:39 +02:00
Anton Popov
541a253580 fix crash with function tuple and one sparse argument 2023-07-27 12:43:38 +00:00
Alexey Milovidov
33300a978e
Update CHANGELOG (#52655) 2023-07-27 14:38:53 +02:00
Yakov Olkhovskiy
8bba7baeaa fix style 2023-07-27 12:34:22 +00:00
Alexander Tokmakov
a70127baec
Merge pull request #52657 from ClickHouse/revert-52518-add-refcount-to-system-table
Revert "Added field `refcount` to `system.remote_data_paths` table"
2023-07-27 15:26:13 +03:00
Alexander Tokmakov
f3dc6dd061
Revert "Added field refcount to system.remote_data_paths table" 2023-07-27 15:23:57 +03:00
Yakov Olkhovskiy
9d73be6fca test is added 2023-07-27 12:14:46 +00:00
Robert Schulze
f7bd3a2a23
Merge branch 'master' into follow-up-to-50986 2023-07-27 13:14:47 +02:00
Alexander Tokmakov
228de12d94
Update ReplicatedMergeTreeQueue.cpp (#52648) 2023-07-27 13:59:23 +03:00
Antonio Andelic
04361908e6 Fix test_keeper_reconfig_replace_leader 2023-07-27 10:36:02 +00:00
Alexey Milovidov
d7a2a76438
Merge pull request #52644 from ClickHouse/changelog-23.7
Changelog for 23.7
2023-07-27 13:00:18 +03:00
Roman Vasin
578ea9c5e8 Update documentation 2023-07-27 09:50:42 +00:00
Robert Schulze
043ad45ec3
Incorporate review feedback, pt. II 2023-07-27 09:18:24 +00:00
Alexey Milovidov
17b647f6b1
Merge pull request #52518 from CurtizJ/add-refcount-to-system-table
Added field `refcount` to `system.remote_data_paths` table
2023-07-27 12:13:24 +03:00