ClickHouse/tests
Jiebin Sun 78f3a575f9
Convert hashSets in parallel before merge (#50748)
* Convert hashSets in parallel before merge

Before merge, if one of the lhs and rhs is singleLevelSet and the other is twoLevelSet,
then the SingleLevelSet will call convertToTwoLevel(). The convert process is not in parallel
and it will cost lots of cycle if it cosume all the singleLevelSet.

The idea of the patch is to convert all the singleLevelSets to twoLevelSets in parallel if
the hashsets are not all singleLevel or not all twoLevel.

I have tested the patch on Intel 2 x 112 vCPUs SPR server with clickbench and latest upstream
ClickHouse.
Q5 has got a big 264% performance improvement and 24 queries have got at least 5% performance
gain. The overall geomean of 43 queries has gained 7.4% more than the base code.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* add resize() for the data_vec in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Add the performance test prepare_hash_before_merge.xml

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* Fit the CI to rename the data set from hits_v1 to test.hits.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

* remove the redundant branch in UniqExactSet

Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>

* Remove the empty methods and add throw exception in parallelizeMergePrepare()

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>

---------

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2023-07-27 15:06:34 +02:00
..
ci Fix attaching gdb in stress tests (#51445) 2023-07-26 12:53:19 +03:00
config
fuzz
instructions
integration Add SYSTEM STOP LISTEN query (#51016) 2023-07-26 19:58:41 +02:00
jepsen.clickhouse
perf_drafts
performance Convert hashSets in parallel before merge (#50748) 2023-07-27 15:06:34 +02:00
queries Revert "Added field refcount to system.remote_data_paths table" 2023-07-27 15:23:57 +03:00
sqllogic
.gitignore
.rgignore
analyzer_integration_broken_tests.txt
analyzer_tech_debt.txt Merge pull request #52328 from ClickHouse/Fix_parameterized_view_with_cte_multiple_usage 2023-07-27 00:06:10 +02:00
clickhouse-test
CMakeLists.txt
tsan_suppressions.txt
ubsan_suppressions.txt