Commit Graph

142837 Commits

Author SHA1 Message Date
Robert Schulze
3d20703958
Force-enable analyzer so that tests without Analyzer can no longer fail 2024-05-24 12:27:19 +00:00
Nikita Fomichev
a6e06b27d2 Update description for settings cross_join_min_rows_to_compress and cross_join_min_bytes_to_compress 2024-05-24 14:17:37 +02:00
Jiebin Sun
d40c5a07be Avoid UBSan warning while buf is nullptr 2024-05-24 19:35:07 +08:00
Jiebin Sun
d1d57caf0a Generate the seperate function firstPopulatedCell() and lastPopulatedCell() 2024-05-24 19:35:07 +08:00
Jiebin Sun
ca88da11e0 Update src/Common/HashTable/FixedHashTable.h
Revise the method name by Nikita.

Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2024-05-24 19:35:07 +08:00
Jiebin Sun
4e6f5fba83 Update src/Common/HashTable/FixedHashTable.h
Add comment by Nikita.

Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2024-05-24 19:35:07 +08:00
Jiebin Sun
7f960e4e8a Add the use_emplace_to_insert_data flag. emplace() is the only interface
to update min/max. If the FixedHashTable.emplace() is not used to revise
the hashtable value, then we should not continue the min/max optimization.
2024-05-24 19:35:07 +08:00
Jiebin Sun
60420f2a8e Fix a bug if data will be inserted not by emplace(). 2024-05-24 19:35:07 +08:00
Jiebin Sun
69960a5735 Fix a bug if the container is empty 2024-05-24 19:35:07 +08:00
Jiebin Sun
d48fba5b2b Limit the array index of FixedHashTable by min/max
If the type of key is 8 bits or 16 bits in aggregation, ClickHouse will use array
of 256 or 65536 length to store the key and boost the mergeSingleLevel, rather than
key comparison. However, if the key has occupied only small range of the total 65536
cells, most of the cycles are wasted on the `isZero()` to find the next cell which
is not zero in iterator++.
The solution is to use min/max and update min/max when emplace. Then we can set the
upper searching limit to max in iterator++. And just set min as the value of `begin()`,
rather than searching the first cell that not equals to 0.
We have tested the patch on 2x80 vCPUs server, Query 7 of ClickBench has gained 2.1x
performance improvement.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2024-05-24 19:35:07 +08:00
Raúl Marín
534f996be3 Change input_format_parquet_use_native_reader to 24.6 2024-05-24 13:07:37 +02:00
Max K
7ccb776ed9 mcheck fix 2024-05-24 12:39:35 +02:00
Robert Schulze
1f1c2c21b1
Fix spelling 2024-05-24 10:32:42 +00:00
Robert Schulze
3d91937a94
Merge remote-tracking branch 'ClickHouse/master' into add-serial-function 2024-05-24 10:32:20 +00:00
Max K
f1421c9e5c style fix 2024-05-24 12:27:05 +02:00
Robert Schulze
0e758722c6
Enable 02494_query_cache_nested_query_bug for Analyzer 2024-05-24 10:19:01 +00:00
Robert Schulze
a725112c4c
Fix different hashes for reading/writing from/to query cache 2024-05-24 10:12:08 +00:00
Max K
22b441ed40 fix PR template 2024-05-24 12:00:24 +02:00
Max K
d4fb2d50e9 CI: Sync, Merge check, CI gh's statuses fixes 2024-05-24 12:00:16 +02:00
Max K
b6dd8446f5
Merge pull request #64340 from ClickHouse/ci_foldable_master_wf
CI: master workflow with folded jobs
2024-05-24 09:43:01 +00:00
Nikolai Kochetov
523634b1c7
Merge pull request #63993 from ClickHouse/experiment-do-not-add-identifiers-to-expr-map
Check what would be broken if do not add all the identifiers to functions map.
2024-05-24 09:29:40 +00:00
Robert Schulze
f924316b9d
Merge pull request #64300 from Blargian/docs_return_type_consistency
[Docs] More consistent formatting of function documentation
2024-05-24 09:03:24 +00:00
Max K
3e21ff92a3 CI: master workflow with folded jobs 2024-05-24 10:53:19 +02:00
Kseniia Sumarokova
c10fc6d80e
Merge branch 'master' into use-iobject-storage-for-table-engines-1 2024-05-24 10:46:45 +02:00
Robert Schulze
480f911c76
Fix spelling 2024-05-24 08:33:44 +00:00
Blargian
c638de90c2 Fix incorrectly placed :::note blocks 2024-05-24 10:01:06 +02:00
Han Fei
f21c7505a2
Merge pull request #64181 from ClickHouse/fix-test-deduplication_token
Fix: 02124_insert_deduplication_token_multiple_blocks_replica
2024-05-24 07:32:45 +00:00
Blargian
8783647703 Revert addressToLine to original formatting 2024-05-24 07:03:15 +02:00
Blargian
b19c5ad13a Revert roundAge to original formatting 2024-05-24 07:01:15 +02:00
Blargian
3071909aca Revert roundAge to original formatting 2024-05-24 07:00:47 +02:00
Blargian
67ff6883fd Restore original formatting for logical functions and, or, not, xor 2024-05-24 06:51:38 +02:00
Michael Kolupaev
ee3e7f2fd0
Merge pull request #60361 from copperybean/gcmaster-parquet
A native parquet reader for primitive types
2024-05-24 04:50:12 +00:00
Blargian
e87c168bd8 Turn multi-line returns into a single line 2024-05-24 06:42:13 +02:00
Blargian
029e2ea226 Standardize references to data type docs 2024-05-24 05:54:16 +02:00
jsc0218
27627f603f fix 2024-05-24 03:04:36 +00:00
Max K
08d895e297
Merge pull request #64299 from ClickHouse/ci_cancel_sync_wf_on_new_push
CI: Cancel sync wf on new push
2024-05-24 03:02:11 +00:00
jsc0218
19cf12e205 Merge remote-tracking branch 'origin/master' into BetterLWDProjException 2024-05-24 02:33:10 +00:00
Alexey Milovidov
dddcfaab65
Merge pull request #64245 from den-crane/test/issue_45804
test for #45804
2024-05-24 04:32:53 +02:00
Alexey Milovidov
6c06772e5c
Merge pull request #64285 from ClickHouse/vdimir/squashing_transform_followup
Followup for #63691
2024-05-24 04:32:08 +02:00
Alexey Milovidov
2040a4d209
Merge pull request #64243 from Michicosun/fix_hdfs_unset_skip_unavailable_shards
Fix test_hdfsCluster_unset_skip_unavailable_shards
2024-05-24 01:14:15 +00:00
Alexey Milovidov
476bdb7174
Merge pull request #64325 from natalyjazzviolin/nataly/file-engine-cloud
Adds note: File engine unavailable in ClickHouse Cloud.
2024-05-24 01:13:23 +00:00
TTPO100AJIEX
251010f109 Move protocol-server and inter-server management into separate classes
Co-authored-by: Alex Koledaev <ax3l3rator@gmail.com>
2024-05-24 02:14:26 +03:00
Nataly Merezhuk
5710b5852f Adds note - file engine unavailable in ClickHouse Cloud. 2024-05-23 17:45:58 -04:00
Robert Schulze
6e6e2944b5
Fix glitch in #62696 2024-05-23 21:29:41 +00:00
Robert Schulze
dac31fb92a
Include settings into query cache key 2024-05-23 21:29:41 +00:00
Robert Schulze
0bfa56b468
Merge pull request #64199 from rschu1ze/fix-64136
Query Cache: Consider identical queries against different databases as different
2024-05-23 20:52:32 +00:00
Max K
bd15e1311a CI: fix 2024-05-23 22:35:21 +02:00
Robert Schulze
578dc58521
Merge pull request #62696 from rschu1ze/revert-revert-62392
Revert "Revert "Speed up `splitByRegexp`""
2024-05-23 20:21:21 +00:00
pufit
8ce3f38eeb
Merge pull request #64079 from ClickHouse/pufit/63777-fix-sql-security-with-analyzer
Fix SQL security access checks with analyzer
2024-05-23 20:00:16 +00:00
kssenii
40753ddefb Update hdfs test 2024-05-23 21:10:40 +02:00