Commit Graph

125751 Commits

Author SHA1 Message Date
Azat Khuzhin
b107712e0c Fix filtering by virtual columns with OR filter in query
The problem with the initial implementation #52653 was:
- OR can have multiple arguments
- It simply not correct to assume that if there are two arguments this is OK.
  Consider the following example:

    "WHERE (column_not_from_partition_by = 1) OR false OR false"

  Will be converted to:

    "WHERE false OR false"

And it will simply read nothing.

Yes, we could apply some optimization for bool, but this will not always
work, since to optimize things like "0 = 1" we need to execute it.

And the only way to make handle this correctly (with ability to ignore
some commands during filtering) is to make is_constant() function return
has it use something from the input block, so that we can be sure, that
we have some sensible, and not just "false".

Plus we cannot simply ignore the difference of the input and output
arguments of handling OR, we need to add always-true (1/true) if the
size is different, since otherwise it could break invariants (see
comment in the code).

This includes (but not limited to):
- _part* filtering for MergeTree
- _path/_file for various File/HDFS/... engines
- _table for Merge
- ...

P.S. analyzer does not have this bug, since it execute expression as
whole, and this is what filterBlockWithQuery() should do actually
instead, but this will be a more complex patch.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-10 20:47:52 +02:00
Alexey Milovidov
ba12695525
Merge pull request #55276 from azat/SummingMergeTree-Nested-fix
Fix incorrect merging of Nested for SELECT FINAL FROM SummingMergeTree
2023-10-08 16:08:25 +02:00
Alexey Milovidov
caf3c85b38
Merge pull request #55275 from azat/sparse-columns-function-fix
Fix functions execution over sparse columns
2023-10-08 15:27:46 +02:00
Alexey Milovidov
a7716a56ac
Update IFunction.cpp 2023-10-08 16:27:33 +03:00
Alexey Milovidov
781da580ac
Merge pull request #55283 from bytadaniel/patch-2
Update client-libraries.md
2023-10-08 15:25:51 +02:00
Alexey Milovidov
6e502a899d
Merge pull request #55311 from rschu1ze/croaring-tsan
Fix tsan issue in croaring
2023-10-08 13:31:06 +02:00
alesapin
55fc3fe8ab
Merge pull request #55247 from azat/tests/test_throttling-https
tests: switch test_throttling to S3 over https to make it more production like
2023-10-08 12:39:54 +02:00
Robert Schulze
282200ef50
Merge pull request #55215 from rschu1ze/more-mysql-compat
Add `information_schema.columns.extra` and `information_schema.tables.table_rows`
2023-10-08 12:21:34 +02:00
Kseniia Sumarokova
20868f3b65
Merge pull request #55252 from kssenii/fix-background-download-in-cache
Fix for background download in fs cache
2023-10-08 10:38:23 +02:00
Kseniia Sumarokova
0af252ae31
Merge pull request #55297 from ClickHouse/kssenii-patch-6
MaterializedPostgreSQL: remove back check
2023-10-08 10:31:23 +02:00
Robert Schulze
f4bc58ea92
Merge remote-tracking branch 'rschu1ze/master' into more-mysql-compat 2023-10-08 07:49:37 +00:00
Azat Khuzhin
099665478d Fix incorrect merging of Nested for SELECT FINAL FROM SummingMergeTree
The problem was the order of the columns, in case of SELECT FINAL it got
"counters_Map.count", "counters_Map.id"

But in case of OPTIMIZE FINAL it got "counters_Map.id",
"counters_Map.count" correctly.

Note, that this bugs exists there from the very recent versions, I've
checked 19.x and it was there.

P.S. there is a workaround for this problem, if you will use one of the
following patterns for key columns:
- *ID
- *Key
- *Type

That way it will be explicitly matched as key and everything will work.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-08 07:32:47 +02:00
Azat Khuzhin
2a27a8b0bf tests/test_throttling: check the maximum execution time as well
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-08 07:31:33 +02:00
Azat Khuzhin
c883da2b26 tests/test_throttling: adjust delays due to HTTPS usage
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-08 07:31:33 +02:00
Azat Khuzhin
9c8096455e tests: switch test_throttling to S3 over https to make it more production like
One interesting thing about S3 C++ SDK is that it can read file multiple
times for calculating checksums and signature, but the last is not done
for the https protocol, though the checksum "could".

And indeed it does, since default checksum algorithm (MD5) does not
support streaming, and so it always calculated, regardless of the
protocol, however everything else (CRC*/SHA*) supports streamming and
actually will not be calculated for https at all!

This will be fixed in the follow up patch.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-08 07:31:33 +02:00
Alexey Milovidov
b38d4b5b0f
Merge pull request #55119 from jmaicher/fix/54318/deadlock-ldap-role-update
Fix deadlock in LDAP assigned role update
2023-10-08 00:38:10 +02:00
robot-clickhouse-ci-1
48ce595e24
Merge pull request #55220 from den-crane/Docs/use_local_cache_for_remote_storage
Doc. Update use_local_cache_for_remote_storage settings name
2023-10-08 00:23:54 +02:00
Alexey Milovidov
25a7f86da7
Update hive.md 2023-10-08 00:51:19 +03:00
Alexey Milovidov
857ac2705d
Merge pull request #55226 from ClickHouse/server-provides-keeper
Make clickhouse-server provides keeper
2023-10-07 23:49:14 +02:00
Alexey Milovidov
186bd9c859
Merge pull request #55243 from den-crane/patch-62
Doc. fix a mistake about refcount
2023-10-07 23:43:16 +02:00
Alexey Milovidov
6b9696fd97
Update 02891_functions_over_sparse_columns.sql 2023-10-08 00:38:04 +03:00
robot-clickhouse-ci-1
2cd23c293a
Merge pull request #55307 from ClickHouse/make_code_more_stupid
Make `HandlingRuleHTTPHandlerFactory` more stupid, but less error prone
2023-10-07 22:12:43 +02:00
Robert Schulze
0450da9bac
Naming fixes 2023-10-07 20:12:10 +00:00
Robert Schulze
312fe8e31d
Merge pull request #55298 from rschu1ze/use_mysql_types_in_show_columns
Make `use_mysql_types_in_show_columns` independent from the connection
2023-10-07 22:03:24 +02:00
Robert Schulze
727f831c0c
Update croaring
Fixes: #55270
2023-10-07 19:44:19 +00:00
Robert Schulze
4c4f81d2a8
Remove x bit from sql test file 2023-10-07 17:12:12 +00:00
alesapin
4f7b81b6a5 Make HandlingRuleHTTPHandlerFactory more stupid, but less error prone 2023-10-07 18:12:58 +02:00
Kseniia Sumarokova
ab9e6f5f6a
Merge pull request #55282 from Priyansh121096/patch-1
Fix typo in TableFunctionS3.cpp
2023-10-07 18:05:52 +02:00
robot-ch-test-poll1
59a3b3356f
Merge pull request #55296 from rschu1ze/qc-better-logging
Improve logging in query cache
2023-10-07 17:59:19 +02:00
kssenii
1a839adec7 Add one more try-catch 2023-10-07 15:31:15 +02:00
Robert Schulze
cfa2e99b6e
Merge remote-tracking branch 'rschu1ze/master' into more-mysql-compat 2023-10-07 13:08:24 +00:00
Robert Schulze
2e8cac6e9c
Make use_mysql_types_in_show_columns independent from the connection
Fixes: #55273
2023-10-07 13:06:41 +00:00
Kseniia Sumarokova
f0747b7a6e
Update MaterializedPostgreSQLConsumer.cpp 2023-10-07 14:28:52 +02:00
Robert Schulze
a72ae92328
Make the logging in the query cache easier to understand 2023-10-07 11:33:11 +00:00
robot-ch-test-poll4
2406542931
Merge pull request #55266 from AleksaC/patch-1
Fix query_log cache usage column name in docs
2023-10-07 13:00:24 +02:00
alesapin
e55a615f06
Merge pull request #55253 from vitlibar/fix-async-insert-access-check-for-defaults
Evaluate defaults during async insert safer
2023-10-07 12:26:35 +02:00
kssenii
289020f454 Merge remote-tracking branch 'upstream/master' into fix-background-download-in-cache 2023-10-07 12:19:33 +02:00
Kseniia Sumarokova
97976bc978
Merge pull request #55260 from ClickHouse/kssenii-patch-6
Fix data race in context
2023-10-07 12:17:54 +02:00
Ilya Yatsishin
262373c9ef
Merge pull request #51435 from qoega/sources-with-subs
Build artifact with sources with submodules in Darwin special build
2023-10-07 11:07:27 +02:00
Daniel Byta
158e1ed703
Update client-libraries.md 2023-10-07 03:50:08 +03:00
Priyansh Agrawal
2562f66efd
Fix typo in TableFunctionS3.cpp 2023-10-07 00:55:08 +01:00
Azat Khuzhin
ce7f0e5ae7 Fix functions execution over sparse columns
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-06 16:16:50 +02:00
kssenii
9df0b8f67c Fix 2023-10-06 12:41:37 +02:00
Aleksa Cukovic
db0cf729f5
Fix query_log cache usage column name in docs
It is not called `query_log_usage` but `query_cache_usage`
2023-10-05 22:31:48 +02:00
Vitaly Baranov
cf7eea569b Fix quota consumption 2023-10-05 15:46:36 +02:00
Mikhail f. Shiryaev
351aeb0189
Beautify packages/build a bit 2023-10-05 15:43:47 +02:00
Mikhail f. Shiryaev
da6a4528f1
Make clickhouse-server package providing clickhouse-keeper 2023-10-05 15:43:47 +02:00
Mikhail f. Shiryaev
be763ee437
Use proper bash syntax for default values 2023-10-05 15:43:46 +02:00
Mikhail f. Shiryaev
2b2459b0e2
Reorder the packages configs for maintainability 2023-10-05 15:43:18 +02:00
Kseniia Sumarokova
68a3cd57a1
Fix data race in context 2023-10-05 15:29:56 +02:00