Commit Graph

823 Commits

Author SHA1 Message Date
Amos Bird
882b2a3348
CTE 2020-09-13 09:39:17 +08:00
alexey-milovidov
f2e9a09b11
Merge pull request #14585 from ClickHouse/join-processor
Remove Join from ExpressionActions
2020-09-08 23:56:09 +03:00
Nikolai Kochetov
39aad9979a Remove Join from ExpressionActions. 2020-09-08 13:40:53 +03:00
Nikolai Kochetov
c761970bc5 Merge branch 'master' into pipes-and-creating-sets 2020-09-04 11:38:35 +03:00
Nikolai Kochetov
b7e53208d1 Fix tests. 2020-09-04 11:36:47 +03:00
Azat Khuzhin
10b4f3b41f Optimize queries with LIMIT/LIMIT BY/ORDER BY for distributed with GROUP BY sharding_key
Previous set of QueryProcessingStage does not allow to do this.
But after WithMergeableStateAfterAggregation had been introduced the
following queries can be optimized too under
optimize_distributed_group_by_sharding_key:
- GROUP BY sharding_key LIMIT
- GROUP BY sharding_key LIMIT BY
- GROUP BY sharding_key ORDER BY

And right now it is still not supports:
- WITH TOTALS (looks like it can be supported)
- WITH ROLLUP (looks like it can be supported)
- WITH CUBE
- SETTINGS extremes=1 (looks like it can be supported)
But will be implemented separatelly.

vX: fixes
v2: fix WITH *
v3: fix extremes
v4: fix LIMIT OFFSET (and make a little bit cleaner)
v5: fix HAVING
v6: fix ORDER BY
v7: rebase against 20.7
v8: move out WithMergeableStateAfterAggregation
v9: add optimize_distributed_group_by_sharding_key into test names
2020-09-03 00:52:51 +03:00
Azat Khuzhin
4043be3121 Add new query processing stage WithMergeableStateAfterAggregation
Process query until the stage where the aggregate functions were
calculated and finalized.

It will be used for optimize_distributed_group_by_sharding_key.

v2: fix aliases
v3: Fix protocol ABI breakage due to WithMergeableStateAfterAggregation
    Conditions >= for QueryProcessingStage::Enum has been verified, and they
    are ok (in InterpreterSelectQuery).
2020-09-03 00:51:08 +03:00
Nikolai Kochetov
acfd35a908 Update QueryPipeline::addCreatingSetsTransform 2020-09-02 16:04:54 +03:00
Nikolai Kochetov
95bbda3667 Specify columns after ARRAY JOIN and JOIN.
Add ArrayJoinTransform and ArrayJoinStep.
2020-08-14 21:24:03 +03:00
Vitaly Baranov
cd372de417 Use SettingUInt64 only in Settings, use UInt64 in other places. 2020-07-31 19:11:27 +03:00
Nikolai Kochetov
be24184deb Ignore constants for remote query from local replica. 2020-07-29 17:27:37 +03:00
Artem Zuikov
2afd123eda
Refactoring: extract TreeOptimizer from SyntaxAnalyzer (#12645) 2020-07-22 20:13:05 +03:00
Alexey Milovidov
ef930df484 Avoid overflow in LIMIT #10470 #11372 2020-07-12 08:18:01 +03:00
Nikolai Kochetov
8587c2025d Merge branch 'master' into initial-explain 2020-06-29 23:35:56 +03:00
Nikolai Kochetov
40b398c8c4 Merge branch 'master' into better-distinct-for-query-plan 2020-06-29 11:14:05 +03:00
Nikolai Kochetov
893a75bcf7 Merge branch 'master' into initial-explain 2020-06-27 12:45:40 +03:00
Nikolai Kochetov
ca01094f4b More actions description. 2020-06-27 12:31:11 +03:00
Nikolai Kochetov
e1481ccfc2 Add initial explain. 2020-06-27 12:30:38 +03:00
Artem Zuikov
54fd9ce76b fix totals 2020-06-26 03:23:11 +03:00
Artem Zuikov
2d7d389b77 better ExpressionAction::execute() 2020-06-25 23:28:41 +03:00
Artem Zuikov
ae5ed098f4 remove useless logic 2020-06-25 21:18:27 +03:00
Artem Zuikov
801efc387a extract JOIN in own plan step 2020-06-25 20:40:25 +03:00
Nikolai Kochetov
58b3b5c2f9 Rename step. 2020-06-22 13:21:13 +03:00
Nikolai Kochetov
d7d334bf6f Update query plan traits for DISTINCT. Add more comments. 2020-06-22 13:18:28 +03:00
alesapin
b1e8976df4 Merge with master 2020-06-22 12:04:27 +03:00
alesapin
b9e74f4e82 Merge branch 'master' into atomic_metadata5 2020-06-22 12:03:53 +03:00
Nikolai Kochetov
819ea4fb89 Fix tests. 2020-06-20 16:48:21 +03:00
Nikolai Kochetov
0c6129ac7c Fix build. 2020-06-20 11:52:41 +03:00
Nikolai Kochetov
4d70cc34c1 Merge branch 'master' into query-plan 2020-06-20 10:53:48 +03:00
alexey-milovidov
4129300fdb
Merge pull request #11788 from ClickHouse/fix-too-many-threads
Fix too many threads
2020-06-20 08:24:47 +03:00
alesapin
85070ea2fd Remove unused variable 2020-06-19 22:41:24 +03:00
alesapin
4c0879ae30 Better logging in storages 2020-06-19 20:17:13 +03:00
Nikolai Kochetov
9ca456f255 Try fix totals. 2020-06-19 19:58:27 +03:00
Nikolai Kochetov
4cafa3d703 Try fix totals. 2020-06-19 19:51:44 +03:00
Nikolai Kochetov
f6cd3f43bb Fix Context leak in InterpreterSelectQuery 2020-06-19 17:42:01 +03:00
Nikolai Kochetov
6c9c33f870 Fix Context leak in InterpreterSelectQuery 2020-06-19 17:00:11 +03:00
Nikolai Kochetov
ec892775a5 Fix Context leak in InterpreterSelectQuery 2020-06-19 16:34:40 +03:00
alexey-milovidov
3e06182079
Update InterpreterSelectQuery.cpp 2020-06-19 14:28:48 +03:00
Nikolai Kochetov
0335c698e0 Limit the number of threads. 2020-06-19 13:57:44 +03:00
Nikolai Kochetov
9ddd7b98cc Fix tests. 2020-06-19 12:47:15 +03:00
Nikolai Kochetov
6708ae75f3 Fix build. 2020-06-19 11:30:21 +03:00
Nikolai Kochetov
cda4de40d7 Merge branch 'master' into query-plan 2020-06-19 11:29:15 +03:00
Nikolai Kochetov
a99d9f7712 Fix build. 2020-06-18 21:36:37 +03:00
Nikolai Kochetov
b7bed87a7f Use QueryPlan in InterpreterSelectQuery [part 3]. 2020-06-18 21:29:33 +03:00
Nikolai Kochetov
45adacf0bc Use QueryPlan in InterpreterSelectQuery [part 2]. 2020-06-18 20:45:00 +03:00
Amos Bird
d524301038
add scalars in time 2020-06-19 00:14:19 +08:00
alesapin
d79982f497 Better locks in Storages 2020-06-18 19:10:47 +03:00
Nikolai Kochetov
e0841360db Use QueryPlan in InterpreterSelectQuery [part 1]. 2020-06-18 16:38:45 +03:00
Nikolai Kochetov
5bb2ddc6f9 Add DISTINCT trait for QueryPlanStep. 2020-06-18 16:00:16 +03:00
Nikolai Kochetov
9ad00187e3 Update InterpreterSelectQuery.. 2020-06-17 23:19:55 +03:00
Nikolai Kochetov
169ad5e805 Add FinishSortingStep. 2020-06-17 22:57:13 +03:00
Nikolai Kochetov
e8049d34c5 Add OffsetsStep. 2020-06-17 22:38:45 +03:00
Nikolai Kochetov
69ff3a8f7d Add ExtremesStep. 2020-06-17 22:30:36 +03:00
Nikolai Kochetov
d795b2b037 Add FillingStep 2020-06-17 22:20:30 +03:00
Nikolai Kochetov
84fd7fe51c Update having. 2020-06-17 22:02:46 +03:00
Nikolai Kochetov
2cffa56914 Add RollupStep and CubeStep. 2020-06-17 20:20:09 +03:00
Nikolai Kochetov
d41db3622f Add RollupStep and CubeStep. 2020-06-17 20:15:24 +03:00
alesapin
51a47560e0 Fix segmentation fault 2020-06-17 19:59:26 +03:00
Nikolai Kochetov
c5d9379df0 Add TotalsHavingStep. 2020-06-17 19:54:51 +03:00
alesapin
dffdece350 getColumns in StorageInMemoryMetadta (only compilable) 2020-06-17 19:39:58 +03:00
Nikolai Kochetov
1ad4f2c0fd Add CreatingSets step. 2020-06-17 18:50:21 +03:00
Nikolai Kochetov
f279df6015 Added AggregatingStep. 2020-06-17 17:21:48 +03:00
alesapin
31abbe5dbd Select query in metadata 2020-06-17 17:06:22 +03:00
Nikolai Kochetov
582ea24469 Added AddingDelayedStreamStep 2020-06-17 16:38:17 +03:00
alesapin
1afdebeebd Primary key in storage metadata 2020-06-17 15:39:20 +03:00
alesapin
eca6e9087e Fix race condition 2020-06-17 14:52:19 +03:00
alesapin
ba04d02f1e Compilable sorting key in metadata 2020-06-17 14:05:11 +03:00
alesapin
62f2c17a66 Secondary indices in StorageInMemoryMetadata 2020-06-17 12:38:47 +03:00
Nikolai Kochetov
fdacc9f90e Add MergingAggregatedStep. 2020-06-16 21:57:21 +03:00
Nikolai Kochetov
148257abb5 Add LimitStep. 2020-06-16 21:23:01 +03:00
Nikolai Kochetov
5bb1d5cc43 Add LimitByStep. 2020-06-16 19:49:49 +03:00
Nikolai Kochetov
603bcdde2d Add DistinctStep. 2020-06-16 19:13:07 +03:00
Nikolai Kochetov
08e9ce8661 Add MergingSortedStep. 2020-06-16 18:47:40 +03:00
Nikolai Kochetov
f563193967 Add MergeSortingStep. 2020-06-16 18:08:01 +03:00
Nikolai Kochetov
815ac03892 Add PartialSortingStep. 2020-06-16 17:45:52 +03:00
alesapin
71f99a274d Compileable getSampleBlockWithColumns in StorageInMemoryMetadata 2020-06-16 17:25:08 +03:00
Nikolai Kochetov
4407bd7daa Added source steps. 2020-06-16 17:11:19 +03:00
Nikolai Kochetov
be8029e54f Expression and Filter steps. 2020-06-16 15:02:10 +03:00
alesapin
eae901f58d Merge branch 'master' into atomic_metadata5 2020-06-16 14:44:45 +03:00
Nikolai Kochetov
1bb72205bd Expression and Filter steps. 2020-06-16 12:42:25 +03:00
Anton Popov
eabbabed04 fix 'LIMIT WITH TIES' with aliases 2020-06-16 01:24:00 +03:00
Nikolai Kochetov
03e0a5ddbc Merge branch 'master' into query-plan 2020-06-15 23:38:35 +03:00
alesapin
36ba0192df Metadata in read and write methods of IStorage 2020-06-15 22:08:58 +03:00
Nikolai Kochetov
b725df6366 Added ReadFromStorageStep. 2020-06-15 18:17:44 +03:00
Artem Zuikov
24059efad5
Change push down logic in VIEW (#11513) 2020-06-15 15:36:10 +03:00
alexey-milovidov
fd6115cccf
Merge pull request #11648 from ClickHouse/simplify-code-functions-comparison-3
Allow comparison with String in index analysis; simplify code.
2020-06-15 05:47:03 +03:00
Alexey Milovidov
8dac30ae95 Split file for better build times 2020-06-14 21:42:10 +03:00
Alexey Milovidov
2cd82a25f5 Remove trivial count query optimization if row-level security is set #11352 2020-06-13 23:59:20 +03:00
Alexey Milovidov
f2cfb7d132 Whitespaces 2020-06-08 00:05:36 +03:00
Anton Popov
5c42408add
Merge pull request #9113 from dimarub2000/group_by_in_order_optimization
[WIP] Optimization of GROUP BY with respect to table sorting key.
2020-06-06 14:25:59 +03:00
Artem Zuikov
9717f3cfe5 make clang tidy happy 2020-06-05 13:27:54 +03:00
Artem Zuikov
6d211bec19 fix trivial count with join 2020-06-05 01:01:40 +03:00
Nikolai Kochetov
53d12f5ab8 Try fix build. 2020-06-01 20:06:21 +03:00
Nikolai Kochetov
325393d340 Try fix build. 2020-06-01 19:58:36 +03:00
Nikolai Kochetov
3a0acb00c1 Remove some streams. 2020-06-01 19:31:06 +03:00
alexey-milovidov
9984e989c1
Merge pull request #11317 from ClickHouse/logger-fmt-2
Preparation for structured logging
2020-06-01 00:11:29 +03:00
alexey-milovidov
97d9f218af
Merge pull request #11296 from ClickHouse/fix-false-too-slow
Fix bug with Throttler and query speed estimation
2020-05-31 14:18:29 +03:00
Alexey Milovidov
25f941020b Remove namespace pollution 2020-05-31 00:57:37 +03:00
Dmitry
4b0d32f026 Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-05-31 00:21:02 +03:00
Alexey Milovidov
e4a329a47f Checking max execution speed on all nodes #5732 2020-05-29 23:37:37 +03:00
Nikolai Kochetov
f7e0e87c7d Try fix tests. 2020-05-29 19:35:09 +03:00
Nikolai Kochetov
bd814d83de Try fix tests. 2020-05-29 16:04:03 +03:00
Nikolai Kochetov
bbb57b7789 Try fix tests. 2020-05-29 13:31:19 +03:00
Nikolai Kochetov
534e55f5b5 try fix tests. 2020-05-28 19:01:12 +03:00
Nikolai Kochetov
557e531ff3 Fix tests. 2020-05-28 15:06:31 +03:00
Nikolai Kochetov
1fa795988f Remove some code. 2020-05-28 11:24:59 +03:00
Dmitry
38c585f867 Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-05-26 21:27:50 +03:00
Nikolai Kochetov
c83014bc76
Merge pull request #11114 from azat/max_threads-simple-query-optimization-fix
Do not reserve extra threads after max threads optimization for simple queries
2020-05-25 14:05:46 +03:00
Alexey Milovidov
7e1813825b Return old names of macros 2020-05-24 01:24:01 +03:00
Alexey Milovidov
d6bc4d2bd0 find {base,src,programs} -name '*.h' -or -name '*.cpp' | xargs grep -l -P 'LOG_\w+\([^,]+, [^<]+ << "[^"]+" << [^<]+\);' | xargs sed -i -r -e 's/(LOG_\w+)\(([^,]+), ([^<]+) << "([^"]+)" << ([^<]+)\);/\1_FORMATTED(\2, "{}\4{}", \3, \5);/' 2020-05-23 20:14:57 +03:00
Azat Khuzhin
d4804e070f Do not reserve extra threads after max threads optimization for simple queries
Before this patch pipeline.setMaxThreads() was called with 0 (since
streams.size() == 0, while pipes.size() == 1), which leads that
pipeline.getMaxThreads() returns max_streams.

While this threads were not used, they were reserved, which should not
happens either.
2020-05-23 13:19:09 +03:00
Dmitry
47778c0259 Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-05-21 23:45:12 +03:00
Dmitry
ee22a3ad30 Adding Info in FinalizingSimpleTransform 2020-05-21 23:34:53 +03:00
Artem Zuikov
9b38324028
fix JOIN with PREWHERE with second analyze() (#11051) 2020-05-20 12:29:23 +03:00
alexey-milovidov
f7b1263d41
Merge pull request #10268 from ClickHouse/max-rows-to-sort
Added failing tests about "max_rows_to_sort" setting.
2020-05-17 22:21:30 +03:00
alexey-milovidov
6c49967395
Merge pull request #10802 from PerformanceVision/offset_without_limit
Add the ability to set an OFFSET clause without a LIMIT
2020-05-17 12:06:22 +03:00
Alexey Milovidov
f2d438b79f Implement max_rows_to_sort again 2020-05-17 10:27:55 +03:00
Vitaly Baranov
d92ac9ea47
Merge branch 'master' into RBAC-10 2020-05-15 14:12:43 +03:00
Ri
fc7afaa639
Bitonic sort on GPU (OpenCL) (#10232) 2020-05-15 03:01:14 +03:00
Dmitry
bef37517b8 fixes after reivew 2 2020-05-14 17:20:49 +03:00
Dmitry
3221c7de85 fixes previous changes 2 2020-05-14 16:56:17 +03:00
Vitaly Baranov
5b84121d81 Improve system tables for quotas. Remove function currentQuota(). 2020-05-13 19:40:48 +03:00
Dmitry
84be0fe847 fixes previous changes 2020-05-13 17:21:22 +03:00
Dmitry
bbe0245b9d changes after review #1 2020-05-13 16:49:10 +03:00
Dmitry
c4295d97f9 Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-05-12 17:57:32 +03:00
Dmitry
3ec80b5531 now aggregates in parallel 2020-05-12 17:50:13 +03:00
alexey-milovidov
15e38c8b44
Merge pull request #10563 from azat/SELECT-ALIAS-CAST
Fix SELECT of column ALIAS which default expression type different from column type
2020-05-11 20:20:34 +03:00
Guillaume Tassery
b0df037f25 Fix when we set a LIMIT to 0 2020-05-11 19:18:07 +07:00
Guillaume Tassery
c25ffb5930 Fix with a limit of 0 for system.numbers 2020-05-11 19:09:02 +07:00
Guillaume Tassery
5b6e9f2a7e offset_without_limitMerge branch 'master' of github.com:ClickHouse/ClickHouse into offset_without_limit 2020-05-11 16:05:44 +07:00
Guillaume Tassery
a64d44799d newline 2020-05-11 14:13:33 +07:00
Guillaume Tassery
f67cd0d3f2 remove new line 2020-05-11 13:46:16 +07:00
Guillaume Tassery
c7dfe9e93d cosmetic 2020-05-11 11:47:31 +07:00
Guillaume Tassery
654d6fcd26 cosmetic 2020-05-11 11:35:21 +07:00
Guillaume Tassery
802ae8948a cosmetic 2020-05-11 10:36:52 +07:00
Guillaume Tassery
f6fe196440 first draft fixed 2020-05-11 09:44:43 +07:00
Nikolai Kochetov
66111a6744 Try nut update scalars for only_analyze. 2020-05-09 21:43:16 +03:00
Dmitry
0286b60ed6 return multiple blocks 2020-05-08 22:46:52 +03:00
Dmitry
e7b747b0b7 limit fixes + func(primary_key) group by works 2020-05-08 16:13:50 +03:00
Guillaume Tassery
2e719314a3 push functional draft 2020-05-07 20:40:50 +07:00
Nikolai Kochetov
c72b0a45bc Do not filter totals. 2020-05-06 14:30:17 +03:00
Guillaume Tassery
8ce606571e write first draft of offset clause 2020-05-06 13:50:55 +07:00
Dmitry
e98c23a8cf read in order enabled 2020-05-05 17:35:23 +03:00
Azat Khuzhin
53aea23de0 Fix SELECT of column ALIAS which default expression type different from column type 2020-04-28 22:43:10 +03:00
Artem Zuikov
e34a9457e6
Improve right table insert for Partial MergeJoin on disk (#10467) 2020-04-28 16:55:50 +03:00
Dmitry
d9ac0e8fef Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-04-26 14:44:06 +03:00
Alexey Milovidov
73dd78bac1 Merge branch 'master' into clang-tidy-8 2020-04-22 21:04:47 +03:00
Alexey Milovidov
0a7edce036 Checkpoint 2020-04-22 09:01:33 +03:00
Alexey Milovidov
e6acb35d95 Added a fix from Vadim Plakhtinskiy @VadimPlh 2020-04-22 07:32:05 +03:00
alexey-milovidov
fcf5c3c6e1
Merge pull request #10399 from ClickHouse/fix-distributed_group_by_no_merge-segfault
Fix segfault with distributed_group_by_no_merge
2020-04-22 01:28:02 +03:00
Nikolai Kochetov
1d2388de39 Drop currently calculated totals and extremes before aggregation. 2020-04-21 11:45:28 +03:00
Alexey Milovidov
d7264b292d Merge branch 'master' into sorting-processors 2020-04-20 09:29:41 +03:00
Dmitry
3a18982e2b removed debug cerr 2020-04-18 13:04:49 +03:00
Dmitry
25cd4ce132 Merge branch 'master' of github.com:yandex/ClickHouse into group_by_in_order_optimization 2020-04-18 12:56:34 +03:00
Alexey Milovidov
b16ee538ab Merge branch 'master' into sorting-processors 2020-04-17 05:31:55 +03:00
Artem Zuikov
47cb670213 Merge branch 'master' into joins 2020-04-16 19:06:13 +03:00
Alexey Milovidov
cdeda4ab91 Fix usage of max_parser_depth setting; remove harmful default function arguments 2020-04-16 04:06:10 +03:00
Nikolai Kochetov
153f795ebe Merge branch 'master' into sorting-processors 2020-04-15 12:07:05 +03:00
Alexey Milovidov
8ad04d4fec Remove useless code 2020-04-15 00:05:45 +03:00
Artem Zuikov
6106b2bfc7 Merge branch 'master' into joins 2020-04-13 20:37:27 +03:00
alexey-milovidov
0782052768
Merge pull request #10131 from ClickHouse/fix-extremes-for-processors
Fix extremes for processors
2020-04-12 02:31:39 +03:00
Nikolai Kochetov
1b6d80bfae Merge branch 'master' into sorting-processors 2020-04-10 12:07:21 +03:00
Nikolai Kochetov
d3acce8afa Fix build. 2020-04-10 12:04:28 +03:00
Alexander Kazakov
497df3086f Merge branch 'master' into timed_rwlock
Change-Id: I620bfde2121ff013773b001d514b40b1e796a58b
2020-04-10 11:38:20 +03:00
Alexander Kazakov
26dd6140b2 Added new config settings to control timeouts
* "lock_acquire_timeout" controls for how long a query will continue to
acquire each lock on its argument tables
 * "lock_acquire_timeout_for_background_operations" is a per-table
setting for storages of *MergeTree family
2020-04-09 21:10:27 +03:00
Artem Zuikov
9f0db4dfef Merge branch 'master' into joins 2020-04-09 11:53:18 +03:00
Artem Zuikov
8d6f641796 join over dictionary concept 2020-04-08 21:59:52 +03:00
Nikolai Kochetov
458c7f516d Fix extremes for Processors. 2020-04-08 21:50:14 +03:00
alexey-milovidov
09a397a68f
Merge pull request #9997 from azat/dist-on-dist-fixes
Fix Distributed-over-Distributed with one only shard in nested table
2020-04-08 03:45:29 +03:00
Artem Zuikov
8297683972
renames: AnalyzedJoin -> TableJoin, Join -> HashJoin (#10065) 2020-04-07 12:48:47 +03:00
Artem Zuikov
174136a1e0
inflating cross join (#10029) 2020-04-06 16:39:57 +03:00
Azat Khuzhin
1232760f78 Fix Distributed-over-Distributed when nested table has only one shard 2020-04-04 13:47:35 +03:00
Ivan Lezhankin
06446b4f08 dbms/ → src/ 2020-04-03 18:14:31 +03:00