Nikita Taranov
30f2a942c5
Predict size of hash table for GROUP BY ( #33439 )
...
* use AggregationMethod ctor with reserve
* add new settings
* add HashTablesStatistics
* support queries with limit
* support distributed and with external aggregation
* add new profile events
* add some tests
* add perf test
* export cache stats through AsynchronousMetrics
* rm redundant trace
* fix style
* fix 02122_parallel_formatting test
* review fixes
* fix 02122_parallel_formatting test
* apply also to two-level HTs
* try simpler strategy
* increase max_size_to_preallocate_for_aggregation for experiment
* fixes
* Revert "increase max_size_to_preallocate_for_aggregation for experiment"
This reverts commit 6cf6f75704
.
* fix test
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
2022-03-30 22:47:51 +02:00
Nikolai Kochetov
283e20a9a5
Merge pull request #35395 from amosbird/distributedmultiplejoin
...
Validate some thoughts over making sets
2022-03-24 10:30:26 +01:00
Amos Bird
ab7923a26c
Remove comments
2022-03-23 23:21:02 +08:00
Anton Popov
4ff9627f60
fix crash with enabled optimize_functions_to_subcolumns
2022-03-23 01:27:52 +00:00
mergify[bot]
e5a5ab2a40
Merge branch 'master' into distributedmultiplejoin
2022-03-21 10:00:51 +00:00
Amos Bird
243de091bb
Validate some thoughts over making sets
2022-03-21 10:58:44 +08:00
Anton Popov
36ec379aeb
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-14 16:28:35 +00:00
Amos Bird
fe4534d464
Get rid of duplicate query planing.
2022-03-08 00:02:58 +08:00
Anton Popov
df3b07fe7c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-03 22:25:28 +00:00
Frank Chen
b4829465d9
Improve the opentelemetry span logs for INSERT on distributed table ( #34480 )
2022-03-03 12:53:29 +01:00
Maksim Kita
b1a956c5f1
clang-tidy check performance-move-const-arg fix
2022-03-02 18:15:27 +00:00
Anton Popov
fcdebea925
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-25 13:41:30 +03:00
Dmitry Novik
4428e7aa1b
Merge branch 'master' into nv/move-part-count
2022-02-21 02:14:23 -08:00
Azat Khuzhin
774744a86d
Fix allow_experimental_projection_optimization with enable_global_with_statement
...
allow_experimental_projection_optimization requires one more
InterpreterSelectQuery, which with enable_global_with_statement will
apply ApplyWithAliasVisitor if the query is not subquery.
But this should not be done for queries from
MergeTreeData::getQueryProcessingStage()/getQueryProcessingStageWithAggregateProjections()
since this will duplicate WITH statements over and over.
This will also fix scalar.xml perf tests, that leads to the following
error now:
scalar.query0.prewarm0: DB::Exception: Stack size too large.
And since it has very long query in the log, this leads to the following
perf test error:
_csv.Error: field larger than field limit (131072)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-02-16 19:14:47 +03:00
Anton Popov
a661eaf39f
better performance of getting storage snapshot
2022-02-16 02:17:22 +03:00
Anton Popov
18940b8637
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-09 23:38:38 +03:00
Nikolai Kochetov
38fb50f736
Merge pull request #33958 from Algunenano/mv_cacheable_scalars
...
Scalar cache improvements
2022-02-09 16:46:53 +03:00
Nicolae Vartolomei
1cdb50cf13
Disable optimize_trivial_count when deduplication for part movement feature is enabled
...
Fixes #34089
2022-02-07 18:26:49 +00:00
mergify[bot]
dd947f964c
Merge branch 'master' into mv_cacheable_scalars
2022-02-07 10:07:26 +00:00
Amos Bird
a6f0b01e6a
Fix order by after aggregation
2022-02-07 00:42:11 +08:00
Amos Bird
1ab773cc90
Fix aggregation_in_order with normal projection
2022-02-06 16:46:12 +08:00
Anton Popov
9b844c6b42
Merge pull request #32748 from CurtizJ/read-in-order-fixed-prefix
...
Support `optimize_read_in_order` if prefix of sorting key is already sorted
2022-02-03 18:17:08 +03:00
Anton Popov
836a348a9c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-01 15:23:07 +03:00
Maksim Kita
5ef83deaa6
Update sort to pdqsort
2022-01-30 19:49:48 +00:00
Anton Popov
fa883f2ac3
fix tests
2022-01-28 17:23:51 +03:00
Raúl Marín
6c79e147d0
Add a few extra comments around uses_view_source and why it's propagated
2022-01-28 14:47:49 +01:00
Raúl Marín
045c92e2b9
Fix readability issues
2022-01-26 17:36:45 +01:00
Raúl Marín
444a8e2519
Minimize changes, improve scalar subquery for MVs
2022-01-26 17:36:45 +01:00
Raúl Marín
4b5ab80e3b
Better scalar cache handling
...
- Fixes global CTE scalar cache.
- Adds MVs back (views dependent on the source are cached locally and others globally
2022-01-26 17:36:45 +01:00
Anton Popov
e8ce091e68
Merge remote-tracking branch 'upstream/master' into HEAD
2022-01-21 20:11:18 +03:00
alexey-milovidov
6c71a7c40f
Merge pull request #33518 from hexiaoting/dev-limit-offset
...
Fix bug for select from VIEW with format and limit settings
2022-01-12 23:39:31 +03:00
hexiaoting
5c847264f7
Fix bug for select from VIEW with format and limit settings
2022-01-11 20:19:41 +08:00
Anton Popov
7c6f7f6732
support 'optimize_move_to_prewhere' with storage 'Merge'
2021-12-29 20:49:10 +03:00
alexey-milovidov
0a55fa3dc2
Revert "Grouping sets dev"
2021-12-25 20:30:31 +03:00
alexey-milovidov
6b97af4c63
Merge pull request #26869 from taylor12805/grouping-sets-dev
...
Grouping sets dev
2021-12-17 20:50:15 +03:00
Anton Popov
99ebabd822
Merge remote-tracking branch 'upstream/master' into HEAD
2021-12-17 19:02:29 +03:00
Vladimir C
2f2d98d8be
Merge pull request #32811 from Algunenano/materialized_view_scalar_subqueries
2021-12-16 13:35:22 +03:00
Raúl Marín
c61277fbac
Move the logic to the saving part instead
...
This way we can reuse scalars from other queries (non MVs) and
leave the door open for future improvements
2021-12-15 17:47:17 +01:00
Dmitry Novik
56a3f4a000
Cleanup code
2021-12-14 22:15:14 +03:00
Anton Popov
96070b414d
support read_in_order optimization if prefix of sorting key is fixed
2021-12-14 15:54:20 +03:00
cmsxbc
e4b405716f
fix multiply row policies on same column
2021-12-12 09:54:07 +00:00
fanzhou
b94f8878ff
1. refactor ExpressionAnalyzer
...
2. Update formatAST to support grouping sets query with distributed table
3. modify astExpression to support function in grouping sets
2021-11-30 14:58:55 +03:00
fanzhou
e725630fbf
refactor
2021-11-29 19:35:33 +03:00
fanzhou
43db4594ba
some changes
2021-11-29 19:35:33 +03:00
fanzhou
534160b0d4
grouing sets transformer instead of fork processor in aggregating transformer
2021-11-29 18:59:15 +03:00
fanzhou
2912344896
grouping sets support multiple group
2021-11-29 18:32:55 +03:00
MaxTheHuman
b39f5bd45f
grouping-sets: rearrange result columns so that resize is possible after grouping sets
2021-11-26 23:25:25 +03:00
MaxTheHuman
9953e1bc0a
grouping sets: add tests, fix bug
2021-11-26 23:09:23 +03:00
MaxTheHuman
2640f51843
grouping sets: fix
2021-11-26 23:05:07 +03:00
MaxTheHuman
4f07ec2659
grouping sets: make simple aggregation with grouping sets to work
2021-11-26 22:59:56 +03:00