ClickHouse

mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-12-14 18:32:29 +00:00

Author	SHA1	Message	Date
Antonio Andelic	b11f744252	Correctly disable async insert with deduplication when it's not needed (#50663 ) * Correctly disable async insert when it's not used * Better * Add comment * Better * Fix tests --------- Co-authored-by: Nikita Mikhaylov <mikhaylovnikitka@gmail.com>	2023-06-07 20:33:08 +02:00
Azat Khuzhin	79b83c4fd2	Remove superfluous includes of logger_userful.h from headers Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-04-10 17:59:30 +02:00
Azat Khuzhin	e10fb142fd	Fix race for distributed sends from disk Before it was initialized from disk only on startup, but if some INSERT can create the object before, then, it will lead to the situation when it will not be initialized. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-02-28 22:33:36 +01:00
Azat Khuzhin	b5434eac3b	Rename StorageDistributedDirectoryMonitor to DistributedAsyncInsertDirectoryQueue Since #44922 it is not a directory monitor anymore. v2: Remove unused error codes v3: Contains some header fixes due to conflicts with master Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-02-28 22:33:36 +01:00
Azat Khuzhin	1c4659b8e7	Separate out Batch as DistributedAsyncInsertBatch (and also some helpers) Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-02-28 22:33:36 +01:00
Azat Khuzhin	3f892e52ab	Revert "Revert "Merge pull request #44922 from azat/dist/async-INSERT-metrics"" This is the revert of revert since there will be follow up patches to address the issues. This reverts commit `a55798626a`.	2023-02-28 22:33:36 +01:00
Azat Khuzhin	51019bc9f3	Fix a race between Distributed table creation and INSERT into it Initializing queues for pending on-disk files for async INSERT cannot be done after table had been attached and visible to user, since it initializes the per-table counter, that is used during INSERT. Now there is a window, when this counter is not initialized and it will start from the beginning, and this could lead to CANNOT_LINK error: Destination file /data/clickhouse/data/urls_v1/urls_in/shard6_replica1/13129817.bin is already exist and have different inode Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-01-23 09:55:43 +01:00
Azat Khuzhin	a55798626a	Revert "Merge pull request #44922 from azat/dist/async-INSERT-metrics" There are the following problems with this patch: - Looses files on exception - Existing current_batch.txt on startup leads to ENOENT error and hung of distributed sends without ATTACH/DETACH - Race between creating the queue for sending at table startup and INSERT, if it had been created from INSERT, then it will not be initialized from disk They were addressed in #45491, but it makes code more cmoplex and plus since, likely, the release is comming, it is better to revert the change. This reverts commit `94604f71b7`, reversing changes made to `80f6a45376`.	2023-01-21 22:42:00 +01:00
Nikita Mikhaylov	857799fbca	Parallel distributed insert select with s3Cluster [3] (#44955 ) * Revert "Revert "Resurrect parallel distributed insert select with s3Cluster (#41535)"" This reverts commit `b8d9066004`. * Fix build * Better * Fix test * Automatic style fix Co-authored-by: robot-clickhouse <robot-clickhouse@users.noreply.github.com>	2023-01-09 13:30:32 +01:00
Azat Khuzhin	4e76629aaf	Fixes for -Wshorten-64-to-32 - lots of static_cast - add safe_cast - types adjustments - config - IStorage::read/watch - ... - some TODO's (to convert types in future) P.S. That was quite a journey... v2: fixes after rebase v3: fix conflicts after #42308 merged Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-10-21 13:25:19 +02:00
Alexander Tokmakov	b8d9066004	Revert "Resurrect parallel distributed insert select with s3Cluster (#41535 )" This reverts commit `860e34e760`.	2022-10-07 15:53:30 +02:00
Nikita Mikhaylov	860e34e760	Resurrect parallel distributed insert select with s3Cluster (#41535 )	2022-10-06 13:47:32 +02:00
Alexander Tokmakov	f9f85a0e8b	Revert "Parallel distributed insert select from *Cluster table functions (#39107 )" This reverts commit `d3cc234986`.	2022-08-24 15:17:15 +03:00
Nikita Mikhaylov	d3cc234986	Parallel distributed insert select from *Cluster table functions (#39107 )	2022-08-15 12:41:17 +02:00
Nikolai Kochetov	c71256ea38	Remove some commented code.	2022-05-30 13:18:20 +00:00
Nikolai Kochetov	fd97a9d885	Move some resources	2022-05-23 19:47:32 +00:00
Nikolai Kochetov	56feef01e7	Move some resources	2022-05-20 19:49:31 +00:00
Robert Schulze	777b5bc15b	Don't let storages inherit from boost::noncopyable ... IStorage has deleted copy ctor / assignment already	2022-05-03 09:07:08 +02:00
Robert Schulze	330212e0f4	Remove inherited create() method + disallow copying The original motivation for this commit was that shared_ptr_helper used std::shared_ptr<>() which does two heap allocations instead of make_shared<>() which does a single allocation. Turned out that 1. the affected code (--> Storages/) is not on a hot path (rendering the performance argument moot ...) 2. yet copying Storage objects is potentially dangerous and was previously allowed. Hence, this change - removes shared_ptr_helper and as a result all inherited create() methods, - instead, Storage objects are now created using make_shared<>() by the caller (for that to work, many constructors had to be made public), and - all Storage classes were marked as noncopyable using boost::noncopyable. In sum, we are (likely) not making things faster but the code becomes cleaner and harder to misuse.	2022-05-02 08:46:52 +02:00
Amos Bird	4a5e4274f0	base should not depend on Common	2022-04-29 10:26:35 +08:00
Alexey Milovidov	242919eddd	Remove abbreviation	2022-04-18 01:02:49 +02:00
Alexander Tokmakov	da00beaf7f	Merge branch 'master' into mvcc_prototype	2022-04-05 11:14:42 +02:00
Alexey Milovidov	4d6c030d23	Revert "clang-tidy report issues with Medium priority"	2022-04-04 23:41:42 +03:00
Alexander Tokmakov	287d858fda	Merge branch 'master' into mvcc_prototype	2022-03-29 16:24:12 +02:00
Maksim Kita	a1a4552740	Merge pull request #35184 from DevTeamBK/clang-tidy-issues clang-tidy report issues with Medium priority	2022-03-29 13:19:54 +02:00
Alexander Tokmakov	07d952b728	use snapshots for semistructured data, durability fixes	2022-03-17 18:26:18 +01:00
Anton Popov	063917786e	minor fixes	2022-03-14 17:29:18 +00:00
Anton Popov	36ec379aeb	Merge remote-tracking branch 'upstream/master' into HEAD	2022-03-14 16:28:35 +00:00
Rajkumar	3d3b6d1956	clang-tidy report issues with Medium priority	2022-03-10 07:23:49 -08:00
Azat Khuzhin	c4b6342853	Improvements for `parallel_distributed_insert_select` (and related) (#34728 ) * Add a warning if parallel_distributed_insert_select was ignored Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Respect max_distributed_depth for parallel_distributed_insert_select Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Print warning for non applied parallel_distributed_insert_select only for initial query Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Remove Cluster::getHashOfAddresses() Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Forbid parallel_distributed_insert_select for remote()/cluster() with different addresses Before it uses empty cluster name (getClusterName()) which is not correct, compare all addresses instead. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Fix max_distributed_depth check max_distributed_depth=1 must mean not more then one distributed query, not two, since max_distributed_depth=0 means no limit, and distribute_depth is 0 for the first query. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Fix INSERT INTO remote()/cluster() with parallel_distributed_insert_select Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Add a test for parallel_distributed_insert_select with cluster()/remote() Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Return <remote> instead of empty cluster name in Distributed engine Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Make user with sharding_key and w/o in remote()/cluster() identical Before with sharding_key the user was "default", while w/o it it was empty. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 15:24:39 +01:00
Anton Popov	2758db5341	add more comments	2022-03-01 19:32:55 +03:00
Anton Popov	a661eaf39f	better performance of getting storage snapshot	2022-02-16 02:17:22 +03:00
Anton Popov	e8ce091e68	Merge remote-tracking branch 'upstream/master' into HEAD	2022-01-21 20:11:18 +03:00
Anton Popov	7c6f7f6732	support 'optimize_move_to_prewhere' with storage 'Merge'	2021-12-29 20:49:10 +03:00
Anton Popov	6f4d9a53b2	Merge remote-tracking branch 'origin/sparse-serialization' into HEAD	2021-12-01 15:54:33 +03:00
Raúl Marín	b2cfa70541	Reduce dependencies on ASTFunction.h 481 -> 230	2021-11-26 18:21:54 +01:00
Anton Popov	a20922b2d3	Merge remote-tracking branch 'origin/sparse-serialization' into HEAD	2021-11-09 15:36:25 +03:00
Alexander Tokmakov	2e7e195e77	change alter_lock to std::timed_mutex	2021-10-26 13:37:00 +03:00
Alexey Milovidov	fe6b7c77c7	Rename "common" to "base"	2021-10-02 10:13:14 +03:00
Nikolai Kochetov	b997214620	Rename QueryPipeline to QueryPipelineBuilder.	2021-09-14 20:48:18 +03:00
Anton Popov	4c388e3d84	Merge remote-tracking branch 'origin/sparse-serialization' into HEAD	2021-09-09 14:10:16 +03:00
Alexander Tokmakov	13466a7cc3	minor fix	2021-09-03 20:06:38 +03:00
Alexander Tokmakov	42378b5913	fix	2021-08-20 17:05:53 +03:00
Anton Popov	61239343e3	Merge remote-tracking branch 'origin/sparse-serialization' into HEAD	2021-08-20 16:33:30 +03:00
Alexey Milovidov	24cef99065	Merge branch 'master' into fix-bad-cast	2021-08-08 04:00:29 +03:00
alexey-milovidov	c5207fc237	Merge pull request #26466 from azat/optimize-dist-select Rework SELECT from Distributed optimizations	2021-08-08 03:59:32 +03:00
mergify[bot]	dc57254982	Merge branch 'master' into improve_create_or_replace	2021-08-03 11:39:07 +00:00
Anton Popov	e36736b50c	Merge remote-tracking branch 'origin/sparse-serialization' into HEAD	2021-08-02 22:52:02 +03:00
Azat Khuzhin	2fb95d9ee0	Rework SELECT from Distributed query stages optimization Before this patch it wasn't possible to optimize simple SELECT * FROM dist ORDER BY (w/o GROUP BY and DISTINCT) to more optimal stage (QueryProcessingStage::WithMergeableStateAfterAggregationAndLimit), since that code was under allow_nondeterministic_optimize_skip_unused_shards, rework it and make it possible. Also now distributed_push_down_limit is respected for optimize_distributed_group_by_sharding_key. Next step will be to enable distributed_push_down_limit by default. v2: fix detection of aggregates	2021-08-02 21:04:29 +03:00
Alexey Milovidov	edfeb0957f	Fix strange code	2021-07-24 04:52:18 +03:00

1 2 3

134 Commits