This way the remote nodes will not need to send all the rows, which
decreases network I/O, and it also makes queries with
optimize_aggregation_in_order=1/LIMIT X and without ORDER BY faster, since
the initiator will only need to read the first X rows instead of all of
them (but note that for this your data needs to be sharded correctly, or
you may get inaccurate results).
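As a sketch (table, cluster, and column names are hypothetical), this is the shape of query that benefits:

```sql
-- Hypothetical Distributed table sharded by user_id.
CREATE TABLE dist AS local
ENGINE = Distributed(test_cluster, default, local, user_id);

-- With LIMIT push-down, each shard can stop after producing the first
-- 10 groups instead of streaming all rows to the initiator.
SELECT user_id
FROM dist
GROUP BY user_id
LIMIT 10
SETTINGS distributed_push_down_limit = 1, optimize_aggregation_in_order = 1;
```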
Note that having lots of processing stages will increase the complexity
of the interpreter (which is already not that clean and simple), although
using a separate QueryProcessingStage looks pretty natural.
Another option is to always use WithMergeableStateAfterAggregation, but
in that case it would be impossible to disable only this optimization if
some issue were found with it.
v2: fix OFFSET
v3: convert 01814_distributed_push_down_limit test to .sh and add retries
v4: add test with OFFSET
v5: add the new query stage to the bash completion
v6/tests: use the LIMIT O,L syntax instead of LIMIT L OFFSET O, since the latter is broken in the ANTLR parser
https://clickhouse-test-reports.s3.yandex.net/23027/a18a06399b7aeacba7c50b5d1e981ada5df19745/functional_stateless_tests_(antlr_debug).html#fail1
v7/tests: set use_hedged_requests to 0, to avoid excessive log entries on retries
https://clickhouse-test-reports.s3.yandex.net/23027/a18a06399b7aeacba7c50b5d1e981ada5df19745/functional_stateless_tests_flaky_check_(address).html#fail1
v2: fix optimize_skip_unused_shards_rewrite_in for sharding_key wrapped into function
v3: fix column name for optimize_skip_unused_shards_rewrite_in
v4: fix optimize_skip_unused_shards_rewrite_in with Null
v5:
- squash with Remove query argument for IStreamFactory::createForShard()
- use proper column after function execution (using sharding_key_column_name)
- update the test reference since (X) is now tuple(X)
Add two new settings for the Distributed engine:
- bytes_to_delay_insert
- max_delay_to_insert
If at the beginning of an INSERT there is too much pending data (more
than bytes_to_delay_insert), the INSERT will wait until it shrinks, but
no longer than max_delay_to_insert seconds.
If after this there is still too much pending data, an exception will be
thrown.
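A minimal sketch of setting both knobs on a Distributed table (table and cluster names are hypothetical):

```sql
CREATE TABLE dist AS local
ENGINE = Distributed(test_cluster, default, local, rand())
SETTINGS
    bytes_to_delay_insert = 10000000, -- start delaying INSERTs once ~10MB is pending
    max_delay_to_insert = 60;         -- wait at most 60 seconds, then throw
```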
Also, new profile events were added (by analogy with MergeTree):
- DistributedDelayedInserts (although you could use system.errors
  instead)
- DistributedRejectedInserts
- DistributedDelayedInsertsMilliseconds
So now system.distribution_queue shows accurate statistics, and the
tests no longer require sleep.
But note that with too much pending distributed data this will iterate
over all directories.
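For example, pending data can now be checked reliably (a sketch against system.distribution_queue; column names assumed from that table):

```sql
-- Per-table view of pending distributed sends.
SELECT database, table, error_count, data_files, data_compressed_bytes
FROM system.distribution_queue;
```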
Right now, with distributed_directory_monitor_batch_inserts=1 and
insert_distributed_sync=0, an INSERT into a Distributed table will store
the blocks that should be sent to the remote (and, in case of
prefer_localhost_replica=0, to the localhost too) on the local
filesystem, and send them in the background.
However, there is no limit for this storage, and if the remote is
unavailable (or some other error occurs), these pending blocks may take
significant space, which is not always the desired behaviour.
Add a new Distributed setting, bytes_to_throw_insert, that sets the
limit for how many pending bytes are allowed; if the limit is reached,
an exception will be thrown.
The default is 0, to avoid surprises.
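A sketch of enabling it (table and cluster names are hypothetical):

```sql
-- Throw on INSERT once this Distributed table has more than ~100MB pending.
CREATE TABLE dist AS local
ENGINE = Distributed(test_cluster, default, local, rand())
SETTINGS bytes_to_throw_insert = 100000000;
```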
Right now SYSTEM FLUSH DISTRIBUTED will block:
- INSERT into this Distributed table (requireDirectoryMonitor())
- SELECT * FROM system.distribution_queue
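For reference, the statement in question (table name is hypothetical):

```sql
-- Force all pending blocks of this Distributed table to be sent now.
SYSTEM FLUSH DISTRIBUTED dist;
```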
* Add query data deduplication that excludes duplicated parts in MergeTree family engines.
Query deduplication is based on parts' UUIDs, which must first be enabled with the merge_tree setting
assign_part_uuids=1.
The allow_experimental_query_deduplication setting enables part deduplication; it defaults to false.
A data part UUID is a mechanism for giving a data part a unique identifier.
Having UUIDs and a deduplication mechanism provides the potential for moving parts
between shards while preserving data consistency on the read path:
duplicated UUIDs will cause the root executor to retry the query against one of the replicas, explicitly
asking it to exclude the encountered duplicated fingerprints during distributed query execution.
NOTE: this implementation doesn't provide any knobs to lock a part and hence its UUID. Any mutation/merge will
update the part's UUID.
* Add the _part_uuid virtual column, allowing UUIDs to be used in predicates.
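A minimal sketch of wiring these pieces together (table name t is hypothetical):

```sql
-- Parts only get UUIDs if the MergeTree setting is enabled first.
ALTER TABLE t MODIFY SETTING assign_part_uuids = 1;

-- Opt in to the experimental deduplication on the read path.
SET allow_experimental_query_deduplication = 1;

-- The _part_uuid virtual column can be used in predicates.
SELECT _part_uuid, count()
FROM t
GROUP BY _part_uuid;
```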
Signed-off-by: Aleksei Semiglazov <asemiglazov@cloudflare.com>
address comments
Two new settings (by analogy with the MergeTree family) have been added:
- `fsync_after_insert` - Do fsync for every inserted block. Will decrease
  insert performance.
- `fsync_tmp_directory` - Do fsync for the temporary directory (that is
  used for async INSERT only) after all part operations (writes, renames,
  etc.).
Refs: #17380 (p1)
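A sketch of enabling both on a Distributed table (table and cluster names are hypothetical; setting names as given above):

```sql
CREATE TABLE dist AS local
ENGINE = Distributed(test_cluster, default, local, rand())
SETTINGS fsync_after_insert = 1, fsync_tmp_directory = 1;
```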
* Distributed insertion to one random shard
* add some tests
* add some documentation
* Respect shards' weights
* fine-grained locking
Co-authored-by: Ivan Lezhankin <ilezhankin@yandex-team.ru>
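A sketch, assuming the feature is exposed as the insert_distributed_one_random_shard setting (table name is hypothetical):

```sql
-- An INSERT into a Distributed table without a sharding key goes to one
-- random shard, chosen according to the shards' weights.
SET insert_distributed_one_random_shard = 1;
INSERT INTO dist VALUES (1), (2), (3);
```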
<details>
```
drop() on T1275:
0 DB::StorageDistributed::drop (this=0x7f9ed34f0000) at ../contrib/libcxx/include/__hash_table:966
1 0x000000000d557242 in DB::DatabaseOnDisk::dropTable (this=0x7f9fc22706d8, context=..., table_name=...)
at ../contrib/libcxx/include/new:340
2 0x000000000d6fcf7c in DB::InterpreterDropQuery::executeToTable (this=this@entry=0x7f9e42560dc0, query=...)
at ../contrib/libcxx/include/memory:3826
3 0x000000000d6ff5ee in DB::InterpreterDropQuery::execute (this=0x7f9e42560dc0) at ../src/Interpreters/InterpreterDropQuery.cpp:50
4 0x000000000daa40c0 in DB::executeQueryImpl (begin=<optimized out>, end=<optimized out>, context=..., internal=<optimized out>,
stage=DB::QueryProcessingStage::Complete, has_query_tail=false, istr=0x0) at ../src/Interpreters/executeQuery.cpp:420
5 0x000000000daa59df in DB::executeQuery (query=..., context=..., internal=internal@entry=false, stage=<optimized out>,
may_have_embedded_data=<optimized out>) at ../contrib/libcxx/include/string:1487
6 0x000000000e1369e6 in DB::TCPHandler::runImpl (this=this@entry=0x7f9ddf3a9000) at ../src/Server/TCPHandler.cpp:254
7 0x000000000e1379c9 in DB::TCPHandler::run (this=0x7f9ddf3a9000) at ../src/Server/TCPHandler.cpp:1326
8 0x000000001086fac7 in Poco::Net::TCPServerConnection::start (this=this@entry=0x7f9ddf3a9000)
at ../contrib/poco/Net/src/TCPServerConnection.cpp:43
9 0x000000001086ff2b in Poco::Net::TCPServerDispatcher::run (this=0x7f9e4eba5c00)
at ../contrib/poco/Net/src/TCPServerDispatcher.cpp:114
10 0x00000000109dbe8e in Poco::PooledThread::run (this=0x7f9e4a2d2f80) at ../contrib/poco/Foundation/src/ThreadPool.cpp:199
11 0x00000000109d78f9 in Poco::ThreadImpl::runnableEntry (pThread=<optimized out>)
at ../contrib/poco/Foundation/include/Poco/SharedPtr.h:401
12 0x00007f9fc3cccea7 in start_thread (arg=<optimized out>) at pthread_create.c:477
13 0x00007f9fc3bebeaf in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
StorageDistributedDirectoryMonitor on T166:
0 DB::StorageDistributedDirectoryMonitor::StorageDistributedDirectoryMonitor (this=0x7f9ea7ab1400, storage_=..., path_=...,
pool_=..., monitor_blocker_=..., bg_pool_=...) at ../src/Storages/Distributed/DirectoryMonitor.cpp:81
1 0x000000000dbf684e in std::__1::make_unique<> () at ../contrib/libcxx/include/memory:3474
2 DB::StorageDistributed::requireDirectoryMonitor (this=0x7f9ed34f0000, disk=..., name=...)
at ../src/Storages/StorageDistributed.cpp:682
3 0x000000000de3d5fa in DB::DistributedBlockOutputStream::writeToShard (this=this@entry=0x7f9ed39c7418, block=..., dir_names=...)
at ../src/Storages/Distributed/DistributedBlockOutputStream.cpp:634
4 0x000000000de3e214 in DB::DistributedBlockOutputStream::writeAsyncImpl (this=this@entry=0x7f9ed39c7418, block=...,
shard_id=shard_id@entry=79) at ../src/Storages/Distributed/DistributedBlockOutputStream.cpp:539
5 0x000000000de3e47b in DB::DistributedBlockOutputStream::writeSplitAsync (this=this@entry=0x7f9ed39c7418, block=...)
at ../contrib/libcxx/include/vector:1546
6 0x000000000de3eab0 in DB::DistributedBlockOutputStream::writeAsync (block=..., this=0x7f9ed39c7418)
at ../src/Storages/Distributed/DistributedBlockOutputStream.cpp:141
7 DB::DistributedBlockOutputStream::write (this=0x7f9ed39c7418, block=...)
at ../src/Storages/Distributed/DistributedBlockOutputStream.cpp:135
8 0x000000000d73b376 in DB::PushingToViewsBlockOutputStream::write (this=this@entry=0x7f9ea7a8cf58, block=...)
at ../src/DataStreams/PushingToViewsBlockOutputStream.cpp:157
9 0x000000000d7853eb in DB::AddingDefaultBlockOutputStream::write (this=0x7f9ed383d118, block=...)
at ../contrib/libcxx/include/memory:3826
10 0x000000000d740790 in DB::SquashingBlockOutputStream::write (this=0x7f9ed383de18, block=...)
at ../contrib/libcxx/include/memory:3826
11 0x000000000d68c308 in DB::CountingBlockOutputStream::write (this=0x7f9ea7ac6d60, block=...)
at ../contrib/libcxx/include/memory:3826
12 0x000000000ddab449 in DB::StorageBuffer::writeBlockToDestination (this=this@entry=0x7f9fbd56a000, block=..., table=...)
at ../src/Storages/StorageBuffer.cpp:747
13 0x000000000ddabfa6 in DB::StorageBuffer::flushBuffer (this=this@entry=0x7f9fbd56a000, buffer=...,
check_thresholds=check_thresholds@entry=true, locked=locked@entry=false, reset_block_structure=reset_block_structure@entry=false)
at ../src/Storages/StorageBuffer.cpp:661
14 0x000000000ddac415 in DB::StorageBuffer::flushAllBuffers (reset_blocks_structure=false, check_thresholds=true, this=0x7f9fbd56a000)
at ../src/Storages/StorageBuffer.cpp:605
shutdown() on T1275:
0 DB::StorageDistributed::shutdown (this=0x7f9ed34f0000) at ../contrib/libcxx/include/atomic:1612
1 0x000000000d6fd938 in DB::InterpreterDropQuery::executeToTable (this=this@entry=0x7f98530c79a0, query=...)
at ../src/Storages/TableLockHolder.h:12
2 0x000000000d6ff5ee in DB::InterpreterDropQuery::execute (this=0x7f98530c79a0) at ../src/Interpreters/InterpreterDropQuery.cpp:50
3 0x000000000daa40c0 in DB::executeQueryImpl (begin=<optimized out>, end=<optimized out>, context=..., internal=<optimized out>,
stage=DB::QueryProcessingStage::Complete, has_query_tail=false, istr=0x0) at ../src/Interpreters/executeQuery.cpp:420
4 0x000000000daa59df in DB::executeQuery (query=..., context=..., internal=internal@entry=false, stage=<optimized out>,
may_have_embedded_data=<optimized out>) at ../contrib/libcxx/include/string:1487
5 0x000000000e1369e6 in DB::TCPHandler::runImpl (this=this@entry=0x7f9ddf3a9000) at ../src/Server/TCPHandler.cpp:254
6 0x000000000e1379c9 in DB::TCPHandler::run (this=0x7f9ddf3a9000) at ../src/Server/TCPHandler.cpp:1326
7 0x000000001086fac7 in Poco::Net::TCPServerConnection::start (this=this@entry=0x7f9ddf3a9000)
at ../contrib/poco/Net/src/TCPServerConnection.cpp:43
8 0x000000001086ff2b in Poco::Net::TCPServerDispatcher::run (this=0x7f9e4eba5c00)
at ../contrib/poco/Net/src/TCPServerDispatcher.cpp:114
9 0x00000000109dbe8e in Poco::PooledThread::run (this=0x7f9e4a2d2f80) at ../contrib/poco/Foundation/src/ThreadPool.cpp:199
10 0x00000000109d78f9 in Poco::ThreadImpl::runnableEntry (pThread=<optimized out>)
at ../contrib/poco/Foundation/include/Poco/SharedPtr.h:401
11 0x00007f9fc3cccea7 in start_thread (arg=<optimized out>) at pthread_create.c:477
12 0x00007f9fc3bebeaf in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
```
</details>
* Use only |name_parts| as primary name source
* Restore legacy logic for table restoration
* Fix build
* Fix tests
* Add pytest server config
* Fix tests
* Fixes from review
Possible values:
- 1 - Do not merge aggregation states from different servers for distributed query processing, in case it is certain that there are different keys on different shards.
- 2 - Same as 1, but also apply the ORDER BY and LIMIT stages.
The previous set of QueryProcessingStage values did not allow this, but
after WithMergeableStateAfterAggregation was introduced, the following
queries can also be optimized under
optimize_distributed_group_by_sharding_key:
- GROUP BY sharding_key LIMIT
- GROUP BY sharding_key LIMIT BY
- GROUP BY sharding_key ORDER BY
It still does not support:
- WITH TOTALS (looks like it can be supported)
- WITH ROLLUP (looks like it can be supported)
- WITH CUBE
- SETTINGS extremes=1 (looks like it can be supported)
These will be implemented separately.
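A sketch of a newly optimizable query (table dist, sharded by user_id, is hypothetical):

```sql
SELECT user_id, count()
FROM dist
GROUP BY user_id
ORDER BY user_id
LIMIT 10
SETTINGS optimize_distributed_group_by_sharding_key = 1;
```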
vX: fixes
v2: fix WITH *
v3: fix extremes
v4: fix LIMIT OFFSET (and make a little bit cleaner)
v5: fix HAVING
v6: fix ORDER BY
v7: rebase against 20.7
v8: move out WithMergeableStateAfterAggregation
v9: add optimize_distributed_group_by_sharding_key into test names
An example of such a function is rand().
This patch disables only optimize_skip_unused_shards, i.e. the INSERT
code path is not changed, so it will work as before.
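A sketch of such a table (names are hypothetical); with a non-deterministic sharding key the optimization cannot prune shards safely:

```sql
CREATE TABLE dist AS local
ENGINE = Distributed(test_cluster, default, local, rand());

-- optimize_skip_unused_shards has no safe pruning to do here,
-- so this patch disables it for such sharding keys.
SELECT * FROM dist WHERE key = 42
SETTINGS optimize_skip_unused_shards = 1;
```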
1. Moved Volume to a separate file
2. Created the IVolume interface and implemented the current behaviour in an implementation of the new interface, VolumeJBOD
3. Replaced all old Volume usages with the new VolumeJBOD; where JBOD is unnecessary, just IVolume is used
4. Removed the old Volume completely
5. Moved StoragePolicy to separate files
6. Moved DiskSelector to separate files
7. Removed the DiskSpaceMonitor file
Before this patch it printed 3 times:
- from StorageDistributed::getProcessingStageImpl()
- from StorageDistributed::read()
- from StorageDistributed::getProcessingStageImpl() (from StorageDistributed::read() -> getSampleBlock())
(But this should be optimized)
I know at least one way to fool that optimization: using something like
`if(col1 > 0, col1, col2)` as the sharding key (not a common sharding
key, I would say, but it can be useful if it works correctly), so let's
disable it by default.
StorageDistributed::shutdown() does not acquire the lock that controls
access to cluster_nodes_data, thus it is not synced with
requireDirectoryMonitor(); hence some monitors can be left untracked,
which will trigger a UAF (use-after-free) after DROP TABLE dist.
This is the SIGSEGV from the DirectoryMonitor (with the storage already destroyed):
0 0x0000000008e9f760 in std::__1::__cxx_atomic_load<int> (__order=std::__1::memory_order::seq_cst, __a=0x0)
1 std::__1::__atomic_base<int, false>::load (__m=std::__1::memory_order::seq_cst, this=0x0) <-- this is nullptr
2 std::__1::__atomic_base<int, false>::operator int (this=0x0)
3 DB::ActionBlocker::isCancelled (this=0x7f85e31c9bb8) at ../src/Common/ActionBlocker.h:18
4 DB::StorageDistributedDirectoryMonitor::run (this=0x7f85f93b2a00) at ../src/Storages/Distributed/DirectoryMonitor.cpp:140
After #8756 the problem of having 1 thread for each (distributed table,
disk) pair for distributed sends became even worse (since there can be
multiple disks), so use a predefined thread pool for these tasks, which
can be controlled with the background_distributed_schedule_pool_size knob.
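A sketch of inspecting the knob (it is visible in system.settings in this era):

```sql
SELECT name, value
FROM system.settings
WHERE name = 'background_distributed_schedule_pool_size';
```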
* "lock_acquire_timeout" controls for how long a query will continue to
acquire each lock on its argument tables
* "lock_acquire_timeout_for_background_operations" is a per-table
setting for storages of *MergeTree family
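A sketch of both settings (table t is hypothetical; values are in seconds):

```sql
-- Per-query: fail if a table lock cannot be acquired within 2 minutes.
SET lock_acquire_timeout = 120;

-- Per-table, for background operations of a *MergeTree table.
ALTER TABLE t
MODIFY SETTING lock_acquire_timeout_for_background_operations = 120;
```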