ClickHouse

mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-12-17 03:42:48 +00:00

Author	SHA1	Message	Date
Kseniia Sumarokova	427ae808ba	Merge pull request #35191 from DevTeamBK/RemoteHostFilter_improvement Added RemoteHostFilter check for MYSQL and postgresSQL	2022-03-15 09:58:08 +01:00
Antonio Andelic	067b79b00b	Merge branch 'master' into parallel-downloading-url-engine	2022-03-15 07:55:41 +00:00
lgbo-ustc	abfaa82bca	fixed hive query bugs	2022-03-15 12:01:34 +08:00
Anton Popov	ccbddd53a3	fix mutations in tables with enabled sparse columns	2022-03-15 01:48:21 +00:00
alesapin	fbb1ebd9b8	Merge pull request #35274 from CurtizJ/fix-check-table-sparse-columns Fix check table in case when there exist sparse columns	2022-03-14 21:56:04 +01:00
Maksim Kita	2fdcf53a76	Fix clang-tidy warnings in Server, Storages folders	2022-03-14 18:17:35 +00:00
Anton Popov	063917786e	minor fixes	2022-03-14 17:29:18 +00:00
Anton Popov	36ec379aeb	Merge remote-tracking branch 'upstream/master' into HEAD	2022-03-14 16:28:35 +00:00
Anton Popov	428bbd6377	fix check table in case when there exist sparse columns	2022-03-14 15:22:23 +00:00
Heena Bansal	c774458f96	Update src/Storages/StoragePostgreSQL.cpp Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>	2022-03-14 10:38:12 -04:00
Heena Bansal	01c8b2f71e	Update src/Storages/StorageMySQL.cpp Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>	2022-03-14 10:37:59 -04:00
Antonio Andelic	d3353f3f0c	Merge branch 'master' into parallel-downloading-url-engine	2022-03-14 12:47:23 +00:00
Antonio Andelic	556fe2bcc5	Don't send RANGE with HEAD call	2022-03-14 12:46:22 +00:00
Antonio Andelic	ed3d71d83f	Merge pull request #35072 from azat/buffer-memory-tracker-leak Avoid MEMORY_LIMIT_EXCEEDED during INSERT into Buffer with AggregateFunction	2022-03-14 12:31:04 +01:00
Maksim Kita	ad6b3693e1	Merge pull request #35123 from zhanghuajieHIT/fix_build_fail_with_gcc fix build fail with gcc	2022-03-14 10:36:15 +01:00
Antonio Andelic	9dda2863d3	Split download threads when multiple URLs are used	2022-03-14 09:27:09 +00:00
HeenaBansal2009	09be30ac26	Added suggestion from review comments	2022-03-11 07:33:23 -08:00
zhanghuajie	53a8987b3b	fix build fail with gcc --fix warnings without disabling some parameters	2022-03-11 21:59:19 +08:00
Antonio Andelic	f5d3a8a31d	Polishing	2022-03-11 13:38:19 +00:00
taiyang-li	1f9e050152	add column distributed_depth for system.query_log and system.processes	2022-03-11 17:57:34 +08:00
Antonio Andelic	28e9508c4e	Improve shared pool and add settings	2022-03-11 08:33:34 +00:00
Antonio Andelic	cce318273b	Use a shared IO thread pool	2022-03-11 08:33:34 +00:00
Antonio Andelic	df0f5e20d0	Refactor	2022-03-11 08:33:34 +00:00
Antonio Andelic	e051587fc2	Format code	2022-03-11 08:33:34 +00:00
Antonio Andelic	29c32ed831	Refactor code	2022-03-11 08:33:34 +00:00
Antonio Andelic	0a1a3a230e	Add support for parallel http range requests	2022-03-11 08:33:34 +00:00
Antonio Andelic	58557b9bec	Test check for ranges support	2022-03-11 08:33:34 +00:00
Kseniia Sumarokova	818459b9f0	Merge pull request #33717 from kssenii/local-cache-for-remote-fs Local cache for remote filesystem	2022-03-11 07:23:10 +01:00
Anton Popov	37efe2ddb5	Apply suggestions from code review Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>	2022-03-10 22:24:19 +01:00
alesapin	c0d8ccc91b	Merge pull request #35178 from Varinara/master Added disk_name to system.part_log	2022-03-10 22:22:37 +01:00
HeenaBansal2009	d981463b05	Added RemoteHostFilter check for MYSQL and postgresSQL	2022-03-10 08:58:48 -08:00
Maksim Kita	493169910b	Merge pull request #35174 from zhangyifan27/fix_typo fix typos	2022-03-10 17:10:44 +01:00
Kseniia Sumarokova	3fc399b6e9	Merge pull request #35158 from kssenii/fix-materialized-postgresql Fix materialised postrgesql adding new table after manually removing it	2022-03-10 17:02:32 +01:00
Kseniia Sumarokova	e30b0c5d57	Merge pull request #35162 from kssenii/fix-materialized-postgresql-table-override Fix materialised postgres `table overrides` for partition by, etc	2022-03-10 17:01:24 +01:00
Varinara	f5523f7ff0	added disk_name to system.part_log	2022-03-10 18:44:19 +03:00
Rajkumar	3d3b6d1956	clang-tidy report issues with Medium priority	2022-03-10 07:23:49 -08:00
mergify[bot]	df01290e73	Merge branch 'master' into fix_typo	2022-03-10 13:35:04 +00:00
kssenii	1dc3f36a11	Better	2022-03-10 12:19:20 +01:00
kssenii	787a0805a5	Merge master	2022-03-10 11:42:19 +01:00
Kseniia Sumarokova	e6ee891c9c	Merge pull request #34957 from bigo-sg/hive_random_access_file_cache Optimization for first time to read a random access readbuffer in hive	2022-03-10 11:36:22 +01:00
zhangyifan27	e6fa9f699a	fix typo	2022-03-10 18:29:42 +08:00
kssenii	3cd1da1e11	Fix	2022-03-10 11:11:59 +01:00
kssenii	af9d8d278e	Fix	2022-03-09 19:25:43 +01:00
Vladimir C	ce266b5a3e	Merge pull request #35146 from amosbird/fixpartitionprunerin	2022-03-09 13:23:45 +01:00
Nikolai Kochetov	6bfee7aca2	Merge pull request #35132 from azat/parallel_distributed_insert_select-view Support view() for parallel_distributed_insert_select	2022-03-09 09:10:34 +01:00
Azat Khuzhin	e2960e1a52	Avoid MEMORY_LIMIT_EXCEEDED during INSERT into Buffer with AggregateFunction In case of Buffer table has columns of AggregateFunction type, aggregate states for such columns will be allocated from the query context but those states can be destroyed from the server context (in case of background flush), and thus memory will be leaked from the query since aggregate states can be shared, and eventually this will lead to MEMORY_LIMIT_EXCEEDED error. To avoid this, prohibit sharing the aggregate states. But note, that this problem only about memory accounting, not memory usage itself. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-09 10:57:49 +03:00
Amos Bird	a19224bc9b	Fix partition pruner: non-monotonic function IN	2022-03-09 15:48:42 +08:00
Azat Khuzhin	3a5a39a9df	Do not delay final part writing by default For async s3 writes final part flushing was defered until all the INSERT block was processed, however in case of too many partitions/columns you may exceed max_memory_usage limit (since each stream has overhead). Introduce max_insert_delayed_streams_for_parallel_writes (with default to 1000 for S3, 0 otherwise), to avoid this. This should "Memory limit exceeded" errors in performance tests. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 22:17:36 +03:00
Azat Khuzhin	4843e210c3	Support view() for parallel_distributed_insert_select Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 22:05:57 +03:00
Azat Khuzhin	ced34dea84	Take flush_time into account for scheduling background flush of the Buffer Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 21:58:10 +03:00
kssenii	5260822964	Merge master	2022-03-08 18:21:28 +01:00
kssenii	e231c3a3e0	Fix split build	2022-03-08 18:05:55 +01:00
Azat Khuzhin	c4b6342853	Improvements for `parallel_distributed_insert_select` (and related) (#34728 ) * Add a warning if parallel_distributed_insert_select was ignored Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Respect max_distributed_depth for parallel_distributed_insert_select Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Print warning for non applied parallel_distributed_insert_select only for initial query Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Remove Cluster::getHashOfAddresses() Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Forbid parallel_distributed_insert_select for remote()/cluster() with different addresses Before it uses empty cluster name (getClusterName()) which is not correct, compare all addresses instead. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Fix max_distributed_depth check max_distributed_depth=1 must mean not more then one distributed query, not two, since max_distributed_depth=0 means no limit, and distribute_depth is 0 for the first query. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Fix INSERT INTO remote()/cluster() with parallel_distributed_insert_select Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Add a test for parallel_distributed_insert_select with cluster()/remote() Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Return <remote> instead of empty cluster name in Distributed engine Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com> * Make user with sharding_key and w/o in remote()/cluster() identical Before with sharding_key the user was "default", while w/o it it was empty. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 15:24:39 +01:00
Antonio Andelic	bc5d7aea57	Merge pull request #34876 from azat/long-INSERT-fix Fix possible "Part directory doesn't exist" during INSERT	2022-03-08 12:44:53 +01:00
Kseniia Sumarokova	1eb2bae792	Merge pull request #34954 from bigo-sg/hive_read_columns_pruning read columns pruning for hive	2022-03-08 10:17:24 +01:00
lgbo-ustc	256e92ffee	Merge remote-tracking branch 'ck/master' into hive_random_access_file_cache	2022-03-08 14:14:40 +08:00
Azat Khuzhin	caffc144b5	Fix possible "Part directory doesn't exist" during INSERT In #33291 final part commit had been defered, and now it can take significantly more time, that may lead to "Part directory doesn't exist" error during INSERT: 2022.02.21 18:18:06.979881 [ 11329 ] {insert} <Debug> executeQuery: (from 127.1:24572, user: default) INSERT INTO db.table (...) VALUES 2022.02.21 20:58:03.933593 [ 11329 ] {insert} <Trace> db.table: Renaming temporary part tmp_insert_20220214_18044_18044_0 to 20220214_270654_270654_0. 2022.02.21 21:16:50.961917 [ 11329 ] {insert} <Trace> db.table: Renaming temporary part tmp_insert_20220214_18197_18197_0 to 20220214_270689_270689_0. ... 2022.02.22 21:16:57.632221 [ 64878 ] {} <Warning> db.table: Removing temporary directory /clickhouse/data/db/table/tmp_insert_20220214_18232_18232_0/ ... 2022.02.23 12:23:56.277480 [ 11329 ] {insert} <Trace> db.table: Renaming temporary part tmp_insert_20220214_18232_18232_0 to 20220214_273459_273459_0. 2022.02.23 12:23:56.299218 [ 11329 ] {insert} <Error> executeQuery: Code: 107. DB::Exception: Part directory /clickhouse/data/db/table/tmp_insert_20220214_18232_18232_0/ doesn't exist. Most likely it is a logical error. (FILE_DOESNT_EXIST) (version 22.2.1.1) (from 127.1:24572) (in query: INSERT INTO db.table (...) VALUES), Stack trace (when copying this message, always include the lines below): Follow-up for: #28760 Refs: #33291 Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-08 07:44:11 +03:00
lgbo-ustc	a8cfc2458a	update codes	2022-03-08 11:55:15 +08:00
taiyang-li	b4174b0bef	merge master and fix conflicts	2022-03-08 11:39:25 +08:00
Maksim Kita	2f9361008b	Merge pull request #35089 from 1lann/1lann/fix-update_lag-typo Fix typo of update_lag	2022-03-07 23:12:35 +01:00
Kseniia Sumarokova	5511f2f6e6	Merge pull request #34940 from bigo-sg/hive_client_connection_pool Use connection pool in HiveMetastoreClient	2022-03-07 17:14:56 +01:00
Kseniia Sumarokova	28b9ec01c0	Merge pull request #34945 from bigo-sg/hive_bug_fixed unexpected result when use `in` in hive query	2022-03-07 17:13:11 +01:00
Anton Popov	0bc57da238	Merge remote-tracking branch 'upstream/master' into HEAD	2022-03-07 14:46:08 +00:00
alesapin	d90bee1df9	Merge pull request #35080 from azat/mutate-exceptions Do not hide exceptions during mutations	2022-03-07 12:22:49 +01:00
Vladimir C	678f05ca1e	Merge pull request #34912 from kssenii/fix-filelog-metadata-path	2022-03-07 11:45:29 +01:00
1lann	5423c5a45c	Fix typo of update_lag In external dictionary providers, the allowed keys for configuration seemed to have a typo of "update_lag" as "update_tag", preventing the use of "update_lag". This change fixes that.	2022-03-07 18:31:20 +08:00
lgbo-ustc	8ae5296ee8	fixed compile errors	2022-03-07 17:26:48 +08:00
lgbo-ustc	eab925554d	fixed code styles	2022-03-07 14:01:03 +08:00
lgbo-ustc	0c83b96d8c	fixed code style	2022-03-07 13:13:33 +08:00
lgbo-ustc	cfeedd2cb5	fixed code style	2022-03-07 12:28:31 +08:00
lgbo-ustc	4507cc58aa	update codes	2022-03-07 12:05:07 +08:00
lgbo-ustc	c37eedd887	update codes	2022-03-07 10:30:54 +08:00
lgbo-ustc	75a50a30c4	update codes	2022-03-07 09:43:53 +08:00
lgbo-ustc	d907b70cc4	update codes: get actual read block	2022-03-07 09:26:05 +08:00
lgbo-ustc	f4d8fb46c5	update codes	2022-03-07 09:26:05 +08:00
lgbo-ustc	62c1bd5ae9	hive read columns pruning	2022-03-07 09:26:05 +08:00
Azat Khuzhin	bc224dee36	Do not hide exceptions during mutations system.mutations includes only the message, but not stacktrace, and it is not always obvious to understand the culprit w/o stacktrace. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-06 13:39:49 +03:00
Kseniia Sumarokova	3ec6cd3128	Update StorageFileLog.cpp	2022-03-06 11:03:22 +01:00
alexey-milovidov	f9b7df6ba1	Merge pull request #35050 from CurtizJ/fix-async-inserts-system-table Fix reading from `system.asynchronous_inserts` table	2022-03-06 02:25:53 +03:00
Maksim Kita	7ae1f0fa3b	Merge pull request #34911 from larspars/master Allow LowCardinality strings for ngrambf_v1/tokenbf_v1 indexes. Fixes #21865	2022-03-04 19:17:48 +01:00
Azat Khuzhin	2ef9d32448	Revert "Remove VERSION_DATE from system.build_options" As requested by @kitaisreal This reverts commit `4a404532fb`. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-04 15:32:54 +03:00
Azat Khuzhin	4a404532fb	Remove VERSION_DATE from system.build_options It was set only bu utils/release/release_lib.sh, and seems that this script is not used anymore, at least that part of it. Also note, that GIT_DATE is the same, and it is date time, not only date. Plus VERSION_DATE is not installed for releases anyway. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-04 15:31:37 +03:00
Azat Khuzhin	b0f964a14f	Remove LIBRARY_ARCHITECTURE from system.build_options CMAKE_LIBRARY_ARCHITECTURE and it is useless, since it is reported only if the compiler reports subdir arch triplet [1] [1]: https://bugzilla.redhat.com/show_bug.cgi?id=1531678 Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-04 15:31:32 +03:00
Azat Khuzhin	494fe91f86	Fix LINK_FLAGS in system.build_options Fixes: `79f6f5a202` Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-04 15:31:32 +03:00
Azat Khuzhin	c426eef07d	Fix generating USE_* for system.build_options Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-04 15:31:32 +03:00
Anton Popov	c836a57000	fix reading from system.asynchronous_inserts table	2022-03-04 11:46:15 +00:00
Anton Popov	df3b07fe7c	Merge remote-tracking branch 'upstream/master' into HEAD	2022-03-03 22:25:28 +00:00
Anton Popov	aea7bfb59a	Merge pull request #34992 from azat/fix-asynchronous_inserts-race Fix race between INSERT async_insert=1 and system.asynchronous_inserts	2022-03-03 20:55:19 +03:00
kssenii	d19f199e93	Revert	2022-03-03 15:25:27 +01:00
Kseniia Sumarokova	b11b34dc8c	Merge pull request #34849 from kssenii/fix-too-many-columns Fix reading too many columns for s3 and url storages	2022-03-03 13:57:22 +01:00
Kseniia Sumarokova	ad09554c4c	Merge pull request #34996 from kssenii/fix-filelog-assertion Fix possible segfault in filelog storage	2022-03-03 13:52:54 +01:00
Frank Chen	b4829465d9	Improve the opentelemetry span logs for INSERT on distributed table (#34480 )	2022-03-03 12:53:29 +01:00
mergify[bot]	e169813004	Merge branch 'master' into fix-too-many-columns	2022-03-02 18:46:35 +00:00
Maksim Kita	b1a956c5f1	clang-tidy check performance-move-const-arg fix	2022-03-02 18:15:27 +00:00
Maksim Kita	1f5837359e	clang-tidy check performance-noexcept-move-constructor fix	2022-03-02 18:15:27 +00:00
mreddy017	f893002b69	Fix vulnerable code related to std::move and noexcept This commit fixes the vulnerable code related to std::move and noexcept identified by clangtidy tool.	2022-03-02 18:15:27 +00:00
kssenii	d5952109fb	Merge master	2022-03-02 18:15:25 +01:00
mergify[bot]	add225c83e	Merge branch 'master' into fix-filelog-assertion	2022-03-02 17:06:08 +00:00
Amos Bird	d4cdf04683	Add missing locks (#34025 )	2022-03-02 16:23:29 +01:00
kssenii	5e84c75942	Fix	2022-03-02 15:31:34 +01:00
kssenii	ef344a581b	Fix bug in FileLog storage	2022-03-02 15:28:17 +01:00
alesapin	b1f5805647	Merge pull request #34609 from ClickHouse/unrestricted-zk-reads allow unrestricted reads from zookeeper	2022-03-02 14:53:12 +01:00
Azat Khuzhin	57f636a1e8	Fix race between INSERT async_insert=1 and system.asynchronous_inserts CI report [1]: [c190f600f8c6] 2022.03.02 01:07:34.553012 [ 23552 ] {76b6113b-1479-46c9-90ab-e78a3c9f3dbb} executeQuery: Code: 60. DB::Exception: Both table name and UUID are empty. (UNKNOWN_TABLE) (version 22.3.1.1) (from [::1]:42040) (comment: '02015_async_inserts_stress_long.sh') (in query: SELECT * FROM system.asynchronous_inserts FORMAT Null), Stack trace (when copying this message, always include the lines below): 0. ClickHouse/contrib/libcxx/include/exception:133: Poco::Exception::Exception(std::__1::basic_string, std::__1::allocator > const&, int) @ 0xf50e04c in /fasttest-workspace/build/programs/clickhouse 1. ClickHouse/src/Common/Exception.cpp:58: DB::Exception::Exception(std::__1::basic_string, std::__1::allocator > const&, int, bool) @ 0x663ebfa in /fasttest-workspace/build/programs/clickhouse 2. DB::StorageID::assertNotEmpty() const @ 0xbc08591 in /fasttest-workspace/build/programs/clickhouse 3. ClickHouse/contrib/libcxx/include/string:1444: DB::StorageID::getDatabaseName() const @ 0xe50d2b6 in /fasttest-workspace/build/programs/clickhouse 4. ClickHouse/contrib/libcxx/include/string:1957: DB::StorageSystemAsynchronousInserts::fillData(std::__1::vector::mutable_ptr, std::__1::allocator::mutable_ptr > >&, std::__1::shared_ptr, DB::SelectQueryInfo const&) const @ 0xdac636c in /fasttest-workspace/build/programs/clickhouse [1]: https://s3.amazonaws.com/clickhouse-test-reports/34973/e6fc6a22d5c018961c18247242dd3a40b8c54ff2/fast_test__actions_.html Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-03-02 15:28:06 +03:00
alesapin	9249c5d50e	Use tryget instead of get	2022-03-02 13:09:12 +03:00
Maksim Kita	53116faeeb	Update MergeTreeIndexFullText.cpp	2022-03-02 11:08:35 +01:00
Kseniia Sumarokova	a9ab149b31	Merge pull request #34859 from Vxider/windowview-multi-column-groupby Fix bugs for multiple columns group by in WindowView	2022-03-02 10:09:47 +01:00
Filatenkov Artur	f48f35cad0	Merge pull request #34975 from Vector-Similarity-Search-for-ClickHouse/fix-typo Fix typo	2022-03-02 09:59:06 +03:00
Anton Popov	d7cd9aa69b	fix reading of missed subcolumns	2022-03-02 03:31:40 +03:00
NikitaEvs	06f47673f4	Fix typo	2022-03-01 21:42:27 +00:00
alesapin	e2989c2b85	Fix storage system zookeeper	2022-03-01 21:23:26 +01:00
alesapin	ec3e4251e1	Fix style	2022-03-01 20:34:25 +01:00
kssenii	a594f388a4	Merge master	2022-03-01 19:43:45 +01:00
kssenii	aa1c71a877	Merge master	2022-03-01 19:25:04 +01:00
kssenii	755e63ed03	Keep compatibility	2022-03-01 19:21:59 +01:00
Anton Popov	c1fdcf7a64	Merge remote-tracking branch 'upstream/master' into HEAD	2022-03-01 20:21:39 +03:00
Anton Popov	04a3a10148	minor fixes	2022-03-01 20:20:53 +03:00
alesapin	cba5fe44a9	Merge branch 'master' into unrestricted-zk-reads	2022-03-01 18:09:21 +01:00
Anton Popov	2758db5341	add more comments	2022-03-01 19:32:55 +03:00
kssenii	092ec45b47	Merge master	2022-03-01 12:06:56 +01:00
lgbo-ustc	ca470e1b94	lazy initialization about getting hive metadata in HiveStorage	2022-03-01 19:04:44 +08:00
Kseniia Sumarokova	781621eefe	Merge pull request #34946 from bigo-sg/hive_table_function Add hive table function	2022-03-01 11:28:36 +01:00
alesapin	4b61e4795c	Merge pull request #34949 from nikitamikhaylov/system_log_tables_and_settings Recreate system.{*}_log table on settings changes	2022-03-01 11:15:19 +01:00
lgbo-ustc	5ed41bda9b	fixed code style	2022-03-01 17:20:32 +08:00
lgbo-ustc	5ae99df87c	fxied code style	2022-03-01 15:35:57 +08:00
lgbo-ustc	6e568c1530	update codes	2022-03-01 15:24:40 +08:00
lgbo-ustc	91a45d799e	optimization for first time to read a random access readbuffer	2022-03-01 15:22:07 +08:00
Nikita Mikhaylov	d6036f6da3	Better (cherry picked from commit 4ae445c9e227581ea9f1cbe9aa9d1ba82e1236c9)	2022-02-28 15:27:52 +00:00
Kruglov Pavel	011813957d	Merge pull request #34938 from azat/create-as-ignore-ttl Ignore per-column TTL in CREATE TABLE AS if new table engine does not support it	2022-02-28 16:58:15 +03:00
kssenii	9b64a8fe39	Fix odbc bridge	2022-02-28 14:29:05 +01:00
lgbo-ustc	99cd25d70e	add new table function: hive()	2022-02-28 20:51:33 +08:00
lgbo-ustc	6473767c99	fixed code style	2022-02-28 17:10:56 +08:00
lgbo-ustc	5885cfd869	fixed bug : unexpected result when using in clause for filtering partitions	2022-02-28 16:47:50 +08:00
Hongbin	c9bc442114	fix comments	2022-02-28 16:44:35 +08:00
Azat Khuzhin	644f9168fa	Ignore per-column TTL in CREATE TABLE AS if new table engine does not support it Follow-up for: #6968 Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-02-28 10:29:26 +03:00
lgbo-ustc	c5e02be44e	fixed code-style	2022-02-28 15:22:54 +08:00
lgbo-ustc	2176d74cd1	Use connection pool in HiveMetastoreClient 1. remove lock for hive metastore client access 2. auo reconnect when connection is broken	2022-02-28 15:11:38 +08:00
Hongbin	99bd56e2de	Fix some code comments style	2022-02-28 08:15:37 +08:00
mergify[bot]	8d84d22618	Merge branch 'master' into windowview-multi-column-groupby	2022-02-26 00:50:49 +00:00
kssenii	2ba9010a34	Fix	2022-02-25 17:53:19 +01:00
kssenii	6c8401bfbd	Fix	2022-02-25 16:35:37 +01:00
Lars Eidnes	2629614dfe	Allow LowCardinality strings for ngrambf_v1/tokenbf_v1 indexes. Fixes #21865	2022-02-25 15:36:36 +01:00
Anton Popov	fcdebea925	Merge remote-tracking branch 'upstream/master' into HEAD	2022-02-25 13:41:30 +03:00
kssenii	cfad79bf74	Remove redundant	2022-02-25 09:37:48 +01:00
Sergei Trifonov	2d25c79e37	analyze select queries from `system.zookeeper` table with `LIKE pattern` and fetch nodes using prefix recursively	2022-02-24 13:40:47 +03:00
Vxider	06469eb793	remove blank row	2022-02-24 14:16:24 +08:00
Vxider	43475f79bf	windowview_multi_column_groupby	2022-02-24 14:06:37 +08:00
kssenii	003b807b00	Fix	2022-02-23 20:33:05 +01:00
tavplubix	43626b3ffd	Update src/Storages/FileLog/StorageFileLog.cpp Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>	2022-02-23 21:07:37 +03:00
Alexander Tokmakov	5a26f856d9	remove trash that shouldn't have been merged	2022-02-22 23:41:33 +03:00
Dmitry Novik	2fd4baaa64	Merge pull request #34387 from nvartolomei/nv/move-part-settings-cleanup Remove useless setting experimental_query_deduplication_send_all_part_uuids	2022-02-22 06:11:00 -08:00
Sergei Trifonov	7fe3bef866	add test for unrestricted zk reads	2022-02-22 16:51:30 +03:00
kssenii	c637385dd0	Merge master	2022-02-22 13:17:51 +01:00
Kseniia Sumarokova	eeea322556	Merge pull request #34629 from amosbird/remotefsimprove Some refactoring and improvement over async and remote buffer related stuff	2022-02-22 11:36:40 +01:00
Dmitry Novik	1df43a7f57	Merge pull request #34385 from nvartolomei/nv/move-part-count Disable optimize_trivial_count when deduplication for part movement feature is enabled	2022-02-21 08:53:09 -08:00
Anton Popov	065305ab65	Merge pull request #34764 from ucasfl/hints-index Add name hints for data skipping indices	2022-02-21 16:50:59 +03:00
Mikhail f. Shiryaev	5ac8cdbc69	Merge pull request #34786 from ClickHouse/make_drop_column_metadata_only Make drop of alias column metadata only	2022-02-21 14:11:55 +01:00
mergify[bot]	314ab73b11	Merge branch 'master' into nv/move-part-settings-cleanup	2022-02-21 10:18:44 +00:00
Dmitry Novik	4428e7aa1b	Merge branch 'master' into nv/move-part-count	2022-02-21 02:14:23 -08:00
alesapin	d7cae5ffb4	Fix build	2022-02-21 11:54:52 +03:00
alesapin	852757219f	Make drop of alias column metadata only	2022-02-21 11:46:16 +03:00
Vitaly Baranov	aee67a6693	Merge pull request #31484 from eungenue/Implement-SSL-X509-certificate-authentication Implement ssl x509 certificate authentication	2022-02-21 11:30:52 +03:00
Vitaly Baranov	0d377de5f0	Support syntax CREATE USER IDENTIFIED WITH ssl_certificate CN ...	2022-02-21 07:01:00 +03:00
Vitaly Baranov	7b97c986cb	Revert "Allow restrictive row policies without permissive"	2022-02-21 06:54:28 +03:00
feng lv	07280e0ab1	Add name hints for data skipping indices fix test	2022-02-20 11:48:22 +00:00
Vitaly Baranov	874b2c8dcb	Merge pull request #34596 from vitlibar/allow-restrictive-without-permissive Allow restrictive row policies without permissive	2022-02-19 21:45:28 +07:00
Azat Khuzhin	fef5f146e7	Fix ENOENT with fsync_part_directory and Vertical merge fsync of the temporary part directory is superfluous anyway, and besides that directory is not exists at that time, that will lead to ENOENT error: 2022.02.18 17:02:51.634565 [ 35639 ] {} <Error> void DB::MergeTreeBackgroundExecutor<DB::MergeMutateRuntimeQueue>::routine(DB::TaskRuntimeDataPtr) [Queue = DB::MergeMutateRuntimeQueue]: Code: 107. DB::ErrnoException: Cannot open file /var/lib/clickhouse/data/system/text_log/tmp_merge_202202_1864_3192_14/, errno: 2, strerror: No such file or directory. (FILE_DOESNT_EXIST), Stack trace (when copying this message, always include the lines below): 0. DB::Exception::Exception() @ 0xb26ecfa in /usr/lib/debug/.build-id/01/8c328bd4858d67.debug 1. DB::throwFromErrnoWithPath() @ 0xb2700ea in /usr/lib/debug/.build-id/01/8c328bd4858d67.debug 2. DB::LocalDirectorySyncGuard::LocalDirectorySyncGuard() @ 0x14905531 in /usr/lib/debug/.build-id/01/8c328bd4858d67.debug 3. DB::DiskLocal::getDirectorySyncGuard() const @ 0x148af3e3 in /usr/lib/debug/.build-id/01/8c328bd4858d67.debug 4. DB::MergeTask::ExecuteAndFinalizeHorizontalPart::prepare() @ 0x157bef13 in /usr/lib/debug/.build-id/01/8c328bd4858d67.debug Note, that IMergeTreeDataPart::renameTo() anyway will have fsync for the directory. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-02-19 07:50:59 +03:00
Nikolai Kochetov	e4d5db6161	Merge pull request #34717 from azat/merge-mutate-memory-tracker Fix possible memory_tracker use-after-free (for async s3 writes) for merges/mutations	2022-02-18 19:28:43 +01:00
Vladimir C	9b7d011ee7	Merge pull request #34529 from vdimir/join-nullable-on-pipeline Apply join_use_nulls on types before join	2022-02-18 18:34:44 +01:00
Azat Khuzhin	65e9b4879d	Fix possible memory_tracker use-after-free for merges/mutations There are two possible cases for execution merges/mutations: 1) from background thread 2) from OPTIMIZE TABLE query 1) is pretty simple, it's memory tracking structure is as follow: current_thread::memory_tracker = level=Thread / description="(for thread)" == background_thread_memory_tracker = level=Thread / description="(for thread)" current_thread::memory_tracker.parent = level=Global / description="(total)" So as you can see it is pretty simple and MemoryTrackerThreadSwitcher does not do anything icky for this case. 2) is complex, it's memory tracking structure is as follow: current_thread::memory_tracker = level=Thread / description="(for thread)" current_thread::memory_tracker.parent = level=Process / description="(for query)" == background_thread_memory_tracker = level=Process / description="(for query)" Before this patch to track memory (and related things, like sampling, profiling and so on) for OPTIMIZE TABLE query dirty hacks was done to do this, since current_thread memory_tracker was of Thread scope, that does not have any limits. And so if will change parent for it to Merge/Mutate memory tracker (which also does not have some of settings) it will not be correctly tracked. To address this Merge/Mutate was set as parent not to the current_thread memory_tracker but to it's parent, since it's scope is Process with all settings. But that parent's memory_tracker is the memory_tracker of the thread_group, and so if you will have nested ThreadPool inside merge/mutate (this is the case for s3 async writes, which has been added in #33291) you may get use-after-free of memory_tracker. Consider the following example: MemoryTrackerThreadSwitcher() thread_group.memory_tracker.parent = merge_list_entry->memory_tracker (see also background_thread_memory_tracker above) CurrentThread::attachTo() current_thread.memory_tracker.parent = thread_group.memory_tracker CurrentThread::detachQuery() current_thread.memory_tracker.parent = thread_group.memory_tracker.parent # and this is equal to merge_list_entry->memory_tracker ~MemoryTrackerThreadSwitcher() thread_group.memory_tracker = thread_group.memory_tracker.parent So after the following we will get incorrect memory_tracker (from the mege_list_entry) when the next job in that ThreadPool will not have thread_group, since in this case it will not try to update the current_thread.memory_tracker.parent and use-after-free will happens. So to address the (2) issue, settings from the parent memory_tracker should be copied to the merge_list_entry->memory_tracker, to avoid playing with parent memory tracker. Note, that settings from the query (OPTIMIZE TABLE) is not available at that time, so it cannot be used (instead of parent's memory tracker settings). v2: remove memory_tracker.setOrRaiseHardLimit() from settings Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-02-18 16:23:54 +03:00
Amos Bird	f459e8fc95	Less getMark calls	2022-02-18 19:55:19 +08:00
Anton Popov	0a7895ebb9	add comments and small refactoring	2022-02-17 22:00:25 +03:00
zvonand	90c857c5e3	merge	2022-02-17 18:23:37 +03:00
tavplubix	0f5ee19d0b	Merge pull request #34633 from zhangjmruc/master For ReplatedMergeTree, early break for multiple leaders case when log has been updated by the other leader	2022-02-17 14:01:50 +03:00
Kruglov Pavel	6dcb766879	Merge pull request #34465 from Avogar/fix-url-globs Improve schema inference with globs in FIle/S3/HDFS/URL engines	2022-02-17 13:33:27 +03:00
Vitaly Baranov	2de6e8e575	Change type of RowPolicyKind: bool -> enum.	2022-02-17 14:18:10 +07:00
Amos Bird	d3bd8b5f93	Cosmetic fix	2022-02-17 14:31:22 +08:00
Amos Bird	ba19c7cf44	Slightly better interface of compressed buffer	2022-02-17 14:31:22 +08:00
Jianmei Zhang	ef0c3b99ff	Merge remote-tracking branch 'upstream/master'	2022-02-17 14:02:27 +08:00
Sergei Trifonov	b6bb479c48	add setting to enable unrestricted reads from zookeeper	2022-02-16 23:03:44 +03:00
Sergei Trifonov	f342c497ef	fix style	2022-02-16 20:22:03 +03:00
Azat Khuzhin	774744a86d	Fix allow_experimental_projection_optimization with enable_global_with_statement allow_experimental_projection_optimization requires one more InterpreterSelectQuery, which with enable_global_with_statement will apply ApplyWithAliasVisitor if the query is not subquery. But this should not be done for queries from MergeTreeData::getQueryProcessingStage()/getQueryProcessingStageWithAggregateProjections() since this will duplicate WITH statements over and over. This will also fix scalar.xml perf tests, that leads to the following error now: scalar.query0.prewarm0: DB::Exception: Stack size too large. And since it has very long query in the log, this leads to the following perf test error: _csv.Error: field larger than field limit (131072) Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2022-02-16 19:14:47 +03:00
kssenii	47f94120da	Merge master	2022-02-16 14:43:28 +01:00
kssenii	3bd3e51aa0	Fix tests	2022-02-16 14:08:41 +01:00
Mikhail f. Shiryaev	4f84406136	Merge pull request #34641 from ClickHouse/version-and-release refactor version_helper, create release script	2022-02-16 14:00:55 +01:00
Maksim Kita	d6e88f56cd	Merge pull request #34623 from CurtizJ/minor-subcolumns-fix Fix quadratic complexity while adding subcolumns	2022-02-16 12:38:00 +01:00
Mikhail f. Shiryaev	c5db40f679	Deprecate sh script for StorageSystemContributors, update generated file	2022-02-16 12:16:43 +01:00
Nikolai Kochetov	f9d2dae88e	Merge pull request #34424 from yakov-olkhovskiy/ephemeral-column Ephemeral column issue #9436	2022-02-16 12:04:57 +01:00
Kruglov Pavel	dd863ca2a0	Merge branch 'master' into fix-url-globs	2022-02-16 12:45:31 +03:00
Jianmei Zhang	25c761b3b6	Early break for multiple leaders case when log updated by other leader	2022-02-16 16:06:41 +08:00
Anton Popov	e4fddaa03a	fix quadratic complexity while adding subcolumns	2022-02-16 02:42:50 +03:00
Anton Popov	a661eaf39f	better performance of getting storage snapshot	2022-02-16 02:17:22 +03:00
alesapin	bc2d0ee7c7	Merge pull request #34215 from ClickHouse/revert-34211-revert-34153-add_func_tests_over_s3 Add func tests run with s3 and fix several bugs	2022-02-15 19:07:11 +03:00
Sergei Trifonov	a507f83d8d	allow unrestricted reads from zookeeper	2022-02-15 17:12:37 +03:00
Nikolai Kochetov	ab288642f6	Merge branch 'master' into ephemeral-column	2022-02-15 10:03:34 +00:00
Nikolai Kochetov	d6cbac1ed3	Merge pull request #34577 from ClickHouse/alwasy-remove-unused-actions-for-add-missing-defaults Always remove unused actions from addMissingDefaults	2022-02-15 11:01:29 +01:00
alesapin	447cd56cb9	Fix comments	2022-02-15 12:11:50 +03:00
李扬	f52b67b939	Merge branch 'master' into rocksdb_metacache	2022-02-15 02:16:29 -06:00
alesapin	e15396d90c	Fix race condition:	2022-02-14 22:19:49 +03:00
Nikolai Kochetov	b3ea360cd2	Fix a little bit more	2022-02-14 19:05:30 +00:00
Kseniia Sumarokova	382b8e0388	Merge pull request #34432 from ClickHouse/static-files-disk-uploader-create-symlinks `static-files-disk-uploader`: add a mode to create symlinks	2022-02-14 18:10:53 +01:00

... 2 3 4 5 6 ...

7385 Commits