ClickHouse

mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-12-15 10:52:30 +00:00

Author	SHA1	Message	Date
Nikolai Kochetov	ec1e2436cc	Merge pull request #45450 from ClickHouse/fix-disabled-two-level-agg Fix disabled two-level aggregation from HTTP	2023-01-21 12:01:59 +01:00
Sema Checherinda	962894afc8	Merge pull request #44909 from CheSema/intersect-prev-part Do not merge over a gap with outdated undeleted parts	2023-01-21 11:51:21 +01:00
Maksim Kita	47385a19e7	Remove unnecessary getTotalRowCount function calls	2023-01-21 11:27:07 +01:00
Azat Khuzhin	a64f6b5f3e	Fix possible (likely distributed) query hung Recently I saw the following, the client executed long distributed query and terminated the connection, and in this case query cancellation will be done from PullingAsyncPipelineExecutor dtor, but during cancellation one of nodes sent ECONNRESET, and this leads to an exception from PullingAsyncPipelineExecutor::cancel(), and this leads to a deadlock when multiple threads waits each others, because cancel() for LazyOutputFormat wasn't called. Here is as relevant portion of logs: 2023.01.04 08:26:09.236208 [ 37968 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Debug> executeQuery: (from 10.61.13.253:44266, user: default) TooLongDistributedQueryToPost ... 2023.01.04 08:26:09.262424 [ 37968 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Trace> MergeTreeInOrderSelectProcessor: Reading 1 ranges in order from part 9_330_538_18, approx. 61440 rows starting from 0 2023.01.04 08:26:09.266399 [ 26788 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Trace> Connection (s4.ch:9000): Connecting. Database: (not specified). User: default 2023.01.04 08:26:09.266849 [ 26788 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Trace> Connection (s4.ch:9000): Connected to ClickHouse server version 22.10.1. 2023.01.04 08:26:09.267165 [ 26788 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Debug> Connection (s4.ch:9000): Sent data for 2 scalars, total 2 rows in 3.1587e-05 sec., 62635 rows/sec., 68.00 B (2.03 MiB/sec.), compressed 0.4594594594594595 times to 148.00 B (4.41 MiB/sec.) 2023.01.04 08:39:13.047170 [ 37968 ] {f2ed6149-146d-4a3d-874a-b0b751c7b567} <Error> PullingAsyncPipelineExecutor: Code: 210. DB::NetException: Connection reset by peer, while writing to socket (10.7.142.115:9000). (NETWORK_ERROR), Stack trace (when copying this message, always include the lines below): 0. ./.build/./contrib/libcxx/include/exception:133: Poco::Exception::Exception(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, int) @ 0x1818234c in /usr/lib/debug/usr/bin/clickhouse.debug 1. ./.build/./src/Common/Exception.cpp:69: DB::Exception::Exception(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, int, bool) @ 0x1004fbda in /usr/lib/debug/usr/bin/clickhouse.debug 2. ./.build/./src/Common/NetException.h:12: DB::WriteBufferFromPocoSocket::nextImpl() @ 0x14e352f3 in /usr/lib/debug/usr/bin/clickhouse.debug 3. ./.build/./src/IO/BufferBase.h:39: DB::Connection::sendCancel() @ 0x15c21e6b in /usr/lib/debug/usr/bin/clickhouse.debug 4. ./.build/./src/Client/MultiplexedConnections.cpp:0: DB::MultiplexedConnections::sendCancel() @ 0x15c4d5b7 in /usr/lib/debug/usr/bin/clickhouse.debug 5. ./.build/./src/QueryPipeline/RemoteQueryExecutor.cpp:627: DB::RemoteQueryExecutor::tryCancel(char const, std::__1::unique_ptr<DB::RemoteQueryExecutorReadContext, std::__1::default_delete<DB::RemoteQueryExecutorReadContext> >) @ 0x14446c09 in /usr/lib/debug/usr/bin/clickhouse.debug 6. ./.build/./contrib/libcxx/include/__iterator/wrap_iter.h💯 DB::ExecutingGraph::cancel() @ 0x15d2c0de in /usr/lib/debug/usr/bin/clickhouse.debug 7. ./.build/./contrib/libcxx/include/__memory/unique_ptr.h:300: DB::PullingAsyncPipelineExecutor::cancel() @ 0x15d32055 in /usr/lib/debug/usr/bin/clickhouse.debug 8. ./.build/./contrib/libcxx/include/__memory/unique_ptr.h:312: DB::PullingAsyncPipelineExecutor::~PullingAsyncPipelineExecutor() @ 0x15d31f4f in /usr/lib/debug/usr/bin/clickhouse.debug 9. ./.build/./src/Server/TCPHandler.cpp:0: DB::TCPHandler::processOrdinaryQueryWithProcessors() @ 0x15cde919 in /usr/lib/debug/usr/bin/clickhouse.debug 10. ./.build/./src/Server/TCPHandler.cpp:0: DB::TCPHandler::runImpl() @ 0x15cd8554 in /usr/lib/debug/usr/bin/clickhouse.debug 11. ./.build/./src/Server/TCPHandler.cpp:1904: DB::TCPHandler::run() @ 0x15ce6479 in /usr/lib/debug/usr/bin/clickhouse.debug 12. ./.build/./contrib/poco/Net/src/TCPServerConnection.cpp:57: Poco::Net::TCPServerConnection::start() @ 0x18074f07 in /usr/lib/debug/usr/bin/clickhouse.debug 13. ./.build/./contrib/libcxx/include/__memory/unique_ptr.h:54: Poco::Net::TCPServerDispatcher::run() @ 0x180753ed in /usr/lib/debug/usr/bin/clickhouse.debug 14. ./.build/./contrib/poco/Foundation/src/ThreadPool.cpp:213: Poco::PooledThread::run() @ 0x181e3807 in /usr/lib/debug/usr/bin/clickhouse.debug 15. ./.build/./contrib/poco/Foundation/include/Poco/SharedPtr.h:156: Poco::ThreadImpl::runnableEntry(void) @ 0x181e1483 in /usr/lib/debug/usr/bin/clickhouse.debug 16. ? @ 0x7ffff7e55fd4 in ? 17. ? @ 0x7ffff7ed666c in ? (version 22.10.1.1) And here is the state of the threads: <details> <summary>system.stack_trace</summary> ```sql SELECT arrayStringConcat(arrayMap(x -> demangle(addressToSymbol(x)), trace), '\n') AS sym FROM system.stack_trace WHERE query_id = 'f2ed6149-146d-4a3d-874a-b0b751c7b567' SETTINGS allow_introspection_functions=1 Row 1: ────── sym: pthread_cond_wait std::__1::condition_variable::wait(std::__1::unique_lock<std::__1::mutex>&) bool ConcurrentBoundedQueue<DB::Chunk>::emplaceImpl<DB::Chunk>(std::__1::optional<unsigned long>, DB::Chunk&&) DB::IOutputFormat::work() DB::ExecutionThreadContext::executeTask() DB::PipelineExecutor::executeStepImpl(unsigned long, std::__1::atomic<bool>) Row 2: ────── sym: pthread_cond_wait Poco::EventImpl::waitImpl() DB::PipelineExecutor::joinThreads() DB::PipelineExecutor::executeImpl(unsigned long) DB::PipelineExecutor::execute(unsigned long) Row 3: ────── sym: pthread_cond_wait Poco::EventImpl::waitImpl() DB::PullingAsyncPipelineExecutor::Data::~Data() DB::PullingAsyncPipelineExecutor::~PullingAsyncPipelineExecutor() DB::TCPHandler::processOrdinaryQueryWithProcessors() DB::TCPHandler::runImpl() DB::TCPHandler::run() Poco::Net::TCPServerConnection::start() Poco::Net::TCPServerDispatcher::run() Poco::PooledThread::run() Poco::ThreadImpl::runnableEntry(void*) ``` </details> Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-01-21 08:05:56 +01:00
Azat Khuzhin	e2fcf0f072	Catch exception on query cancellation Since we still want to join the thread, yes it will be done in dtor, but this looks better. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-01-21 08:05:56 +01:00
Azat Khuzhin	0566f72d36	Cleanup PullingAsyncPipelineExecutor::cancel() Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-01-21 08:05:56 +01:00
avogar	eed1db7e07	Fix schema inference in hdfsCluster	2023-01-20 21:17:35 +00:00
Anton Popov	41a199e175	Fix crash when `ListObjects` request fails (#45371 )	2023-01-20 20:10:23 +01:00
Nikolai Kochetov	dcd84c152a	Fix possible deadlock with allow_asynchronous_read_from_io_pool_for_merge_tree in case of exception from ThreadPool::schedule	2023-01-20 18:57:47 +00:00
Han Fei	4bbe90f6b4	Merge pull request #45473 from hanfei1991/hanfei/async-inserts-doc update docs for async insert deduplication	2023-01-20 18:25:19 +01:00
Han Fei	18a397f8c9	address comments	2023-01-20 18:09:42 +01:00
Han Fei	449ace3373	Update docs/en/operations/settings/settings.md Co-authored-by: Dan Roscigno <dan@roscigno.com>	2023-01-20 18:07:19 +01:00
Han Fei	9d87bd10ee	Update docs/en/operations/settings/settings.md Co-authored-by: Dan Roscigno <dan@roscigno.com>	2023-01-20 18:07:08 +01:00
Han Fei	badfbcb3d8	Update docs/en/operations/settings/settings.md Co-authored-by: Dan Roscigno <dan@roscigno.com>	2023-01-20 18:06:58 +01:00
Han Fei	e9c4cf46cd	Update docs/en/operations/settings/merge-tree-settings.md Co-authored-by: Dan Roscigno <dan@roscigno.com>	2023-01-20 18:06:46 +01:00
Han Fei	9d254f7d87	Update docs/en/operations/settings/merge-tree-settings.md Co-authored-by: Dan Roscigno <dan@roscigno.com>	2023-01-20 18:06:32 +01:00
Nikolay Degterinsky	c2ac4fece0	Add more retries to AST Fuzzer	2023-01-20 16:46:24 +00:00
robot-ch-test-poll4	2066581d8f	Merge pull request #45451 from evillique/default_granularity Add default GRANULARITY argument for secondary indexes	2023-01-20 17:46:21 +01:00
avogar	86336940f8	Better comment	2023-01-20 16:41:59 +00:00
avogar	4432ee9927	Fix aborts in arrow lib	2023-01-20 16:40:33 +00:00
vdimir	e30ab0874b	Review fixes	2023-01-20 16:30:34 +00:00
Alexander Tokmakov	910d6dc0ce	Merge pull request #45342 from ClickHouse/exception_message_patterns Save message format strings for DB::Exception	2023-01-20 18:46:52 +03:00
Nikita Mikhaylov	9ab454366c	Better test	2023-01-20 16:40:01 +01:00
Kseniia Sumarokova	01320da02b	Update BoundedReadBuffer.cpp	2023-01-20 16:25:02 +01:00
ltrk2	810c9ba50c	Produce a null map of the correct size	2023-01-20 10:24:42 -05:00
ltrk2	9d798ea1bc	Document functions	2023-01-20 10:24:42 -05:00
ltrk2	65b9c69c90	Introduce non-throwing variants of hasToken	2023-01-20 10:24:42 -05:00
Nikolay Degterinsky	02142596fb	Add docs	2023-01-20 15:22:13 +00:00
avogar	550a703fbc	Make a bit better	2023-01-20 14:58:39 +00:00
Antonio Andelic	136e4ec1b3	Merge pull request #45273 from azat/fix-test-log-level Fix log level "Test" for send_logs_level in client	2023-01-20 15:36:05 +01:00
Alexander Tokmakov	ec5d7d0a3a	Update src/Functions/FunctionsConversion.h Co-authored-by: Alexander Gololobov <440544+davenger@users.noreply.github.com>	2023-01-20 17:33:01 +03:00
Kruglov Pavel	28ddcc2432	Merge branch 'master' into tsv-csv-detect-header	2023-01-20 15:08:38 +01:00
Robert Schulze	b55c8ddd65	Merge pull request #45472 from ClickHouse/rs-duplicate-writing-guide Don't duplicate writing guide, instead point to existing writing guide	2023-01-20 15:05:13 +01:00
Sema Checherinda	b76b612d23	fix typo	2023-01-20 14:55:58 +01:00
Nikolai Kochetov	039901b395	Fixing build	2023-01-20 13:49:50 +00:00
Han Fei	5fc4998f10	update docs for async insert deduplication	2023-01-20 14:42:11 +01:00
Robert Schulze	4ac17d71fa	Merge pull request #45470 from ClickHouse/rs-doc-typos Fix typos	2023-01-20 14:39:27 +01:00
Robert Schulze	3b182e0fec	Don't duplicate writing guide, instead point to existing writing guide	2023-01-20 13:36:31 +00:00
Robert Schulze	3f2e4c8217	Fix typos	2023-01-20 13:20:25 +00:00
Robert Schulze	687f9c35a7	Merge pull request #45469 from ClickHouse/inv-idx-docs Docs for inverted index	2023-01-20 14:17:58 +01:00
Robert Schulze	1a966a9590	Fix bad comparison	2023-01-20 13:05:06 +00:00
Mikhail f. Shiryaev	d4f60bc9da	Fix the case when merge-base does not show the oldest commit	2023-01-20 13:55:21 +01:00
Sema Checherinda	02f22f04e8	fix typos	2023-01-20 13:35:23 +01:00
kssenii	8d20af8127	Fix	2023-01-20 13:34:23 +01:00
Robert Schulze	7e6d3163b1	Initial inverted index docs	2023-01-20 12:12:20 +00:00
Azat Khuzhin	bdeb5514c5	Fix ASan builds for glibc 2.36+ (use RTLD_NEXT for ThreadFuzzer interceptors) Recently I noticed that clickhouse compiled with ASan does not work with newer glibc 2.36+, before I though that this was only about compiling with old but using new, however that was not correct, ASan simply does not work with glibc 2.36+. Here is a simple reproducer [1]: $ cat > test-asan.cpp <<EOL #include <pthread.h> int main() { // something broken in ASan in interceptor for __pthread_mutex_lock // and only since glibc 2.36, and for pthread_mutex_lock everything is OK pthread_mutex_t mutex = PTHREAD_MUTEX_INITIALIZER; return __pthread_mutex_lock(&mutex); } EOL $ clang -g3 -o test-asan test-asan.cpp -fsanitize=address $ ./test-asan AddressSanitizer:DEADLYSIGNAL ================================================================= ==15659==ERROR: AddressSanitizer: SEGV on unknown address 0x000000000000 (pc 0x000000000000 bp 0x7fffffffccb0 sp 0x7fffffffcb98 T0) ==15659==Hint: pc points to the zero page. ==15659==The signal is caused by a READ memory access. ==15659==Hint: address points to the zero page. #0 0x0 (<unknown module>) #1 0x7ffff7cda28f (/usr/lib/libc.so.6+0x2328f) (BuildId: 1e94beb079e278ac4f2c8bce1f53091548ea1584) AddressSanitizer can not provide additional info. SUMMARY: AddressSanitizer: SEGV (<unknown module>) ==15659==ABORTING [1]: https://gist.github.com/azat/af073e57a248e04488b21068643f079e I've started observing glibc code, there was some changes in glibc, that moves pthread functions out from libpthread.so.0 into libc.so.6 (somewhere between 2.31 and 2.35), but the problem pops up only with 2.36, 2.35 works fine. After this I've looked into changes between 2.35 and 2.36, and found this patch [2] - "dlsym: Make RTLD_NEXT prefer default version definition [BZ #14932]", that fixes this bug [3]. [2]: https://sourceware.org/git/?p=glibc.git;a=commit;h=efa7936e4c91b1c260d03614bb26858fbb8a0204 [3]: https://sourceware.org/bugzilla/show_bug.cgi?id=14932 The problem with using DL_LOOKUP_RETURN_NEWEST flag for RTLD_NEXT is that it does not resolve hidden symbols (and __pthread_mutex_lock is indeed hidden). Here is a sample that will show the difference [4]: $ cat > test-dlsym.c <<EOL #define _GNU_SOURCE #include <dlfcn.h> #include <stdio.h> int main() { void *p = dlsym(RTLD_NEXT, "__pthread_mutex_lock"); printf("__pthread_mutex_lock: %p (via RTLD_NEXT)\n", p); return 0; } EOL # glibc 2.35: __pthread_mutex_lock: 0x7ffff7e27f70 (via RTLD_NEXT) # glibc 2.36: __pthread_mutex_lock: (nil) (via RTLD_NEXT) [4]: https://gist.github.com/azat/3b5f2ae6011bef2ae86392cea7789eb7 But ThreadFuzzer uses internal symbols to wrap pthread_mutex_lock/pthread_mutex_unlock, which are intercepted by ASan and this leads to NULL dereference. The fix was obvious - just use dlsym(RTLD_NEXT), however on older glibc's this leads to endless recursion (see commits in the code). But only for jemalloc [5], and even though sanitizers does not uses jemalloc the code of ThreadFuzzer is generic and I don't want to guard it with more preprocessors macros. [5]: https://gist.github.com/azat/588d9c72c1e70fc13ebe113197883aa2 So we have to use RTLD_NEXT only for ASan. There is also one more interesting issue, if you will compile with clang that itself had been compiled with newer libc (i.e. 2.36), you will get the following error: $ podman run --privileged -v $PWD/.cmake-asan/programs:/root/bin -e PATH=/bin:/root/bin -e --rm -it ubuntu-dev-v3 clickhouse ==1==ERROR: AddressSanitizer failed to allocate 0x0 (0) bytes of SetAlternateSignalStack (error code: 22) ... ==1==End of process memory map. AddressSanitizer: CHECK failed: sanitizer_common.cpp:53 "((0 && "unable to mmap")) != (0)" (0x0, 0x0) (tid=1) <empty stack> The problem is that since GLIBC_2.31, `SIGSTKSZ` is a call to `getconf(_SC_MINSIGSTKSZ)`, but older glibc does not have it, so `-1` will be returned and used as `SIGSTKSZ` instead. The workaround to disable alternative stack: $ podman run --privileged -v $PWD/.cmake-asan/programs:/root/bin -e PATH=/bin:/root/bin -e ASAN_OPTIONS=use_sigaltstack=0 --rm -it ubuntu-dev-v3 clickhouse client --version ClickHouse client version 22.13.1.1. Fixes: #43426 Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2023-01-20 13:09:13 +01:00
Robert Schulze	bfc3b4f5ca	Suffix "GinFilter" --> "Inverted"	2023-01-20 12:02:35 +00:00
Nikolai Kochetov	1e29993aef	Fixing build	2023-01-20 11:55:20 +00:00
Robert Schulze	0738b2499c	Use GinFilters typedef where possible	2023-01-20 11:52:04 +00:00
Maksim Kita	3e08a98f16	Merge pull request #45388 from azat/dict/remove-preallocate Remove PREALLOCATE for HASHED/SPARSE_HASHED dictionaries	2023-01-20 14:51:25 +03:00

... 3 4 5 6 7 ...

106269 Commits