ClickHouse

mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-11-28 10:31:57 +00:00

Author	SHA1	Message	Date
Robert Schulze	7f611681df	Add a similar sanity check as in other skipping indexes	2024-08-12 15:26:01 +00:00
Robert Schulze	f944ef25bb	Better handling of errors during add, search, and save	2024-08-12 15:25:58 +00:00
Robert Schulze	e7c2bf49c3	Add detach/attach test	2024-08-12 15:25:55 +00:00
Robert Schulze	40bed3e20f	Remove support for WHERE-type queries. These kinds of vector search similarity queries are rather obscure and rare in practice. They require the user to specify a maximum distance, which is not intuitive to obtain. Furthermore, these queries are not natively supported in USearch, so the vector search index had to emulate them. This commit therefore simplifies the code base and restricts vector search to ORDER-BY queries only.	2024-08-12 15:25:52 +00:00
Robert Schulze	abb8e61981	Remove support code for Lp norm in vector search. It is a generalization of other norms, too expensive to calculate, and not relevant in practice. Also, USearch doesn't support it.	2024-08-12 15:25:48 +00:00
Robert Schulze	65186f0b69	Remove tuple support. Indexes for approximate nearest neighbour (ANN) search (USearch) can be built on columns of type Array(Float32) or Tuple(Float32[, Float32[, ...]]). In practice, Array(Float32) is the only relevant data type. Arrays store high-dimensional embeddings consecutively (--> cache locality), and the additional flexibility of different data types in a tuple is not needed for vector search. This commit therefore removes support for ANN indexes over tuple columns to simplify the code, tests, and docs.	2024-08-12 15:25:39 +00:00
Robert Schulze	218421c255	Remove Annoy indexes. Annoy indexes fell out of favor in the community, at least when it comes to vector databases. Such indexes work okay-ish in low dimensions, but they suffer badly from the curse of dimensionality, which makes them unsuitable for a high number of dimensions. Now that Annoy is gone, issue spotify/annoy#456 also disappears and we can drop 'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.	2024-08-12 15:24:49 +00:00
Robert Schulze	7c41939921	Fix test results (no analyzer support yet ...)	2024-08-12 15:24:22 +00:00
Robert Schulze	d7211f9d12	Fix CMake integration of usearch and annoy. Registers usearch and annoy properly via configure_config.cmake and config.h.in like all other 3rd party libs, instead of (mis)using target_compile_definitions.	2024-08-12 15:24:18 +00:00
Robert Schulze	a39b9cf643	Un-screw usearch's build description. No directory 'SimSIMD-map' exists; the build only worked because SimSIMD support in usearch was (accidentally?) disabled. This commit corrects the build description. SimSIMD support in usearch will be enabled by a later commit.	2024-08-12 15:24:14 +00:00
divanik	eb3ffb7184	Add supportsReplication	2024-08-12 15:09:16 +00:00
Robert Schulze	85f63b056b	Merge pull request #68135 from ClickHouse/refactor-field-get Only use Field::safeGet - Field::get prone to type punning	2024-08-12 14:25:11 +00:00
Pablo Marcos	da5b9582a9	Fix indent	2024-08-12 13:54:17 +00:00
János Benjamin Antal	6cde029ed9	Fix style	2024-08-12 13:48:44 +00:00
Pablo Marcos	f7c6eabb49	Small fix to filter by current_database in system.query_log	2024-08-12 13:44:05 +00:00
János Benjamin Antal	34643ee16c	Run test only from modified files	2024-08-12 13:30:25 +00:00
Robert Schulze	037a1006fd	Merge remote-tracking branch 'ClickHouse/master' into ci-fuzzer-enable	2024-08-12 12:28:32 +00:00
Nikita Taranov	2f546fb513	Merge pull request #68098 from aiven-sal/aiven-sal/segfault Fix UB in hopEnd, hopStart, tumbleEnd, and tumbleStart	2024-08-12 12:09:23 +00:00
János Benjamin Antal	737948470d	Fix syntax	2024-08-12 11:54:34 +00:00
Sema Checherinda	5e836bc20e	Merge pull request #67472 from ClickHouse/chesema-02765 speed up system flush logs	2024-08-12 11:51:55 +00:00
Antonio Andelic	e9d16bc054	Use new mc restart	2024-08-12 13:39:53 +02:00
vdimir	52f37f2ec6	Merge pull request #67980 from ClickHouse/vdimir/fix_03130_convert_outer_join_to_inner_join Fix 03130_convert_outer_join_to_inner_join	2024-08-12 11:34:10 +00:00
Pablo Marcos	06ceaee502	Fix test 01903_correct_block_size_prediction_with_default: don't allow random settings that affect the memory usage; run two queries and compare the memory usage, rather than having an arbitrary hardcoded value.	2024-08-12 11:11:10 +00:00
János Benjamin Antal	8f124710ef	Remove Log engine from Kafka integration tests. It doesn't work well when `thread_per_consumer` is used, as the writer can make readers starve when `shared_time_mutex` prefers writes over reads.	2024-08-12 10:27:21 +00:00
Yarik Briukhovetskyi	8bc89ac8df	Merge branch 'master' into hive_style_partitioning	2024-08-12 11:44:45 +02:00
Robert Schulze	ee433684dd	Follow-up to ClickHouse#63898, pt. III	2024-08-12 08:15:03 +00:00
Robert Schulze	0aa30b10d5	Merge pull request #68069 from rschu1ze/cmake-cleanup Minor CMake cleanup	2024-08-12 06:43:00 +00:00
Robert Schulze	574c445be9	Refactor tests for (experimental) statistics	2024-08-12 05:56:16 +00:00
Alexey Milovidov	6016dc96aa	Fix test `01172_transaction_counters`	2024-08-12 07:19:54 +02:00
Alexey Milovidov	6f189e9eb7	Fix leftovers	2024-08-12 07:17:17 +02:00
Alexey Milovidov	b92a8f0fbc	Fix leftovers	2024-08-12 07:16:25 +02:00
Alexey Milovidov	b3504def35	Fix leftovers	2024-08-12 07:15:08 +02:00
Alexey Milovidov	2a12604cf5	Merge pull request #66494 from azat/gdb-image Update gdb to 15.1 (by compiling from sources)	2024-08-12 05:04:57 +00:00
Alexey Milovidov	94fe53de64	Merge branch 'master' into vdimir/fix_03130_convert_outer_join_to_inner_join	2024-08-12 07:03:34 +02:00
Alexey Milovidov	5acf9f6f81	Fix `test_cluster_all_replicas`	2024-08-12 07:01:52 +02:00
Alexey Milovidov	0b1887eb65	Merge pull request #68138 from jsc0218/Fix01710 Fix01710 Timeout	2024-08-12 04:07:52 +00:00
sakulali	957a0b6ea4	Add a setting query_cache_tag	2024-08-12 08:40:01 +08:00
Alexey Milovidov	c1c8e6dd8d	Merge pull request #68099 from GrahamCampbell/patch-2 Do not apply redundant sorting removal when there's an offset	2024-08-11 23:59:15 +00:00
Alexey Milovidov	d18a68f285	Merge pull request #68160 from azat/tests/02122_join_group_by_timeout tests: fix 02122_join_group_by_timeout flakiness	2024-08-12 02:20:51 +02:00
Alexey Milovidov	c462f4639b	Merge pull request #68161 from narqo/patch-1 Fix typos in Prometheus protocol docs	2024-08-11 23:55:53 +00:00
Alexey Milovidov	d2a9eaaa01	Merge pull request #68157 from azat/local-fix-log Remove "Processing configuration file" message from clickhouse-local	2024-08-11 23:07:43 +00:00
Yakov Olkhovskiy	5c8665c660	fix system.kafka_consumers and doc, fix tidy	2024-08-11 20:40:55 +00:00
Azat Khuzhin	8a48b33344	Fix settings/current_database in system.processes for async BACKUP/RESTORE Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2024-08-11 21:27:08 +02:00
Alexey Milovidov	4de79653ea	Merge pull request #68134 from azat/tests/01246_buffer_flush tests: fix 01246_buffer_flush flakiness due to slow trace_log flush	2024-08-11 17:56:05 +00:00
Azat Khuzhin	e384e2c38e	tests: fix 02122_join_group_by_timeout flakiness CI found [1] failure of the test: 2024-08-11 21:06:07 /usr/share/clickhouse-test/queries/0_stateless/02122_join_group_by_timeout.sh: line 51: 52614 Killed timeout -s KILL $MAX_PROCESS_WAIT $CLICKHOUSE_CLIENT -q "SELECT a.name as n And the problem is not the server, but the client, since query executed for ~1 second: 2024.08.11 21:06:02.284318 [ 49232 ] {ba989ee2-f615-49ca-bcd8-31b3916aeb2c} <Debug> executeQuery: (from [::1]:54144) (comment: 02122_join_group_by_timeout.sh) SELECT a.name as n FROM ( SELECT 'Name' as name, number FROM system.numbers LIMIT 2000000 ) AS a, ( SELECT 'Name' as name2, number FROM system.numbers LIMIT 2000000 ) as b FORMAT Null SETTINGS max_execution_time = 1, timeout_overflow_mode = 'break' (stage: Complete) 2024.08.11 21:06:03.331249 [ 49232 ] {ba989ee2-f615-49ca-bcd8-31b3916aeb2c} <Debug> executeQuery: Read 517104 rows, 3.95 MiB in 1.072023 sec., 482362.78512681165 rows/sec., 3.68 MiB/sec. [1]: https://s3.amazonaws.com/clickhouse-test-reports/67134/18da3f0ab63da1eef9396627d0dfd56cf5356f65/stateless_tests__msan__[1_4].html So instead of using timeout, let's use time from the system.query_log instead. Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>	2024-08-11 18:39:09 +02:00
Vladimir Varankin	d314e5aa45	typos in prometheus.md	2024-08-11 18:37:29 +02:00
Yakov Olkhovskiy	8e706265e6	fix	2024-08-11 16:29:35 +00:00
Igor Nikonov	4ef3fe416d	Fix and simplify test	2024-08-11 13:08:53 +00:00
Yakov Olkhovskiy	4fec61da55	fix wrong datatype in system.kafka_consumers	2024-08-11 12:35:27 +00:00
Igor Nikonov	fbf4baf47e	Merge remote-tracking branch 'origin/master' into patch-2	2024-08-11 11:52:42 +00:00


150686 Commits