Robert Schulze
27a6931a35
Cosmetics: variable naming
2024-08-12 15:29:59 +00:00
Robert Schulze
289c27c804
Introduce version for index files in persistence
2024-08-12 15:29:02 +00:00
Robert Schulze
4ad624cb7e
Cosmetics
2024-08-12 15:28:58 +00:00
Robert Schulze
74de79e52b
Add logging of basic statistics
2024-08-12 15:28:46 +00:00
Robert Schulze
8853b3359b
Remove useless templatization
...
Makes the code cleaner, compile faster, and the binary smaller.
2024-08-12 15:27:06 +00:00
Robert Schulze
4f23f7754b
Cosmetics
2024-08-12 15:26:05 +00:00
Robert Schulze
7f611681df
Add a similar sanity check as in other skipping indexes
2024-08-12 15:26:01 +00:00
Robert Schulze
f944ef25bb
Better handling of errors during add, search, and save
2024-08-12 15:25:58 +00:00
Robert Schulze
e7c2bf49c3
Add detach/attach test
2024-08-12 15:25:55 +00:00
Robert Schulze
40bed3e20f
Remove support for WHERE-type queries
...
These kinds of vector similarity search queries are rather obscure and
rare in practice. They require the user to specify a maximum distance,
which is not intuitive to obtain. Furthermore, these queries are not
natively supported in USearch, so the vector search index had to emulate
them.
Therefore, simplify the code base and restrict vector search to
ORDER-BY queries only.
2024-08-12 15:25:52 +00:00
Robert Schulze
abb8e61981
Remove support code for Lp norm in vector search
...
It is a generalization of other norms, too expensive to calculate, and
not relevant in practice. Also, USearch doesn't support it.
2024-08-12 15:25:48 +00:00
Robert Schulze
65186f0b69
Remove tuple support
...
Indexes for approximate nearest neighbor (ANN) search (USearch) can
be built on columns of type Array(Float32) or Tuple(Float32[, Float32[, ...]]).
In practice, Array(Float32) is the only relevant data type.
Arrays store high-dimensional embeddings consecutively (--> cache
locality), and the additional flexibility of different data types in a
tuple is not needed for vector search.
Therefore, remove support for ANN indexes over tuple columns to
simplify the code, tests, and docs.
2024-08-12 15:25:39 +00:00
Robert Schulze
218421c255
Remove Annoy indexes
...
Annoy indexes fell out of favor in the community, at least when it comes
to vector databases. Such indexes work okay-ish in low dimensions, but they
suffer badly from the curse of dimensionality, which makes them unsuitable
for a high number of dimensions.
Now that Annoy is gone, issue (*) also disappears and we can drop
'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.
(*) spotify/annoy#456
2024-08-12 15:24:49 +00:00
Robert Schulze
7c41939921
Fix test results (no analyzer support yet ...)
2024-08-12 15:24:22 +00:00
Robert Schulze
d7211f9d12
Fix CMake integration of usearch and annoy
...
Registers usearch and annoy properly via configure_config.cmake and
config.h.in like all other 3rd party libs, instead of (mis)using
target_compile_definitions.
2024-08-12 15:24:18 +00:00
Robert Schulze
a39b9cf643
Un-screw usearch's build description
...
No directory 'SimSIMD-map' exists, the build only worked because SimSIMD
support in usearch was (accidentally?) disabled. This commit corrects
the build description. SimSIMD support in usearch will be enabled by a
later commit.
2024-08-12 15:24:14 +00:00
Robert Schulze
85f63b056b
Merge pull request #68135 from ClickHouse/refactor-field-get
...
Only use Field::safeGet - Field::get prone to type punning
2024-08-12 14:25:11 +00:00
Pablo Marcos
da5b9582a9
Fix indent
2024-08-12 13:54:17 +00:00
János Benjamin Antal
6cde029ed9
Fix style
2024-08-12 13:48:44 +00:00
Pablo Marcos
f7c6eabb49
Small fix to filter by current_database in system.query_log
2024-08-12 13:44:05 +00:00
János Benjamin Antal
34643ee16c
Run test only from modified files
2024-08-12 13:30:25 +00:00
Robert Schulze
037a1006fd
Merge remote-tracking branch 'ClickHouse/master' into ci-fuzzer-enable
2024-08-12 12:28:32 +00:00
Nikita Taranov
2f546fb513
Merge pull request #68098 from aiven-sal/aiven-sal/segfault
...
Fix UB in hopEnd, hopStart, tumbleEnd, and tumbleStart
2024-08-12 12:09:23 +00:00
János Benjamin Antal
737948470d
Fix syntax
2024-08-12 11:54:34 +00:00
Sema Checherinda
5e836bc20e
Merge pull request #67472 from ClickHouse/chesema-02765
...
speed up system flush logs
2024-08-12 11:51:55 +00:00
Antonio Andelic
e9d16bc054
Use new mc restart
2024-08-12 13:39:53 +02:00
vdimir
52f37f2ec6
Merge pull request #67980 from ClickHouse/vdimir/fix_03130_convert_outer_join_to_inner_join
...
Fix 03130_convert_outer_join_to_inner_join
2024-08-12 11:34:10 +00:00
Pablo Marcos
06ceaee502
Fix test 01903_correct_block_size_prediction_with_default
...
- Don't allow random settings that affect the memory usage
- Run two queries and compare the memory usage, rather than
having an arbitrary hardcoded value
2024-08-12 11:11:10 +00:00
János Benjamin Antal
8f124710ef
Remove Log engine from Kafka integration tests
...
It doesn't work well when `thread_per_consumer` is used, as the writer can make readers starve when `shared_time_mutex` prefers writes over reads.
2024-08-12 10:27:21 +00:00
Yarik Briukhovetskyi
8bc89ac8df
Merge branch 'master' into hive_style_partitioning
2024-08-12 11:44:45 +02:00
Robert Schulze
ee433684dd
Follow-up to ClickHouse#63898, pt. III
2024-08-12 08:15:03 +00:00
Robert Schulze
0aa30b10d5
Merge pull request #68069 from rschu1ze/cmake-cleanup
...
Minor CMake cleanup
2024-08-12 06:43:00 +00:00
Robert Schulze
574c445be9
Refactor tests for (experimental) statistics
2024-08-12 05:56:16 +00:00
Alexey Milovidov
6016dc96aa
Fix test 01172_transaction_counters
2024-08-12 07:19:54 +02:00
Alexey Milovidov
6f189e9eb7
Fix leftovers
2024-08-12 07:17:17 +02:00
Alexey Milovidov
b92a8f0fbc
Fix leftovers
2024-08-12 07:16:25 +02:00
Alexey Milovidov
b3504def35
Fix leftovers
2024-08-12 07:15:08 +02:00
Alexey Milovidov
2a12604cf5
Merge pull request #66494 from azat/gdb-image
...
Update gdb to 15.1 (by compiling from sources)
2024-08-12 05:04:57 +00:00
Alexey Milovidov
94fe53de64
Merge branch 'master' into vdimir/fix_03130_convert_outer_join_to_inner_join
2024-08-12 07:03:34 +02:00
Alexey Milovidov
5acf9f6f81
Fix test_cluster_all_replicas
2024-08-12 07:01:52 +02:00
Alexey Milovidov
0b1887eb65
Merge pull request #68138 from jsc0218/Fix01710
...
Fix01710 Timeout
2024-08-12 04:07:52 +00:00
sakulali
957a0b6ea4
Add a setting query_cache_tag
2024-08-12 08:40:01 +08:00
Alexey Milovidov
c1c8e6dd8d
Merge pull request #68099 from GrahamCampbell/patch-2
...
Do not apply redundant sorting removal when there's an offset
2024-08-11 23:59:15 +00:00
Alexey Milovidov
d18a68f285
Merge pull request #68160 from azat/tests/02122_join_group_by_timeout
...
tests: fix 02122_join_group_by_timeout flakiness
2024-08-12 02:20:51 +02:00
Alexey Milovidov
c462f4639b
Merge pull request #68161 from narqo/patch-1
...
Fix typos in Prometheus protocol docs
2024-08-11 23:55:53 +00:00
Igor Nikonov
b6f5eb1211
Fix screwed merge
2024-08-11 23:15:19 +00:00
Alexey Milovidov
d2a9eaaa01
Merge pull request #68157 from azat/local-fix-log
...
Remove "Processing configuration file" message from clickhouse-local
2024-08-11 23:07:43 +00:00
Yakov Olkhovskiy
5c8665c660
fix system.kafka_consumers and doc, fix tidy
2024-08-11 20:40:55 +00:00
Igor Nikonov
d04db7e26d
Merge remote-tracking branch 'origin/master' into pr-local-plan
2024-08-11 20:11:32 +00:00
Azat Khuzhin
8a48b33344
Fix settings/current_database in system.processes for async BACKUP/RESTORE
...
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-08-11 21:27:08 +02:00