Commit Graph

14392 Commits

Author SHA1 Message Date
neoman36
b76277ca00 named-collections.md is updated 2024-08-16 12:23:20 +02:00
Alexey Milovidov
a4ade2347c
Merge pull request #68308 from rschu1ze/enable-simsimd
Various vector similarity index related fixes
2024-08-16 00:41:56 +00:00
Kruglov Pavel
c4d9944a90
Merge pull request #68382 from Avogar/better-datetime-inference
Improve schema inference of date times
2024-08-15 19:33:37 +00:00
Nikita Taranov
e98c89ebcb
Merge pull request #68164 from petern48/add_create_if_not_exists_setting
Add create_if_not_exists setting
2024-08-15 19:16:00 +00:00
jsc0218
ee4b1eaaed
Merge pull request #68411 from jsc0218/DocForLWDProj
Add Doc for LWD and Projection
2024-08-15 14:43:59 +00:00
jsc0218
fb037bcc72 move to mergetree setting and add more info 2024-08-15 13:48:43 +00:00
avogar
50a8cee0c5 Update docs 2024-08-15 11:39:04 +00:00
Robert Schulze
38a2b0dcc7
Allow Array(Float64) as type of underlying column 2024-08-15 10:47:55 +00:00
Alexey Milovidov
dbbd2c7875
Merge pull request #68248 from rschu1ze/update-usearch
Bump usearch to v2.13.2
2024-08-15 01:24:17 +00:00
Robert Schulze
6170a8663f
Bump usearch to 2.13.2 2024-08-14 08:04:00 +00:00
jsc0218
82dfc336be
Merge pull request #68295 from austin-tildei/patch-1
Remove trailing colon in header for consistency
2024-08-13 19:49:24 +00:00
Kruglov Pavel
6c8ad63104
Merge pull request #68132 from Avogar/better-dynamic
Reimplement Dynamic type
2024-08-13 19:21:52 +00:00
jsc0218
2bf9ca6c43
Merge pull request #68245 from jsc0218/SpecialMergeTreeProjMergeDoc
Add Projection Merge Doc
2024-08-13 19:21:01 +00:00
Austin Bruch
7e209ebdf6
Remove trailing colon in header for consistency 2024-08-13 12:30:17 -04:00
Nikita Taranov
dd3fa7c3b6
Merge pull request #61793 from nickitat/keep_alive_max_reqs
Implement `max_keep_alive_requests` setting for server
2024-08-13 15:06:43 +00:00
Robert Schulze
2ffcc97af2
Merge pull request #63675 from rschu1ze/vector-search
Initial implementation of vector similarity index
2024-08-13 15:06:20 +00:00
Peter Nguyen
94604126fe
Merge branch 'ClickHouse:master' into add_create_if_not_exists_setting 2024-08-13 07:25:26 -06:00
Robert Schulze
99282e526a
Merge pull request #68235 from sakulali/query_cache_tag
QueryCache: Add tagging
2024-08-13 10:44:10 +00:00
Yarik Briukhovetskyi
086c0f03a6
Merge pull request #65997 from yariks5s/hive_style_partitioning
Implementing Hive-style partitioning
2024-08-13 10:04:21 +00:00
vdimir
dfb892ba5f
Merge pull request #66616 from Blargian/docs_getXYZ
add documentation for `getSubcolumn` and `getTypeSerializationStreams`
2024-08-13 09:35:44 +00:00
avogar
03182c7a8f Merge branch 'master' of github.com:ClickHouse/ClickHouse into better-dynamic 2024-08-12 20:29:24 +00:00
Han Fei
40382451a2
Merge pull request #68186 from rschu1ze/stats-tests-refactoring
Refactor tests for (experimental) statistics
2024-08-12 18:58:19 +00:00
Shaun Struwig
aa7a2bcb02
Fix typo 2024-08-12 20:34:02 +02:00
jsc0218
38405dd7cd add projection merge doc 2024-08-12 18:14:22 +00:00
Robert Schulze
c22265b889
Some fixups 2024-08-12 17:45:38 +00:00
Robert Schulze
fe537045c9
Merge remote-tracking branch 'ClickHouse/master' into query_cache_tag 2024-08-12 16:16:32 +00:00
Robert Schulze
fb76cb90b1
Allow un-quoted skip index parameters
Previously, only this syntax to create a skip index worked:

   INDEX index_name column_name TYPE vector_similarity('hnsw', 'L2Distance')

Now, this syntax will work as well:

  INDEX index_name column_name TYPE vector_similarity(hnsw, L2Distance)
2024-08-12 15:32:25 +00:00
Robert Schulze
d2e79f0b92
Rework vector index parameters
USearch (similar to FAISS) allows to specify the distance function,
quantization, and various HNSW meta-parameters for index creation and
sarch. Some users wished for greater configurability, so let's expose
them.

Index creation now requires either
- 2 parameters (with the other 4 parameters taking on default values), or
- 6 parameters for full control

This commit also remove quantization `f64` (that would be upsampling).
2024-08-12 15:32:19 +00:00
Robert Schulze
785b6637fa
Rename index type "usearch" to "vector_similarity"
First, index type "vector_similarity" is more speaking and user-friendly
than "usearch". Second, we should not expose the name of the library
doing the job (usearch). Of course, the docs will continue to mention
usearch (credit where credit is due).

Existing setting `allow_experimental_usearch_index` was marked obsolete.
A new settings `allow_experimental_vector_similarity_index` was added.
2024-08-12 15:30:45 +00:00
Kruglov Pavel
bba4a90a9c
Merge branch 'master' into better-dynamic 2024-08-12 17:28:09 +02:00
Robert Schulze
40bed3e20f
Remove support for WHERE-type queries
These kind of vector search similarity queries are rather obscure and
rare in practice. They require the user to specify a maximum distance
which is not intuitive to obtain. Furthermore, these queries are not
natively supported in USearch, so the vector search index had to emulate
these queries.

Therefore simplifying the code base and restricting vector search to
ORDER-BY queries only.
2024-08-12 15:25:52 +00:00
Robert Schulze
218421c255
Remove Annoy indexes
Annoy indexes fell out of favor in the community, at least when it comes
to vector databases. Such indexes work okay-ish low dimensions but they
suffers badly from a curse of dimensionality which makes them inapt for
a high number of dimensions.

Now that Annoy is gone, issue (*) also disappears and we can drop
'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.

(*) spotify/annoy#456
2024-08-12 15:24:49 +00:00
Robert Schulze
85f63b056b
Merge pull request #68135 from ClickHouse/refactor-field-get
Only use Field::safeGet - Field::get prone to type punning
2024-08-12 14:25:11 +00:00
Peter Nguyen
c817a4e8ad
Update settings.md to clarify create_if_not_exists behavior
Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2024-08-12 07:45:51 -06:00
Yarik Briukhovetskyi
8bc89ac8df
Merge branch 'master' into hive_style_partitioning 2024-08-12 11:44:45 +02:00
Robert Schulze
574c445be9
Refactor tests for (experimental) statistics 2024-08-12 05:56:16 +00:00
sakulali
957a0b6ea4 Add a setting query_cache_tag 2024-08-12 08:40:01 +08:00
Peter Nguyen
2af1134c08 Update settings.md docs with new create_if_not_exists settings 2024-08-11 16:59:52 -06:00
Yakov Olkhovskiy
5c8665c660 fix system.kafka_consumers and doc, fix tidy 2024-08-11 20:40:55 +00:00
Vladimir Varankin
d314e5aa45
typos in prometheus.md 2024-08-11 18:37:29 +02:00
Alexander Tokmakov
53bc1b7e35
Revert "Refactor tests for (experimental) statistics" 2024-08-11 13:19:36 +02:00
Robert Schulze
f7e7a884b5
Merge pull request #67962 from Blargian/docs_toDecimalXYZ
Docs:`toDecimal32/64/128/256` and variants
2024-08-11 09:23:40 +00:00
Robert Schulze
45db564354
Merge pull request #68034 from rschu1ze/stats-tests-refactoring
Refactor tests for (experimental) statistics
2024-08-11 08:43:31 +00:00
Blargian
e582118544 review changes 2024-08-10 13:09:40 +02:00
Alexey Milovidov
e6e06af7bd
Merge pull request #64281 from bigo-sg/extend_inequal_join
`Any/Semi/Anti` join support mixed join conditions
2024-08-09 21:36:31 +00:00
Alexey Milovidov
a79eff9dcf
Merge pull request #67647 from jacobrec/multilinestring
Added support for reading MultiLineString WKTs
2024-08-09 21:35:55 +00:00
avogar
c61eef4a76 Reimplement Dynamic type 2024-08-09 21:17:49 +00:00
Nikita Taranov
ffa1371dde Merge branch 'master' into keep_alive_max_reqs 2024-08-09 12:24:45 +01:00
Yarik Briukhovetskyi
aebb07884b
Merge branch 'master' into hive_style_partitioning 2024-08-09 11:53:14 +02:00
Robert Schulze
9b94bbc87e
Merge remote-tracking branch 'ClickHouse/master' into stats-tests-refactoring 2024-08-09 07:54:35 +00:00