Commit Graph

19205 Commits

Author SHA1 Message Date
Robert Schulze
d2e79f0b92
Rework vector index parameters
USearch (similar to FAISS) allows specifying the distance function,
quantization, and various HNSW meta-parameters for index creation and
search. Some users wished for greater configurability, so let's expose
them.

Index creation now requires either
- 2 parameters (with the other 4 parameters taking on default values), or
- 6 parameters for full control

This commit also removes quantization `f64` (that would be upsampling).
2024-08-12 15:32:19 +00:00
Robert Schulze
785b6637fa
Rename index type "usearch" to "vector_similarity"
First, index type "vector_similarity" is more descriptive and user-friendly
than "usearch". Second, we should not expose the name of the library
doing the job (usearch). Of course, the docs will continue to mention
usearch (credit where credit is due).

Existing setting `allow_experimental_usearch_index` was marked obsolete.
A new setting `allow_experimental_vector_similarity_index` was added.
2024-08-12 15:30:45 +00:00
Kruglov Pavel
bba4a90a9c
Merge branch 'master' into better-dynamic 2024-08-12 17:28:09 +02:00
Robert Schulze
40bed3e20f
Remove support for WHERE-type queries
These kinds of vector search similarity queries are rather obscure and
rare in practice. They require the user to specify a maximum distance
which is not intuitive to obtain. Furthermore, these queries are not
natively supported in USearch, so the vector search index had to emulate
these queries.

Therefore, simplify the code base and restrict vector search to
ORDER-BY queries only.
2024-08-12 15:25:52 +00:00
Robert Schulze
218421c255
Remove Annoy indexes
Annoy indexes fell out of favor in the community, at least when it comes
to vector databases. Such indexes work okay-ish in low dimensions but they
suffer badly from the curse of dimensionality, which makes them unsuitable
for a high number of dimensions.

Now that Annoy is gone, issue (*) also disappears and we can drop
'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.

(*) spotify/annoy#456
2024-08-12 15:24:49 +00:00
Robert Schulze
85f63b056b
Merge pull request #68135 from ClickHouse/refactor-field-get
Only use Field::safeGet - Field::get prone to type punning
2024-08-12 14:25:11 +00:00
Peter Nguyen
c817a4e8ad
Update settings.md to clarify create_if_not_exists behavior
Co-authored-by: Nikita Taranov <nickita.taranov@gmail.com>
2024-08-12 07:45:51 -06:00
avogar
9296d4ecfb Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-12 11:55:39 +00:00
Yarik Briukhovetskyi
8bc89ac8df
Merge branch 'master' into hive_style_partitioning 2024-08-12 11:44:45 +02:00
Robert Schulze
574c445be9
Refactor tests for (experimental) statistics 2024-08-12 05:56:16 +00:00
Alexey Milovidov
b92a8f0fbc Fix leftovers 2024-08-12 07:16:25 +02:00
sakulali
957a0b6ea4 Add a setting query_cache_tag 2024-08-12 08:40:01 +08:00
Peter Nguyen
2af1134c08 Update settings.md docs with new create_if_not_exists settings 2024-08-11 16:59:52 -06:00
Yakov Olkhovskiy
5c8665c660 fix system.kafka_consumers and doc, fix tidy 2024-08-11 20:40:55 +00:00
Vladimir Varankin
d314e5aa45
typos in prometheus.md 2024-08-11 18:37:29 +02:00
Alexander Tokmakov
53bc1b7e35
Revert "Refactor tests for (experimental) statistics" 2024-08-11 13:19:36 +02:00
Robert Schulze
f7e7a884b5
Merge pull request #67962 from Blargian/docs_toDecimalXYZ
Docs:`toDecimal32/64/128/256` and variants
2024-08-11 09:23:40 +00:00
Robert Schulze
45db564354
Merge pull request #68034 from rschu1ze/stats-tests-refactoring
Refactor tests for (experimental) statistics
2024-08-11 08:43:31 +00:00
Blargian
e582118544 review changes 2024-08-10 13:09:40 +02:00
Alexey Milovidov
e6e06af7bd
Merge pull request #64281 from bigo-sg/extend_inequal_join
`Any/Semi/Anti` join support mixed join conditions
2024-08-09 21:36:31 +00:00
Alexey Milovidov
a79eff9dcf
Merge pull request #67647 from jacobrec/multilinestring
Added support for reading MultiLineString WKTs
2024-08-09 21:35:55 +00:00
avogar
c61eef4a76 Reimplement Dynamic type 2024-08-09 21:17:49 +00:00
Nikita Taranov
ffa1371dde Merge branch 'master' into keep_alive_max_reqs 2024-08-09 12:24:45 +01:00
Yarik Briukhovetskyi
aebb07884b
Merge branch 'master' into hive_style_partitioning 2024-08-09 11:53:14 +02:00
Robert Schulze
9b94bbc87e
Merge remote-tracking branch 'ClickHouse/master' into stats-tests-refactoring 2024-08-09 07:54:35 +00:00
pufit
21f8b03789
Merge pull request #67655 from ClickHouse/pufit/fuzz-query-revert-revert
FuzzQuery table function (resubmit)
2024-08-09 05:57:43 +00:00
Alexey Milovidov
89ea89ad67
Merge pull request #67885 from canhld94/drop_detached_partition_all
Support DROP DETACHED PARTITION ALL
2024-08-09 05:20:55 +00:00
Justin de Guzman
0071765138
Merge pull request #67940 from ClickHouse/prometheus-documentation
Add documentation for Prometheus protocols and TimeSeries engine.
2024-08-09 01:03:26 +00:00
pufit
99fbef3340
Merge branch 'master' into pufit/fuzz-query-revert-revert 2024-08-08 20:12:23 -04:00
justindeguzman
eb4ea07577 [Docs] Fix broken links 2024-08-08 16:55:39 -07:00
Robert Schulze
fece58ae82
Merge remote-tracking branch 'ClickHouse/master' into stats-tests-refactoring 2024-08-08 17:38:38 +00:00
Jacob Reckhard
cd69fa5a4c fixed typos 2024-08-08 11:20:55 -06:00
Robert Schulze
076c4a9ce9
Merge pull request #67930 from rschu1ze/fix-stat-assert
Fix stress test error with TDigest statistics
2024-08-08 16:34:58 +00:00
Yarik Briukhovetskyi
a5fcf85fb6
Merge branch 'master' into hive_style_partitioning 2024-08-08 17:54:53 +02:00
Duc Canh Le
fdd8d5e5d8 Merge branch 'master' into drop_detached_partition_all
Fix CI
2024-08-08 14:44:10 +00:00
robot-clickhouse
06d7691fa6
Merge pull request #68045 from ClickHouse/auto/v24.7.3.42-stable
Update version_date.tsv and changelog after v24.7.3.42-stable
2024-08-08 14:13:09 +00:00
János Benjamin Antal
c9adfe246b
Merge pull request #57625 from ClickHouse/kafka-zookeeper
Kafka ZooKeeper
2024-08-08 14:02:27 +00:00
robot-clickhouse
9cf60214e1 Update version_date.tsv and changelogs after v24.7.3.42-stable 2024-08-08 13:43:12 +00:00
Alexey Katsman
f00f18ae69
Merge pull request #68031 from ClickHouse/tavplubix-patch-10
Add a link in trace_log docs
2024-08-08 12:59:07 +00:00
Yarik Briukhovetskyi
edccd30016
Merge branch 'ClickHouse:master' into hive_style_partitioning 2024-08-08 14:52:33 +02:00
Robert Schulze
b0ba53788a
Refactor tests for (experimental) statistics 2024-08-08 11:21:23 +00:00
Ilya Yatsishin
f21a48dac6
Merge pull request #67064 from zvonand/hdfs-double-asterisk
Add `**` to `hdfs` docs, add test for `**` in `hdfs`
2024-08-08 11:12:55 +00:00
Robert Schulze
18085eca8a
Merge pull request #68027 from rschu1ze/docs-3rdparty
Docs: Update 3rd party library guide
2024-08-08 10:51:28 +00:00
Alexander Tokmakov
2e5b71cdb1
Update trace_log.md 2024-08-08 12:27:03 +02:00
Yarik Briukhovetskyi
6384488826
Merge branch 'ClickHouse:master' into hive_style_partitioning 2024-08-08 11:45:01 +02:00
Robert Schulze
37641a0b4b
Merge remote-tracking branch 'ClickHouse/master' into fix-stat-assert 2024-08-08 08:57:22 +00:00
Robert Schulze
76b8bcd97a
Update 2024-08-08 08:46:46 +00:00
Robert Schulze
effaeeeeac
Update 3rd party lib guide 2024-08-08 08:43:39 +00:00
Dmitry Novik
486d717e88
Merge pull request #67661 from bigo-sg/win_bug
Fix crash on `percent_rank`
2024-08-08 08:28:48 +00:00
János Benjamin Antal
92be2db5b6 Merge remote-tracking branch 'origin/master' into kafka-zookeeper 2024-08-08 08:01:43 +00:00
pufit
6c0cc194d3
Merge branch 'master' into pufit/fuzz-query-revert-revert 2024-08-07 23:26:05 -04:00
lgbo-ustc
daf62e1682 update 2024-08-08 08:30:10 +08:00
Yarik Briukhovetskyi
9e3a2931d2
Merge branch 'master' into hive_style_partitioning 2024-08-07 23:59:34 +02:00
Andrey Zvonov
c6482a092b
Merge branch 'master' into hdfs-double-asterisk 2024-08-07 21:42:00 +03:00
Nikita Mikhaylov
722fb1f512
Merge pull request #68005 from den-crane/patch-72
Docs: Fix heading level
2024-08-07 18:21:27 +00:00
Yarik Briukhovetskyi
e3abcc7ae9
Merge pull request #68004 from ClickHouse/qoega-patch-7
Minor fixes in tables.md
2024-08-07 17:43:35 +00:00
Kseniia Sumarokova
315fd5496a
Merge pull request #65386 from skyoct/feat-s3-field
Feat add _etag for object storage
2024-08-07 17:35:43 +00:00
Denny Crane
3b48a1a92c
Update http.md 2024-08-07 14:01:40 -03:00
Ilya Yatsishin
4fb1febe48
Update table.md 2024-08-07 18:51:24 +02:00
max-vostrikov
7dc55faf73
Merge pull request #67979 from ClickHouse/docs-for-lightweight_mutation_projection_mode
docs for lightweight_mutation_projection_mode option
2024-08-07 14:31:02 +00:00
Dmitry Novik
27c54d7cce
Merge pull request #67985 from ClickHouse/fix-memory-overcommit-docs
Fix documentation for memory overcommit
2024-08-07 14:19:27 +00:00
Nikita Fomichev
ed94f94657
Merge pull request #67970 from ClickHouse/fix_docs_for_groupconcat_function
fix docs for groupConcat function
2024-08-07 14:17:55 +00:00
Dmitry Novik
ecba21bfe7
Fix documentation for memory overcommit 2024-08-07 15:23:19 +02:00
maxvostrikov
37a6bd25f6 squash! docs for lightweight_mutation_projection_mode option starting with 24.7 we can lightweight delete in tables with projections. Fixed docs stating it is not possible 2024-08-07 14:06:02 +02:00
maxvostrikov
c39cdcffaf docs for lightweight_mutation_projection_mode option
Starting with 24.7, lightweight deletes work in tables with projections. Fixed docs stating it is not possible.
2024-08-07 14:03:53 +02:00
max-vostrikov
d4782665f8
Merge pull request #67974 from ClickHouse/fix_docs_for_clickhousekeeperclient
fix docs for clickhouse-keeper-client
2024-08-07 12:00:15 +00:00
János Benjamin Antal
8b5c8e5361 Merge remote-tracking branch 'origin/master' into kafka-zookeeper 2024-08-07 11:33:21 +00:00
maxvostrikov
e8bf5129c0 fix docs for clickhouse-keeper-client
Starting with 24.7, paths are not accepted as bare strings, only as string literals - https://github.com/ClickHouse/ClickHouse/pull/65494
2024-08-07 12:44:31 +02:00
Ilya Yatsishin
47111eb5af
Merge pull request #67511 from ClickHouse/revert-67507-revert-66536-external-replace-to-null
Revert "Revert "Add settings to replace external engines to Null during create""
2024-08-07 10:38:04 +00:00
maxvostrikov
a9735f470c squash! fix docks for groupConcat function documentation had wrong usage of a function. Examples of usage - 763952bf36/tests/queries/0_stateless/03156_group_concat.sql 2024-08-07 12:09:26 +02:00
maxvostrikov
de41ffa18f fix docs for groupConcat function
The documentation showed wrong usage of the function. Examples of usage - 763952bf36/tests/queries/0_stateless/03156_group_concat.sql
2024-08-07 12:06:14 +02:00
Duc Canh Le
7fa6948884 Merge branch 'master' into drop_detached_partition_all
Fix CI
2024-08-07 09:20:06 +00:00
Blargian
b76e4acbc0 fix another typo 2024-08-07 11:13:56 +02:00
Blargian
4f2b1c36b7 Fix typo from previous PR 2024-08-07 11:12:09 +02:00
Blargian
5eb896b9f1 Add documentation for toDecimal32 and variants 2024-08-07 10:43:41 +02:00
lgbo-ustc
bd4d648f63 update doc 2024-08-07 15:15:32 +08:00
lgbo-ustc
81b9d6d3f5 add doc 2024-08-07 08:57:56 +08:00
lgbo-ustc
9845aeac0f support percent_rank in old analyzer 2024-08-07 08:57:56 +08:00
lgbo-ustc
b8c6beeb7a update doc 2024-08-07 08:57:56 +08:00
Nikita Taranov
053285dc1c Merge branch 'master' into keep_alive_max_reqs 2024-08-06 20:55:48 +01:00
Robert Schulze
915e6f6565
Merge pull request #67908 from Blargian/docs_toFloat
Update documentation of `toFloat32/64` and variants
2024-08-06 19:52:07 +00:00
Ilya Yatsishin
881d57a764
Merge branch 'master' into revert-67507-revert-66536-external-replace-to-null 2024-08-06 21:04:17 +02:00
Vitaly Baranov
bf33aabec4 Add documentation.
(cherry picked from commit 083fff6ed6)
2024-08-06 20:15:51 +02:00
Blargian
0bb8d1a8ed optimize formatting by 3% 2024-08-06 17:10:34 +02:00
robot-clickhouse
e2eeb6f180 Update version_date.tsv and changelogs after v24.6.3.95-stable 2024-08-06 12:54:27 +00:00
Robert Schulze
d09c82ff76
Cosmetics II 2024-08-06 12:36:09 +00:00
Blargian
5313c9539e update documentation for Float32/64 and variants 2024-08-06 14:34:06 +02:00
Andrey Zvonov
97f1cfc232 add ** glob to hdfs docs 2024-08-06 11:18:47 +00:00
Nikita Mikhaylov
b4a6f249ab
Merge pull request #66438 from ClickHouse/analyzer-beta
Analyzer is finally Beta
2024-08-06 09:39:29 +00:00
Duc Canh Le
c6c2fce9d2 update document
Signed-off-by: Duc Canh Le <duccanh.le@ahrefs.com>
2024-08-06 07:51:35 +00:00
Michael Kolupaev
42da208561 Merge remote-tracking branch 'origin/master' into mv4 2024-08-05 22:38:07 +00:00
robot-clickhouse
e4c71aec4d Update version_date.tsv and changelogs after v24.5.5.78-stable 2024-08-05 16:12:25 +00:00
Alexey Milovidov
aa66203f17 Update architecture.md 2024-08-05 15:29:14 +00:00
Nikita Mikhaylov
62d5e83f25 Double space... 2024-08-05 15:29:14 +00:00
Nikita Mikhaylov
cc5dd9830e Modify the docs 2024-08-05 15:29:14 +00:00
Nikita Mikhaylov
4e9c3baa25 Move analyzer to Beta stage 2024-08-05 15:29:14 +00:00
Yarik Briukhovetskyi
9b06753bb7
Merge branch 'master' into hive_style_partitioning 2024-08-05 17:02:27 +02:00
Alexey Milovidov
f1a096e8f6
Merge pull request #67723 from ClickHouse/fix-log-and-exception-messages-formatting
Fix test `00002_log_and_exception_messages_formatting`
2024-08-05 12:34:33 +00:00
János Benjamin Antal
8f6476adb7 Merge remote-tracking branch 'origin/master' into kafka-zookeeper 2024-08-05 12:26:00 +00:00
Alexey Milovidov
07f039677e
Merge pull request #67771 from ClickHouse/whitespaces
Whitespaces
2024-08-05 12:21:20 +00:00