Robert Schulze
c4720d9728
Minor doc fixups
2024-09-09 11:18:42 +00:00
JackyWoo
6539cbd1ce
Add CountMin to dictionary file and a little fixup
2024-09-09 17:34:15 +08:00
JackyWoo
eb800d9e39
Rename statistics of type count_min to countmin
2024-09-09 11:45:39 +08:00
Robert Schulze
a73eb1c177
Merge pull request #67013 from JackyWoo/add_statistics_minmax
...
Add `min_max` statistics type
2024-09-06 16:14:04 +00:00
JackyWoo
3974e9060a
Fix docs and some fixups
2024-09-06 18:50:02 +08:00
Robert Schulze
7a98f7fecc
Add testcase for ANN index usage with subquery
2024-09-04 10:21:19 +00:00
JackyWoo
a9b6c04705
Merge branch 'master' into add_statistics_minmax
...
# Conflicts:
# src/Storages/Statistics/Statistics.cpp
2024-08-29 14:35:33 +08:00
Robert Schulze
9fb4c23c06
Merge pull request #68678 from rschu1ze/usearch-2.14
...
Vector similarity index: make `bf16` the default quantization
2024-08-28 08:45:02 +00:00
vdimir
bb22736bc3
Merge pull request #68867 from ucasfl/url-engine
...
Add virtual column _headers for url table engine
2024-08-27 13:27:19 +00:00
divanik
3581eb34e7
Merge branch 'master' of github.com:ClickHouse/ClickHouse into divanik/add_local_and_azure_iceberg_support
2024-08-26 09:04:13 +00:00
flynn
e1e692a968
Fix and update docs
2024-08-25 08:00:16 +00:00
vdimir
642657d02c
Merge pull request #68612 from leonkozlowski/docs/fix-merge-tree-primary-key-docs
...
patch: fix reference to sorting key in primary key docs
2024-08-23 16:29:25 +00:00
Robert Schulze
c40c8b7adb
Enable bf16 + f64 quantization, make bf16 the default
2024-08-23 07:32:34 +00:00
JackyWoo
2502ca766f
Merge branch 'master' into add_statistics_minmax
...
# Conflicts:
# src/Storages/Statistics/ConditionSelectivityEstimator.cpp
# src/Storages/Statistics/Statistics.cpp
# src/Storages/Statistics/Statistics.h
# src/Storages/Statistics/StatisticsCountMinSketch.cpp
# src/Storages/Statistics/StatisticsCountMinSketch.h
# src/Storages/Statistics/StatisticsTDigest.cpp
# src/Storages/Statistics/StatisticsTDigest.h
# src/Storages/Statistics/StatisticsUniq.cpp
# src/Storages/Statistics/StatisticsUniq.h
2024-08-21 10:56:23 +08:00
János Benjamin Antal
677b28e1ac
Fix docs
2024-08-20 17:04:19 +02:00
leonkozlowski
e416a2b3d2
patch: fix reference to sorting key in primary key docs
2024-08-20 09:42:19 -04:00
divanik
ef1f0e2aaf
Fix typo in docs
2024-08-15 13:16:09 +00:00
divanik
6a96eb5a4e
Merge branch 'divanik/add_local_and_azure_iceberg_support' of github.com:ClickHouse/ClickHouse into divanik/add_local_and_azure_iceberg_support
2024-08-15 12:58:49 +00:00
Robert Schulze
38a2b0dcc7
Allow Array(Float64) as type of underlying column
2024-08-15 10:47:55 +00:00
Robert Schulze
6170a8663f
Bump usearch to 2.13.2
2024-08-14 08:04:00 +00:00
Daniil Ivanik
f9f41405cc
Merge branch 'master' into divanik/add_local_and_azure_iceberg_support
2024-08-13 18:24:02 +02:00
Robert Schulze
fb76cb90b1
Allow un-quoted skip index parameters
...
Previously, only this syntax to create a skip index worked:
INDEX index_name column_name TYPE vector_similarity('hnsw', 'L2Distance')
Now, this syntax will work as well:
INDEX index_name column_name TYPE vector_similarity(hnsw, L2Distance)
2024-08-12 15:32:25 +00:00
Robert Schulze
d2e79f0b92
Rework vector index parameters
...
USearch (similar to FAISS) allows to specify the distance function,
quantization, and various HNSW meta-parameters for index creation and
sarch. Some users wished for greater configurability, so let's expose
them.
Index creation now requires either
- 2 parameters (with the other 4 parameters taking on default values), or
- 6 parameters for full control
This commit also remove quantization `f64` (that would be upsampling).
2024-08-12 15:32:19 +00:00
Robert Schulze
785b6637fa
Rename index type "usearch" to "vector_similarity"
...
First, index type "vector_similarity" is more speaking and user-friendly
than "usearch". Second, we should not expose the name of the library
doing the job (usearch). Of course, the docs will continue to mention
usearch (credit where credit is due).
Existing setting `allow_experimental_usearch_index` was marked obsolete.
A new settings `allow_experimental_vector_similarity_index` was added.
2024-08-12 15:30:45 +00:00
Robert Schulze
40bed3e20f
Remove support for WHERE-type queries
...
These kind of vector search similarity queries are rather obscure and
rare in practice. They require the user to specify a maximum distance
which is not intuitive to obtain. Furthermore, these queries are not
natively supported in USearch, so the vector search index had to emulate
these queries.
Therefore simplifying the code base and restricting vector search to
ORDER-BY queries only.
2024-08-12 15:25:52 +00:00
Robert Schulze
218421c255
Remove Annoy indexes
...
Annoy indexes fell out of favor in the community, at least when it comes
to vector databases. Such indexes work okay-ish low dimensions but they
suffers badly from a curse of dimensionality which makes them inapt for
a high number of dimensions.
Now that Annoy is gone, issue (*) also disappears and we can drop
'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.
(*) spotify/annoy#456
2024-08-12 15:24:49 +00:00
divanik
61e616204e
Docs correct typos
2024-08-12 14:21:42 +00:00
divanik
58dd01d8b6
Add docs
2024-08-12 13:45:00 +00:00
Justin de Guzman
0071765138
Merge pull request #67940 from ClickHouse/prometheus-documentation
...
Add documentation for Prometheus protocols and TimeSeries engine.
2024-08-09 01:03:26 +00:00
Robert Schulze
076c4a9ce9
Merge pull request #67930 from rschu1ze/fix-stat-assert
...
Fix stress test error with TDigest statistics
2024-08-08 16:34:58 +00:00
Robert Schulze
37641a0b4b
Merge remote-tracking branch 'ClickHouse/master' into fix-stat-assert
2024-08-08 08:57:22 +00:00
János Benjamin Antal
92be2db5b6
Merge remote-tracking branch 'origin/master' into kafka-zookeeper
2024-08-08 08:01:43 +00:00
Kseniia Sumarokova
315fd5496a
Merge pull request #65386 from skyoct/feat-s3-field
...
Feat add _etag for object storage
2024-08-07 17:35:43 +00:00
Vitaly Baranov
bf33aabec4
Add documentation.
...
(cherry picked from commit 083fff6ed6
)
2024-08-06 20:15:51 +02:00
Robert Schulze
d09c82ff76
Cosmetics II
2024-08-06 12:36:09 +00:00
JackyWoo
8259a9827e
Update reference file for tests
2024-08-06 10:29:42 +08:00
JackyWoo
46da03030c
Add test for implicitly type conversion
2024-08-06 10:05:45 +08:00
JackyWoo
4fa30da118
Fix docs
2024-08-06 09:56:38 +08:00
JackyWoo
a36424fc8c
Add supported data types to documents
2024-08-05 18:59:09 +08:00
Robert Schulze
7765ff6d52
Minor fixups
2024-08-05 07:51:58 +00:00
JackyWoo
d1305d9fad
Some fixups and split tests
2024-08-05 11:26:57 +08:00
János Benjamin Antal
7aff8748b0
Address small review comments
2024-07-31 18:08:19 +00:00
János Benjamin Antal
23fa85e3ff
Apply suggestions from code review
...
Co-authored-by: Kseniia Sumarokova <54203879+kssenii@users.noreply.github.com>
2024-07-31 19:30:58 +02:00
JackyWoo
43a3f717be
Merge branch 'master' into add_statistics_minmax
2024-07-31 15:15:12 +08:00
János Benjamin Antal
b9670c782f
Merge remote-tracking branch 'origin/master' into kafka-zookeeper
2024-07-29 14:42:41 +00:00
Shri Bodas
d0c4c4151c
Update keepermap.md
...
Needs quotes around keeper path
2024-07-25 14:24:28 -07:00
JackyWoo
5ae356e6df
Add document for min_max statistics
2024-07-24 17:54:48 +08:00
JackyWoo
245359e536
Merge branch 'master' into add_statistics_cmsketch
2024-07-16 09:45:31 +08:00
János Benjamin Antal
1ecfba837e
Rename experimental flag to allow_experimental_kafka_offsets_storage_in_keeper
2024-07-15 09:03:05 +00:00
János Benjamin Antal
4f98df7f49
Merge remote-tracking branch 'origin/master' into kafka-zookeeper
2024-07-15 08:32:29 +00:00