Commit Graph

62747 Commits

Author SHA1 Message Date
Kruglov Pavel
5445669d4e
Don't create new dynamic path for null values 2024-08-15 20:55:36 +02:00
Kruglov Pavel
6c085e0a93
Fix empty types check 2024-08-15 20:51:46 +02:00
Alexey Milovidov
4e5b53aef5
Merge pull request #68309 from ClickHouse/pr-min-marks-to-read-overflow
Fix: min marks to read overflow with parallel replicas
2024-08-15 18:26:51 +00:00
Kseniia Sumarokova
a5cc3a3a9c
Update log message 2024-08-15 20:03:22 +02:00
Anton Popov
75f951dae5 limit task time for mutations 2024-08-15 17:59:31 +00:00
Alexey Milovidov
e5665c9f59
Merge pull request #68082 from ClickHouse/fix-trash-in-aws-s3
Fix trash (low-quality code) in AWS S3
2024-08-15 17:41:21 +00:00
Kseniia Sumarokova
f7d31d151e
Merge pull request #68353 from ClickHouse/add-try-reserve-failure-reason-to-log
Add debug info for `00180_no_seek_avoiding_when_reading_from_cache`
2024-08-15 16:24:25 +00:00
avogar
65493ccf12 Remove unused error code 2024-08-15 15:57:21 +00:00
avogar
7f0406a726 Remove JSONEmpty function and allow JSON in empty function 2024-08-15 15:53:09 +00:00
János Benjamin Antal
4ac90f0cbf
Merge pull request #68386 from ClickHouse/revert-68163-backups-processes
Revert "[RFC] Fix settings/current_database in system.processes for async BACKUP/RESTORE"
2024-08-15 15:23:49 +00:00
Nikolai Kochetov
4548957f5a Add default status to not-prepared processors. 2024-08-15 15:07:07 +00:00
kssenii
f6e1eb1643 Fix style check 2024-08-15 16:31:48 +02:00
Nikolai Kochetov
fc5a19949f
Merge pull request #68291 from ClickHouse/add-cancelletion-reason-to-pipeline-executor
Add execution status to PipelineExecutor.
2024-08-15 14:31:26 +00:00
kssenii
7d01c31312 Delete old code of named collections 2024-08-15 16:01:13 +02:00
Kruglov Pavel
1870f05207
Merge branch 'master' into json-data-type 2024-08-15 14:52:41 +02:00
Robert Schulze
7acf74437f
Fix spelling 2024-08-15 12:29:59 +00:00
Han Fei
03bfb1562b fix overflow 2024-08-15 14:26:01 +02:00
Kruglov Pavel
f539507592
Merge branch 'master' into variant_inference 2024-08-15 14:23:20 +02:00
kssenii
7e006f6581 Update delta lake test 2024-08-15 14:09:48 +02:00
Kruglov Pavel
16a5724045
Merge pull request #67794 from bigo-sg/opt_orc_string
Trivial optimize on ORC/Parquet string reading
2024-08-15 12:07:56 +00:00
Kseniia Sumarokova
65cbe94d54
Update DeltaLakeMetadata.cpp 2024-08-15 14:05:33 +02:00
Nikita Taranov
418c3fa658
Merge pull request #63677 from KevinyhZou/improve_hash_join_by_reduce_vector_emplace
Improve all join performance by append `RowRefList` or `RowRef` to AddedColumns for lazy output
2024-08-15 12:04:36 +00:00
avogar
8db2c65595 Merge branch 'master' of github.com:ClickHouse/ClickHouse into better-datetime-inference 2024-08-15 11:24:14 +00:00
Robert Schulze
38a2b0dcc7
Allow Array(Float64) as type of underlying column 2024-08-15 10:47:55 +00:00
Robert Schulze
b9548504d9
Cosmetics 2024-08-15 10:47:55 +00:00
Robert Schulze
6739320bdc
Don't call 'validate' ('make' calls it under the hood already) 2024-08-15 10:47:55 +00:00
Anton Popov
a552747082 Merge remote-tracking branch 'upstream/master' into HEAD 2024-08-15 10:42:23 +00:00
shiyer7474
b82c231886 Code review feedback - used removeNullable() 2024-08-15 10:37:13 +00:00
Igor Nikonov
880954ea58
Merge branch 'master' into pr-lost-decimal-conversion 2024-08-15 12:28:39 +02:00
Konstantin Bogdanov
014c1650f8
Merge pull request #68326 from canhld94/fix_sync_replica_format
Fix missing sync replica mode in query `SYSTEM SYNC REPLICA`
2024-08-15 10:01:21 +00:00
avogar
bea8e65f4f Fix tests 2024-08-15 09:48:28 +00:00
Han Fei
9f6e472b0c process regexp flags correctly 2024-08-15 11:47:41 +02:00
János Benjamin Antal
3af8ba2deb
Revert "[RFC] Fix settings/current_database in system.processes for async BACKUP/RESTORE" 2024-08-15 11:21:20 +02:00
kssenii
8950491fa5 Fix unit test build 2024-08-15 11:18:15 +02:00
Kseniia Sumarokova
172d379a83
Merge pull request #68356 from ClickHouse/fix-scheduler-data-race
Fix data race in `DynamicResourceManager::updateConfiguration`
2024-08-15 09:07:19 +00:00
avogar
0bb076a4d3 Improve schema inference of date times 2024-08-15 08:08:00 +00:00
avogar
fed37d24f0 Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-15 08:02:15 +00:00
Yakov Olkhovskiy
f0845eeacb
Merge pull request #68262 from bigo-sg/fix_68162
Fix crash in lag/lead
2024-08-15 06:58:18 +00:00
megao
c9bfff3934 fix progress value of view_refreshview 2024-08-15 10:52:17 +08:00
megao
7bebc448f3 fix progress value of view_refreshview 2024-08-15 10:42:22 +08:00
Alexey Milovidov
5cf3cfc3d7 Merge branch 'master' into format-settings-parsing 2024-08-15 03:50:21 +02:00
Alexey Milovidov
dbbd2c7875
Merge pull request #68248 from rschu1ze/update-usearch
Bump usearch to v2.13.2
2024-08-15 01:24:17 +00:00
Alexey Milovidov
e9d6b2a4b6
Merge pull request #58289 from ClickHouse/vdimir/followup-56996
Followup for #56996
2024-08-15 01:21:32 +00:00
lgbo-ustc
690c4d0803 update 2024-08-15 09:05:33 +08:00
Miсhael Stetsyuk
cef5d80090
Merge pull request #68361 from ClickHouse/rename-S3DiskNoKeyErrors-to-DiskS3NoSuchKeyErrors
Rename: `S3DiskNoKeyErrors` -> `DiskS3NoSuchKeyErrors`
2024-08-15 00:08:38 +00:00
Anton Popov
9f44cd8592 fix typo 2024-08-14 22:19:17 +00:00
Anton Popov
08b768476f Merge remote-tracking branch 'upstream/master' into HEAD 2024-08-14 22:18:15 +00:00
Michael Kolupaev
e00357155d Merge remote-tracking branch 'origin/master' into qast 2024-08-14 21:52:48 +00:00
Michael Kolupaev
43f4a3665d Check that setProcessListElement() is not called on global context 2024-08-14 21:35:11 +00:00
Igor Nikonov
03aa07d94d Merge remote-tracking branch 'origin/master' into pr-min-marks-to-read-overflow 2024-08-14 21:00:31 +00:00
Anton Popov
6e42dfd761
Merge pull request #68015 from CurtizJ/merges-profile-events
Add profile events for merges
2024-08-14 20:59:49 +00:00
Igor Nikonov
aa38024b0e Fix UBSan: lower upper bound for min_marks_for_concurrent_read 2024-08-14 20:59:08 +00:00
Michael Kolupaev
3ca00440af Fix off-by-one inline function info in stack traces 2024-08-14 20:51:23 +00:00
Kruglov Pavel
da4da13667
Fix style 2024-08-14 22:50:14 +02:00
Igor Nikonov
d290aed56d
Merge branch 'master' into pr-lost-decimal-conversion 2024-08-14 22:48:34 +02:00
avogar
7251bc59f8 Fix tidy build 2024-08-14 20:22:19 +00:00
Michael Stetsyuk
2e5f45a7ad rename: S3DiskNoKeyErrors -> DiskS3NoSuchKeyErrors 2024-08-14 17:53:33 +00:00
Alexey Milovidov
9786f07f11 Merge branch 'master' into fix-trash-in-aws-s3 2024-08-14 19:48:31 +02:00
Han Fei
207d160dd2 address comments 2024-08-14 19:01:54 +02:00
kssenii
209d4eb016 Fix build 2024-08-14 19:00:17 +02:00
Alexey Milovidov
e11018b955
Merge pull request #68280 from ClickHouse/vdimir/fix-join-ut
Fix FullSortingJoinTest.AsofGreaterGeneratedTestData with empty data
2024-08-14 16:42:38 +00:00
serxa
cf58e8c1e3 fix data race in DynamicResourceManager::updateConfiguration 2024-08-14 16:35:45 +00:00
Han Fei
acc88af2cb fix style 2024-08-14 18:27:20 +02:00
Han Fei
23bb47293d cache result column with ptr hex string 2024-08-14 18:13:00 +02:00
Han Fei
c2f6751daf fix bug in key condition 2024-08-14 17:16:16 +02:00
kssenii
07c4a072fe Add debug logging 2024-08-14 16:50:51 +02:00
Kseniia Sumarokova
8cdbe6333b
Merge pull request #67664 from 1on/master
Fix error on generated columns in MaterializedPostgreSQL
2024-08-14 13:43:40 +00:00
jsc0218
f9bafdc510
Merge pull request #66169 from jsc0218/LWDRebuildProj
Add Rebuild Option In Projection and LWD
2024-08-14 13:27:32 +00:00
avogar
fc23f1c1ff Fix tests 2024-08-14 12:20:33 +00:00
Nikita Taranov
3c5d588302
Merge pull request #62003 from nickitat/cpu_cgroup_aware
Fix cpu metric collection in cgroups
2024-08-14 11:08:01 +00:00
Kseniia Sumarokova
2b1fce007f
Merge pull request #68288 from ClickHouse/try-fix-postgres-crash
Fix postgres crash
2024-08-14 11:01:35 +00:00
Yakov Olkhovskiy
f4280203a0
Merge pull request #63781 from ClickHouse/ft-chunked-protocol
Add chunked wrapper to native protocol
2024-08-14 10:35:20 +00:00
Igor Nikonov
6cfbbd665b Merge remote-tracking branch 'origin/master' into pr-min-marks-to-read-overflow 2024-08-14 10:22:57 +00:00
Igor Nikonov
0e0272b2ff Better check for overflow
+ limit min_marks_for_concurrent_read
2024-08-14 10:21:23 +00:00
Duc Canh Le
8d7319ccab fix wrong format of SYSTEM SYNC REPLICA query
Signed-off-by: Duc Canh Le <duccanh.le@ahrefs.com>
2024-08-14 10:19:37 +00:00
robot-clickhouse
992e70337f
Merge pull request #68306 from ClickHouse/bump_version_24.9.1.1
Update version after release
2024-08-14 09:55:18 +00:00
Nikita Taranov
95e07ce7b3
Merge branch 'master' into improve_hash_join_by_reduce_vector_emplace 2024-08-14 11:43:37 +02:00
avogar
ca5fd57db7 Fix builds 2024-08-14 09:35:54 +00:00
李扬
2eaa9b9ae8
Merge branch 'ClickHouse:master' into opt_lc_name 2024-08-14 16:58:13 +08:00
Antonio Andelic
24faabb57b
Merge pull request #68282 from ClickHouse/fix-min-max-time-timeseries
Fix min/max time columns in TimeSeries table
2024-08-14 08:04:58 +00:00
Robert Schulze
6170a8663f
Bump usearch to 2.13.2 2024-08-14 08:04:00 +00:00
Nikolai Kochetov
38919ed351 Merge branch 'master' into add-cancelletion-reason-to-pipeline-executor 2024-08-14 08:03:56 +00:00
Antonio Andelic
f740cf4eaa Fix data race on SampleKey 2024-08-14 09:54:03 +02:00
Alexey Milovidov
393b16bd8e Merge branch 'master' into fix-trash-in-aws-s3 2024-08-14 04:55:05 +02:00
Alexey Milovidov
ecb08a0d04 Merge branch 'master' into format-settings-parsing 2024-08-14 04:51:51 +02:00
Michael Kolupaev
e302e2ef32 Conflicts 2024-08-14 01:15:16 +00:00
Michael Kolupaev
ab843c3cf5 Merge remote-tracking branch 'origin/master' into mv4 2024-08-14 00:29:48 +00:00
avogar
46f41d1718 Fix 2024-08-13 23:51:49 +00:00
avogar
217963757e Fix 2024-08-13 23:21:24 +00:00
avogar
835fc9ca76 Add more tests, fix conflicts 2024-08-13 22:37:46 +00:00
Igor Nikonov
98024a723f
Merge pull request #67891 from canhld94/fix_exception_in_destructor
Catch exception in destructor of LocalFileHolder
2024-08-13 21:35:57 +00:00
avogar
55ede2dab6 Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-13 21:19:17 +00:00
Igor Nikonov
b9ffa929ba Fix: min marks to read overflow with parallel replicas 2024-08-13 21:08:53 +00:00
Michael Kolupaev
5767d7d1b3 Merge remote-tracking branch 'origin/master' into qast 2024-08-13 20:56:06 +00:00
Michael Kolupaev
0133806e6c Fix 2024-08-13 20:55:51 +00:00
avogar
24ddea6945 Fix tests 2024-08-13 20:49:16 +00:00
Max Kainov
a9226f49e7 remove name with cyrillic letter 2024-08-13 20:24:40 +00:00
Michael Kolupaev
f4b8a98d06
Merge pull request #68249 from ClickHouse/rset
Fix 'Refresh set entry already exists'
2024-08-13 20:07:31 +00:00
Michael Kolupaev
074cd7371a Merge remote-tracking branch 'origin/master' into mv4 2024-08-13 20:04:17 +00:00
robot-clickhouse
6af5fedf42 Update autogenerated version to 24.9.1.1 and contributors 2024-08-13 19:26:35 +00:00
Kruglov Pavel
6c8ad63104
Merge pull request #68132 from Avogar/better-dynamic
Reimplement Dynamic type
2024-08-13 19:21:52 +00:00
avogar
c75112827c Fis style 2024-08-13 18:05:27 +00:00
Anton Popov
9833ef0bed slightly better 2024-08-13 17:50:13 +00:00
avogar
f82822c993 Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-13 17:14:30 +00:00
avogar
6d2f17ea3b Add more tests 2024-08-13 17:12:38 +00:00
Daniil Ivanik
e02b434d2f
Merge pull request #68238 from ClickHouse/divanik/remove_useless_logs_about_zero_replication
Remove unnecessary logs in MergeTreeData.cpp
2024-08-13 16:21:32 +00:00
Alexander Tokmakov
eddf948246
Merge pull request #68250 from ClickHouse/tavplubix-patch-10
Replace segfault in Replicated database with logical error
2024-08-13 16:15:09 +00:00
Yarik Briukhovetskyi
99503dfb03
Merge branch 'ClickHouse:master' into interval_type_conversion 2024-08-13 18:06:44 +02:00
Robert Schulze
88f2d2e67d
Bump usearch to v2.7.8 2024-08-13 15:53:21 +00:00
Robert Schulze
98c18eb341
Bump usearch to 2.4.1 2024-08-13 15:53:11 +00:00
Nikolai Kochetov
c5ae139c97 Cleanup. 2024-08-13 15:18:07 +00:00
Nikita Taranov
dd3fa7c3b6
Merge pull request #61793 from nickitat/keep_alive_max_reqs
Implement `max_keep_alive_requests` setting for server
2024-08-13 15:06:43 +00:00
Robert Schulze
2ffcc97af2
Merge pull request #63675 from rschu1ze/vector-search
Initial implementation of vector similarity index
2024-08-13 15:06:20 +00:00
vdimir
4fad666a8b
Merge branch 'master' into vdimir/followup-56996 2024-08-13 17:00:07 +02:00
kssenii
5a6090ad05 Fix 2024-08-13 16:48:12 +02:00
Nikolai Kochetov
04286bc270 Add status to PipelineExecutor. Verify status of pusing pipeline. 2024-08-13 14:45:05 +00:00
Yakov Olkhovskiy
3e8a177622
Merge pull request #61908 from ClickHouse/ci-fuzzer-enable
CI: enable libfuzzer (fixing build and docker)
2024-08-13 14:22:09 +00:00
avogar
710cf1a223 Calculate statistics for paths in shared data 2024-08-13 13:55:57 +00:00
Antonio Andelic
973b240579 Fix min/max time columns 2024-08-13 15:40:45 +02:00
Peter Nguyen
94604126fe
Merge branch 'ClickHouse:master' into add_create_if_not_exists_setting 2024-08-13 07:25:26 -06:00
Yarik Briukhovetskyi
39c25663ae
Merge pull request #67879 from bigo-sg/opt_orc_writer
Avoid allocating unnecessary capacity for array column while writing orc & some minor refactors
2024-08-13 12:51:11 +00:00
vdimir
d2be1bf693
Fix FullSortingJoinTest.AsofGreaterGeneratedTestData with empty data 2024-08-13 12:33:44 +00:00
kevinyhzou
85bd63a2ac rebase and resolve conflict 2024-08-13 19:16:32 +08:00
Robert Schulze
99282e526a
Merge pull request #68235 from sakulali/query_cache_tag
QueryCache: Add tagging
2024-08-13 10:44:10 +00:00
Yarik Briukhovetskyi
086c0f03a6
Merge pull request #65997 from yariks5s/hive_style_partitioning
Implementing Hive-style partitioning
2024-08-13 10:04:21 +00:00
Yarik Briukhovetskyi
d82e9b79da
Merge branch 'ClickHouse:master' into interval_type_conversion 2024-08-13 12:01:19 +02:00
taiyang-li
e46c5a75ef fix building issue 2024-08-13 17:59:05 +08:00
avogar
2896dd51d8 Implemenet prepareForSquashing for ColumnObject 2024-08-13 09:52:43 +00:00
shiyer7474
7f005a6ca4 Fix small value DateTime64 constant folding in nested query 2024-08-13 08:38:30 +00:00
lgbo-ustc
2a51b6c403 fix crash in lag/lead 2024-08-13 14:01:41 +08:00
taiyang-li
51a65dca49 fix conflicts 2024-08-13 11:45:48 +08:00
Yakov Olkhovskiy
93a922f169 merge master 2024-08-13 03:28:25 +00:00
pufit
ae5223854f
Merge pull request #67653 from ClickHouse/pufit/inconsistent-formating-grant-current-grants
Fix inconsistent formatting for `GRANT CURRENT GRANTS`
2024-08-13 03:21:26 +00:00
Alexey Milovidov
b80dd738b0 Fix error 2024-08-13 02:41:00 +02:00
Alexey Milovidov
995b29cb07 Merge branch 'master' of github.com:ClickHouse/ClickHouse into more-strict-tryparse 2024-08-13 02:38:40 +02:00
Michael Kolupaev
5e6f728248 safeGet 2024-08-13 00:35:40 +00:00
Michael Kolupaev
5d8ee17166 Merge remote-tracking branch 'origin/master' into qast 2024-08-12 23:56:11 +00:00
Michael Kolupaev
ddd5a96950 Fix 2024-08-12 23:56:02 +00:00
Michael Kolupaev
f12609440f fashion 2024-08-12 23:09:57 +00:00
Alexey Milovidov
30b2cb606f Merge branch 'master' of github.com:ClickHouse/ClickHouse into format-settings-parsing 2024-08-13 00:01:01 +02:00
Alexey Milovidov
aa38315673 Merge branch 'master' of github.com:ClickHouse/ClickHouse into fix-trash-in-aws-s3 2024-08-12 23:54:45 +02:00
Kruglov Pavel
01b2e67f2e
Merge branch 'master' into json-data-type 2024-08-12 23:32:35 +02:00
avogar
83cb991f75 Fix special build 2024-08-12 21:30:30 +00:00
avogar
8136e6a452 Update new prepareForSquashing method for ColumnDynamic 2024-08-12 21:29:26 +00:00
János Benjamin Antal
ac6826392d
Merge pull request #67554 from ClickHouse/fix-message-queue-sink-from-http-interface
Fix message queue sink from http interface
2024-08-12 21:29:14 +00:00
Anton Popov
dccf34dc95 fix drift of profile event time 2024-08-12 21:20:57 +00:00
Michael Kolupaev
15ca86db7a Merge remote-tracking branch 'origin/master' into mv4 2024-08-12 21:17:45 +00:00
Michael Kolupaev
c218560639 Don't enable allow_materialized_view_with_bad_select yet, someone has to add support for fixture reuse in test_replicated_database first 2024-08-12 21:14:42 +00:00
János Benjamin Antal
6eb4a71ad3
Merge pull request #68163 from azat/backups-processes
[RFC] Fix settings/current_database in system.processes for async BACKUP/RESTORE
2024-08-12 21:07:55 +00:00
Alexander Tokmakov
5a683796a0
Update DatabaseReplicated.cpp 2024-08-12 22:34:14 +02:00
avogar
03182c7a8f Merge branch 'master' of github.com:ClickHouse/ClickHouse into better-dynamic 2024-08-12 20:29:24 +00:00
Michael Kolupaev
469c1698b0 Fix 'Refresh set entry already exists' 2024-08-12 19:32:13 +00:00
Han Fei
40382451a2
Merge pull request #68186 from rschu1ze/stats-tests-refactoring
Refactor tests for (experimental) statistics
2024-08-12 18:58:19 +00:00
Robert Schulze
c22265b889
Some fixups 2024-08-12 17:45:38 +00:00
avogar
f8011d53d6 Fix data types parsing 2024-08-12 17:14:43 +00:00
Kruglov Pavel
ba85cc8d59
Merge pull request #67043 from Avogar/improve-squashing
Improve columns squashing for String/Array/Map/Variant/Dynamic types
2024-08-12 17:14:15 +00:00
avogar
0c209242b1 Fix build 2024-08-12 16:28:18 +00:00
Nikita Taranov
f7af4c5643 don't report system-wide metrics when cgroup metrics present 2024-08-12 17:27:43 +01:00
Robert Schulze
fe537045c9
Merge remote-tracking branch 'ClickHouse/master' into query_cache_tag 2024-08-12 16:16:32 +00:00
Yarik Briukhovetskyi
3a6e05eb43
try to fix includes 2024-08-12 18:03:42 +02:00
avogar
44e267ec76 Fix conflicts and style check 2024-08-12 15:58:40 +00:00
avogar
cfd854882b Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-12 15:57:57 +00:00
avogar
44d4784da5 Use std::string_view for lookups in hash table 2024-08-12 15:52:39 +00:00
Yarik Briukhovetskyi
ea1cd66575
fix tidy 2024-08-12 17:32:43 +02:00
Robert Schulze
fb76cb90b1
Allow un-quoted skip index parameters
Previously, only this syntax to create a skip index worked:

   INDEX index_name column_name TYPE vector_similarity('hnsw', 'L2Distance')

Now, this syntax will work as well:

  INDEX index_name column_name TYPE vector_similarity(hnsw, L2Distance)
2024-08-12 15:32:25 +00:00
Robert Schulze
d2e79f0b92
Rework vector index parameters
USearch (similar to FAISS) allows to specify the distance function,
quantization, and various HNSW meta-parameters for index creation and
sarch. Some users wished for greater configurability, so let's expose
them.

Index creation now requires either
- 2 parameters (with the other 4 parameters taking on default values), or
- 6 parameters for full control

This commit also remove quantization `f64` (that would be upsampling).
2024-08-12 15:32:19 +00:00
Robert Schulze
cc5c64e1ed
Add migration helper for legacy 'annoy' and 'usearch' indexes types
Index types 'annoy' and 'usearch' were removed and replaced by
'vector_similarity' indexes in an earlier commit.

This means unfortuantely, that if customers have tables with these
indexes and upgrade, their database might not start anymore - the
system loads the metadata at startup, thinks something is wrong with
such tables, and halts immediately.

This commit adds support for loading and attaching such indexes back.
Data insert or use (search) return an error which recommends a migration
to 'vector_similarity' indexes. The implementation is generally similar
to what has recently been implemented for 'full_text' indexes [1, 2].

[1] https://github.com/ClickHouse/ClickHouse/pull/64656
[2] https://github.com/ClickHouse/ClickHouse/pull/64846
2024-08-12 15:31:27 +00:00
Robert Schulze
785b6637fa
Rename index type "usearch" to "vector_similarity"
First, index type "vector_similarity" is more speaking and user-friendly
than "usearch". Second, we should not expose the name of the library
doing the job (usearch). Of course, the docs will continue to mention
usearch (credit where credit is due).

Existing setting `allow_experimental_usearch_index` was marked obsolete.
A new settings `allow_experimental_vector_similarity_index` was added.
2024-08-12 15:30:45 +00:00
Robert Schulze
021fad920e
Cosmetics: minor stuff 2024-08-12 15:30:41 +00:00
Robert Schulze
2aa037985b
Cosmetics: simplify inheritance hierarchy 2024-08-12 15:30:38 +00:00
Robert Schulze
901906159d
Cosmetics: ApproximateNearestNeighborInformation --> Info + nest in class 2024-08-12 15:30:35 +00:00
Robert Schulze
6170aad43e
Cosmetics: ApproximateNearestNeighborIndexesCommon --> VectorSimilarityCondition 2024-08-12 15:30:30 +00:00
Robert Schulze
e20eff635e
Cosmetics: variable naming 2024-08-12 15:30:27 +00:00
Robert Schulze
1bf320a1a8
Cosmetics: metric --> distance_function (for consistent terminology) 2024-08-12 15:30:24 +00:00
Robert Schulze
3f47b42d71
Remove funny typedef 2024-08-12 15:30:21 +00:00
Robert Schulze
fb26a9e6d4
Cosmetics: whitespaces 2024-08-12 15:30:18 +00:00
Robert Schulze
0f1765a273
Cosmetics: function naming 2024-08-12 15:30:14 +00:00
Robert Schulze
a8167abca2
Cosmetics: use native types/functions 2024-08-12 15:30:10 +00:00
Robert Schulze
9ad890e399
Cosmetics: whitespaces 2024-08-12 15:30:07 +00:00
Robert Schulze
27a6931a35
Cosmetics: variable naming 2024-08-12 15:29:59 +00:00
Robert Schulze
289c27c804
Introduce version for for index files in persistence 2024-08-12 15:29:02 +00:00
Robert Schulze
74de79e52b
Addd logging of basic statistics 2024-08-12 15:28:46 +00:00
Kruglov Pavel
bba4a90a9c
Merge branch 'master' into better-dynamic 2024-08-12 17:28:09 +02:00
Robert Schulze
8853b3359b
Remove useless templatization
Makes the code cleaner, compile faster, and the binary smaller.
2024-08-12 15:27:06 +00:00
Nikita Taranov
57a614857c address review comments 2024-08-12 16:27:01 +01:00
Robert Schulze
4f23f7754b
Cosmetics 2024-08-12 15:26:05 +00:00
Robert Schulze
7f611681df
Add a similar sanity check as in other skipping indexes 2024-08-12 15:26:01 +00:00
Robert Schulze
f944ef25bb
Better handling of errors during add, search, and save 2024-08-12 15:25:58 +00:00
Robert Schulze
40bed3e20f
Remove support for WHERE-type queries
These kind of vector search similarity queries are rather obscure and
rare in practice. They require the user to specify a maximum distance
which is not intuitive to obtain. Furthermore, these queries are not
natively supported in USearch, so the vector search index had to emulate
these queries.

Therefore simplifying the code base and restricting vector search to
ORDER-BY queries only.
2024-08-12 15:25:52 +00:00
Robert Schulze
abb8e61981
Remove support code for Lp norm in vector search
It is a generalization of other norms, too expensive to calculate and
not relevant in practice. Also, Usearch doesn't support it.
2024-08-12 15:25:48 +00:00
Robert Schulze
65186f0b69
Remove tuple support
Indexes for approximate nearest neighbourhood (ANN) search (USearch) can
be build on columns of type Array(Float32) or Tuple(Float32[, Float32[, ...]]).
In practice, Arrays(Float32) is the only relevant data type.
Arrays store high-dimensional embeddings consecutively (--> cache
locality) and the additional flexibility of different data types in a
tuple is not needed for vector search.

Therefore removing support for ANN indexes over tuple columns to
simplify the code, tests and docs.
2024-08-12 15:25:39 +00:00
Robert Schulze
218421c255
Remove Annoy indexes
Annoy indexes fell out of favor in the community, at least when it comes
to vector databases. Such indexes work okay-ish low dimensions but they
suffers badly from a curse of dimensionality which makes them inapt for
a high number of dimensions.

Now that Annoy is gone, issue (*) also disappears and we can drop
'no-ubsan', 'no-cpu-aarch64', and 'no-asan' from tests.

(*) spotify/annoy#456
2024-08-12 15:24:49 +00:00
Robert Schulze
d7211f9d12
Fix CMake integration of usearch and annoy
Registers usearch and annoy properly via configure_config.cmake and
config.h.in like all other 3rd party libs, instead of (mis)using
target_compile_definitions.
2024-08-12 15:24:18 +00:00
Robert Schulze
a39b9cf643
Un-screw usearch's build description
No directory 'SimSIMD-map' exists, the build only worked because SimSIMD
support in usearch was (accidentally?) disabled. This commit corrects
the build description. SimSIMD support in usearch will be enabled by a
later commit.
2024-08-12 15:24:14 +00:00
divanik
eb3ffb7184 Add supportsReplication 2024-08-12 15:09:16 +00:00
Yarik Briukhovetskyi
1cc8457268
fix reviews, fix crash in fuzzer 2024-08-12 16:57:47 +02:00
Robert Schulze
85f63b056b
Merge pull request #68135 from ClickHouse/refactor-field-get
Only use Field::safeGet - Field::get prone to type punning
2024-08-12 14:25:11 +00:00
jsc0218
a9d19c7aca Merge remote-tracking branch 'origin/master' into LWDRebuildProj 2024-08-12 14:06:25 +00:00
avogar
8522776c33 Remove unused code 2024-08-12 13:49:01 +00:00
avogar
9834457c26 Fix copying arguments 2024-08-12 13:48:18 +00:00
avogar
a329456146 Fix review comments 2024-08-12 13:47:10 +00:00
Nikita Taranov
633b15d7a4 Merge branch 'master' into cpu_cgroup_aware 2024-08-12 14:18:36 +01:00
Robert Schulze
037a1006fd
Merge remote-tracking branch 'ClickHouse/master' into ci-fuzzer-enable 2024-08-12 12:28:32 +00:00
Anton Popov
3172bf8d76 better accounting of time for merge of projections 2024-08-12 12:23:32 +00:00
Nikita Taranov
2f546fb513
Merge pull request #68098 from aiven-sal/aiven-sal/segfault
Fix UB in hopEnd, hopStart, tumbleEnd, and tumbleStart
2024-08-12 12:09:23 +00:00
avogar
9296d4ecfb Merge branch 'master' of github.com:ClickHouse/ClickHouse into json-data-type 2024-08-12 11:55:39 +00:00
Sema Checherinda
5e836bc20e
Merge pull request #67472 from ClickHouse/chesema-02765
speed up system flush logs
2024-08-12 11:51:55 +00:00
avogar
38f3131e11 Fix review comments 2024-08-12 11:39:36 +00:00
Yarik Briukhovetskyi
8bc89ac8df
Merge branch 'master' into hive_style_partitioning 2024-08-12 11:44:45 +02:00
vdimir
5c54c7025b
Followup for #56996 2024-08-12 08:25:54 +00:00
Robert Schulze
0aa30b10d5
Merge pull request #68069 from rschu1ze/cmake-cleanup
Minor CMake cleanup
2024-08-12 06:43:00 +00:00
Robert Schulze
574c445be9
Refactor tests for (experimental) statistics 2024-08-12 05:56:16 +00:00
Alexey Milovidov
69978dbc6d Merge branch 'master' into format-settings-parsing 2024-08-12 07:11:26 +02:00
Alexey Milovidov
741e1da2bd Merge branch 'master' into fix-trash-in-aws-s3 2024-08-12 06:01:07 +02:00
Alexey Milovidov
1e15574a25 Fix ExternalDistributed 2024-08-12 05:53:43 +02:00
Alexey Milovidov
4866581a67 Fix DeltaLake partitioned by Bool when it is true 2024-08-12 04:59:21 +02:00
李扬
623c530f67
Merge branch 'ClickHouse:master' into opt_lc_name 2024-08-12 10:33:24 +08:00
sakulali
957a0b6ea4 Add a setting query_cache_tag 2024-08-12 08:40:01 +08:00
Alexey Milovidov
c2cb8e6c7d Merge branch 'master' of github.com:ClickHouse/ClickHouse into fix-trash-in-aws-s3 2024-08-12 01:04:17 +02:00
Peter Nguyen
868a1e78f2 Add support for new create_if_not_exists setting in InterpreterCreateQuery.cpp 2024-08-11 16:59:52 -06:00
Peter Nguyen
f90b8327be Update SettingsChangesHistory.cpp with new create_if_not_exists settings 2024-08-11 16:59:52 -06:00
Peter Nguyen
967bd05663 Add create_if_not_exists setting to Settings.h 2024-08-11 16:59:52 -06:00
Yakov Olkhovskiy
5c8665c660 fix system.kafka_consumers and doc, fix tidy 2024-08-11 20:40:55 +00:00
Igor Nikonov
51c2e136db
Merge branch 'master' into pr-lost-decimal-conversion 2024-08-11 21:27:18 +02:00
Azat Khuzhin
8a48b33344 Fix settings/current_database in system.processes for async BACKUP/RESTORE
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-08-11 21:27:08 +02:00
Yakov Olkhovskiy
8e706265e6 fix 2024-08-11 16:29:35 +00:00
Yakov Olkhovskiy
4fec61da55 fix wrong datatype in system.kafka_consumers 2024-08-11 12:35:27 +00:00
Alexander Tokmakov
53bc1b7e35
Revert "Refactor tests for (experimental) statistics" 2024-08-11 13:19:36 +02:00
Robert Schulze
45db564354
Merge pull request #68034 from rschu1ze/stats-tests-refactoring
Refactor tests for (experimental) statistics
2024-08-11 08:43:31 +00:00
Alexey Milovidov
ee3eec0a2a
Update Credentials.cpp 2024-08-11 04:47:26 +02:00
Yakov Olkhovskiy
e93584e741 fix Field conversion to IPv4 2024-08-10 23:02:30 +00:00
Igor Nikonov
9ce97e918b
Merge branch 'master' into patch-2 2024-08-11 00:07:46 +02:00
János Benjamin Antal
79a964bfff Merge remote-tracking branch 'origin/master' into fix-message-queue-sink-from-http-interface 2024-08-10 21:53:57 +00:00
János Benjamin Antal
7aaa038571
Merge pull request #67922 from ClickHouse/fix_trivial_count_non_deterministic_func
Fix wrong `count()` result when there is non-deterministic function in predicate
2024-08-10 21:13:12 +00:00
Yakov Olkhovskiy
80e9269963 allow UInt64 <-> Int64 conversion 2024-08-10 19:39:59 +00:00
alesapin
c945209427
Merge pull request #68003 from ClickHouse/ignore_broken_projections_on_start
Ignore disappeared projections on start
2024-08-10 19:34:40 +00:00
Anton Popov
a925faf8f9
Merge pull request #68053 from CurtizJ/enable-optimize_functions_to_subcolumns_2
Enable setting `optimize_functions_to_subcolumns` by default
2024-08-10 18:32:47 +00:00
Alexey Milovidov
7524b8f767 A slight improvement 2024-08-10 19:55:22 +02:00
Alexey Milovidov
dfedcae505 Merge branch 'master' of github.com:ClickHouse/ClickHouse into fix-trash-in-aws-s3 2024-08-10 19:52:21 +02:00
Alexey Milovidov
4c4d1e8c65
Merge pull request #67233 from ClickHouse/debug_exec_dict
Debug logging for #67002
2024-08-10 17:41:21 +00:00
Yakov Olkhovskiy
0a8fb05ece fix after merge 2024-08-10 16:23:23 +00:00
Antonio Andelic
3a2be13a93
Merge pull request #67975 from ClickHouse/update-minio-statless
Update minio in stateless tests
2024-08-10 16:15:14 +00:00
Antonio Andelic
16e52b547b
Merge pull request #68106 from ClickHouse/fix-with-retries-data-race
Fix race in `WithRetries`
2024-08-10 15:43:25 +00:00
jsc0218
70727c604a Merge remote-tracking branch 'origin/master' into LWDRebuildProj 2024-08-10 14:41:03 +00:00
Yakov Olkhovskiy
9a45c136b8 merge master 2024-08-10 13:53:32 +00:00
Yakov Olkhovskiy
c1b5b908ba hide Field::get in private, only use Field::safeGet 2024-08-10 13:01:55 +00:00
李扬
1e9e19bb1c
Merge branch 'ClickHouse:master' into opt_orc_writer 2024-08-10 20:28:31 +08:00
avogar
d243feea21 Fix special builds 2024-08-10 11:21:28 +00:00
Igor Nikonov
389ac9e84b Merge remote-tracking branch 'origin/master' into pr-lost-decimal-conversion 2024-08-10 10:47:57 +00:00
jsc0218
c26b3cb445 handle the case of packed storage 2024-08-10 02:27:23 +00:00