Commit Graph

7956 Commits

Author SHA1 Message Date
Robert Schulze
912663b719
Revert "Move CatBoost evaluation into clickhouse-library-bridge" 2022-08-31 20:54:43 +02:00
Robert Schulze
ca01286028
Merge pull request #39629 from ClickHouse/catboost-bridge
Move CatBoost evaluation into clickhouse-library-bridge
2022-08-31 16:16:11 +02:00
DanRoscigno
50f9b12af8 review feedback 2022-08-31 09:12:48 -04:00
Dan Roscigno
ddac1b3f11
Merge pull request #40560 from DanRoscigno/add-backup
Add backup restore docs
2022-08-31 08:19:00 -04:00
Robert Schulze
40468d3304
Fix typo in docs 2022-08-30 20:45:31 +00:00
Denny Crane
0d7cc82267
Update string-functions.md 2022-08-30 11:08:23 -03:00
DanRoscigno
f72b341e8b add status info 2022-08-30 09:34:08 -04:00
Alexander Tokmakov
6fdfb964d0
Revert "Add Annoy index" 2022-08-30 15:10:10 +03:00
Andrey Zvonov
93f9abf130
upd2 2022-08-30 14:41:40 +03:00
Andrey Zvonov
14adea8792
fix error in docs 2022-08-30 14:40:26 +03:00
Kseniia Sumarokova
c88db2ef97
Merge pull request #40751 from kssenii/fix-mysql-timeouts
Fix issue with mysql db / table function timeouts
2022-08-30 11:59:01 +02:00
Robert Schulze
cc4225109f
Merge pull request #37215 from Vector-Similarity-Search-for-ClickHouse/annoy-2
Test failures are unrelated, merging.
2022-08-30 09:25:57 +02:00
DanRoscigno
d712a91a20 add alternatives 2022-08-29 19:36:20 -04:00
DanRoscigno
0abeebd3ca updated with dev help 2022-08-29 19:29:10 -04:00
Alexey Milovidov
0190c56faf
Merge pull request #40770 from den-crane/patch-40
Doc. Fix cache dictionaries doc.
2022-08-30 02:19:32 +03:00
DanRoscigno
3a65d58c13 updated with dev help 2022-08-29 18:33:26 -04:00
Robert Schulze
64a6aa328e
fix: broken links in documentation (hopefully) 2022-08-29 20:27:06 +00:00
Robert Schulze
6b2b3c1eb3
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.

SQL syntax:

  SELECT
    catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
    ACTION AS target
  FROM amazon_train
  LIMIT 10

Required configuration:

  <catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>

*** Implementation Details ***

The internal protocol between the server and the library-bridge is
simple:

- HTTP GET on path "/extdict_ping":
  A ping, used during the handshake to check if the library-bridge runs.

- HTTP POST on path "extdict_request"
  (1) Send a "catboost_GetTreeCount" request from the server to the
      bridge, containing a library path (e.g /home/user/libcatboost.so) and
      a model path (e.g. /home/user/model.bin). Rirst, this unloads the
      catboost library handler associated to the model path (if it was
      loaded), then loads the catboost library handler associated to the
      model path, then executes GetTreeCount() on the library handler and
      finally sends the result back to the server. Step (1) is called once
      by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
      library path handler is unloaded in the beginning because it contains
      state which may no longer be valid if the user runs
      catboost("/path/to/model.bin", ...) more than once and if "model.bin"
      was updated in between.
  (2) Send "catboost_Evaluate" from the server to the bridge, containing
      the model path and the features to run the interference on. Step (2)
      is called multiple times (once per chunk) by the server from function
      FunctionCatBoostEvaluate::executeImpl(). The library handler for the
      given model path is expected to be already loaded by Step (1).

Fixes #27870
2022-08-29 20:26:45 +00:00
Dan Roscigno
76a45aa750
Merge branch 'master' into add-backup 2022-08-29 16:23:53 -04:00
Denny Crane
fe0f18f21d
Update external-dicts-dict-layout.md 2022-08-29 15:19:15 -03:00
Kseniia Sumarokova
c5c48e44ea
Merge branch 'master' into fix-mysql-timeouts 2022-08-29 19:33:29 +02:00
DanRoscigno
76a3212fc8 replace symlinks 2022-08-29 12:26:17 -04:00
DanRoscigno
c4b8137d31 replace symlinks 2022-08-29 12:19:50 -04:00
Filatenkov Artur
d73f661732
Merge branch 'master' into annoy-2 2022-08-29 17:33:13 +03:00
kssenii
0a6c4b9265 Fix 2022-08-29 16:20:53 +02:00
alesapin
7ce0afc0df
Merge pull request #40670 from Avogar/kafka
Add setting to disable limit on kafka_num_consumers
2022-08-29 10:53:35 +02:00
DanRoscigno
b50fa8b5a9 replace symlinks 2022-08-28 17:34:50 -04:00
Dan Roscigno
96cd94196e
Merge branch 'ClickHouse:master' into add-more-slugs 2022-08-28 12:06:37 -04:00
DanRoscigno
fad2e071eb replace symlinks with includes 2022-08-28 11:58:59 -04:00
DanRoscigno
5d1e3ee4d8 remove duplicate 2022-08-28 11:04:51 -04:00
DanRoscigno
5b5fcc56aa add slugs 2022-08-28 10:53:34 -04:00
Alexey Milovidov
ddff5e9145
Merge pull request #40712 from ClickHouse/ci-cmake-self-extractable
cmake: Disable default ENABLE_CLICKHOUSE_SELF_EXTRACTING
2022-08-28 17:12:19 +03:00
DanRoscigno
db1a3b717c add slugs 2022-08-28 09:58:27 -04:00
DanRoscigno
c4caa35cfd add frontmatter dashes 2022-08-28 09:53:52 -04:00
Robert Schulze
df934d8762
Merge pull request #40217 from zvonand/zvonand-minmax
Fix conversion Date32 / DateTime64 / Date to narrow types
2022-08-28 09:42:39 +02:00
Yakov Olkhovskiy
d96f32d655
Update developer-instruction.md 2022-08-27 23:46:09 -04:00
Alexey Milovidov
6e564b18bf
Merge pull request #40600 from FrankChen021/check_url_arg
Validate the CompressionMethod parameter of URL table engine
2022-08-27 19:29:55 +03:00
Alexey Milovidov
859defa687
Merge pull request #40680 from DanRoscigno/add-slugs-everywhere
Add slugs everywhere
2022-08-27 00:05:22 +03:00
Mikhail f. Shiryaev
12bbc4a276
Merge pull request #40681 from ClickHouse/obsolete-tgz
Fix clickhouse-server doinst.sh
2022-08-26 22:04:34 +02:00
DanRoscigno
79f8e1c03c add slugs to all docs 2022-08-26 15:49:26 -04:00
Mikhail f. Shiryaev
3f967b1822
Fix _includes/install/tgz.sh script for linux 2022-08-26 20:57:38 +02:00
Alexey Milovidov
eac54c0375
Merge pull request #40671 from vmihailenco/chore/uptrace
chore: add Uptrace tool and go-clickhouse client
2022-08-26 21:11:16 +03:00
FArthur-cmd
f3d0083e57 Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into annoy-2 2022-08-26 13:53:53 +00:00
Vladimir Mihailenco
d23d42251f chore: add Uptrace tool and go-clickhouse client 2022-08-26 16:04:33 +03:00
avogar
e31bba7731 Add docs 2022-08-26 12:52:15 +00:00
Peignon Melvyn
18c75d8ca9
Update settings.md
Fix according to: aaafb797c0/src/Core/Settings.h (L586)
2022-08-26 12:27:16 +02:00
DanRoscigno
ad56971e61 add text 2022-08-25 16:25:39 -04:00
Dan Roscigno
c9dea66f8d
Merge pull request #38308 from DanRoscigno/38284-add-grouping-function-docs
38284 add grouping function docs
2022-08-25 16:03:31 -04:00
DanRoscigno
5847c5cdbd reducing samples 2022-08-25 15:46:48 -04:00
Alexey Milovidov
64b8b8294d
Update grouping_function.md 2022-08-25 22:01:13 +03:00