Commit Graph

1240 Commits

Author SHA1 Message Date
Alexey Milovidov
0d1d177013
Merge pull request #41131 from JackyWoo/add_function_java_int_hash
Support Java integers hashing in `javaHash`
2022-10-02 18:10:51 +03:00
Duc Canh Le
99bf356220
Merge branch 'master' into ch_canh_fix_decrypt_with_null 2022-10-02 14:35:42 +08:00
Roman Vasin
f716fc2fe5 Put list of functions in correct order in docs; Use new formatting in exception messages. 2022-09-30 07:54:35 +00:00
Roman Vasin
45414b251d Merge branch 'master' of github.com:ClickHouse/ClickHouse into ADQM-583 2022-09-29 17:02:49 +00:00
Roman Vasin
9dfb2d058c Add DateTime64 returing functions in Note box 2022-09-29 14:23:50 +00:00
Robert Schulze
6381631836
Merge pull request #41824 from ClickHouse/tryBase58
Add function tryBase58Decode()
2022-09-29 12:38:58 +02:00
Duc Canh Le
6fd234d652 fix document typo and format 2022-09-28 20:06:18 +08:00
Robert Schulze
82139fad0e
Docs: Remove obsolete modelEvaluate() mention 2022-09-28 11:47:16 +00:00
Duc Canh Le
170ca26435 add document 2022-09-28 17:55:20 +08:00
Alexey Milovidov
683b7e5e95
Merge branch 'master' into add_function_java_int_hash 2022-09-28 01:30:26 +03:00
Robert Schulze
33891c275e
Fix typo 2022-09-27 10:20:39 +00:00
Robert Schulze
aa7e62ad5f
Add functio ntryBase58Decode()
- makes it consistent with tryBase64Decode(), follow-up to #39292

- additionally the following minor changes:

  - rename Common/base58.h|cpp to Common/Base58.h|cpp for constency with
    Common/Base64.h|cpp

  - check that (encode|decode|tryDecode)Base64() gets just one argument
2022-09-27 10:18:36 +00:00
Robert Schulze
38aecc15a2
Merge pull request #41214 from arenadata/ADQM-528-B
Add enable_extended_results_for_datetime_functions option to return results of type Date32
2022-09-17 19:25:11 +02:00
Robert Schulze
b32b02d844
Merge pull request #40897 from ClickHouse/catboost-bridge-resurrected
Move CatBoost evaluation into clickhouse-library-bridge
2022-09-16 13:12:09 +02:00
Roman Vasin
d82cd7b007 Add many cosmetic changes to C++ code and MD docs 2022-09-16 09:37:30 +00:00
Roman Vasin
5c242de43a Merge branch 'master' of github.com:ClickHouse/ClickHouse into ADQM-528-B 2022-09-15 13:47:51 +00:00
Roman Vasin
3ce66eab83 Correct all needed MD files 2022-09-15 13:28:11 +00:00
JackyWoo
afcbf3a853
Merge branch 'master' into add_function_java_int_hash 2022-09-15 10:08:26 +08:00
JackyWoo
05516bfd03 fix docs 2022-09-15 10:07:39 +08:00
Denny Crane
69d90de9a3
Update date-time-functions.md 2022-09-14 15:23:59 -03:00
Denny Crane
cbdbe1077a
Update date-time-functions.md 2022-09-14 12:16:32 -03:00
JackyWoo
e3caa6f096 some fixes 2022-09-13 21:21:35 +08:00
Denny Crane
132727d212
Doc. Fix links. 2022-09-08 14:37:42 -03:00
Robert Schulze
60f9f6855d
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.

SQL syntax:

  SELECT
    catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
    ACTION AS target
  FROM amazon_train
  LIMIT 10

Required configuration:

  <catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>

*** Implementation Details ***

The internal protocol between the server and the library-bridge is
simple:

- HTTP GET on path "/extdict_ping":
  A ping, used during the handshake to check if the library-bridge runs.

- HTTP POST on path "extdict_request"
  (1) Send a "catboost_GetTreeCount" request from the server to the
      bridge, containing a library path (e.g /home/user/libcatboost.so) and
      a model path (e.g. /home/user/model.bin). Rirst, this unloads the
      catboost library handler associated to the model path (if it was
      loaded), then loads the catboost library handler associated to the
      model path, then executes GetTreeCount() on the library handler and
      finally sends the result back to the server. Step (1) is called once
      by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
      library path handler is unloaded in the beginning because it contains
      state which may no longer be valid if the user runs
      catboost("/path/to/model.bin", ...) more than once and if "model.bin"
      was updated in between.
  (2) Send "catboost_Evaluate" from the server to the bridge, containing
      the model path and the features to run the interference on. Step (2)
      is called multiple times (once per chunk) by the server from function
      FunctionCatBoostEvaluate::executeImpl(). The library handler for the
      given model path is expected to be already loaded by Step (1).

Fixes #27870
2022-09-08 09:01:32 +00:00
Denny Crane
a75eb5ad84
Update date-time-functions.md 2022-09-07 15:59:23 -03:00
Denny Crane
0071ef9e38
Update date-time-functions.md 2022-09-07 15:56:31 -03:00
Kruglov Pavel
582216a3ca
Merge pull request #39919 from pzhdfy/UniqSketch
UniqThetaSketch support set operation such as union/intersect/not
2022-09-05 13:42:14 +02:00
Alexey Milovidov
1963baa11e
Merge pull request #40964 from ClickHouse/fix-docs-formatting
Fix formatting of notes box in documentation
2022-09-04 03:39:39 +03:00
Denny Crane
dd19b0856e
Doc. mapApply, mapFilter, mapUpdate (#40961)
* Update tuple-map-functions.md

* Update tuple-map-functions.md
2022-09-04 00:43:39 +02:00
Robert Schulze
385427ade8
Fix formatting of notes box in documentation
Follow-up to PR #38435
2022-09-03 17:25:14 +00:00
Fangyuan Deng
bc7d661668
Merge branch 'master' into UniqSketch 2022-09-01 19:31:53 +08:00
pzhdfy
acec516271 add docs 2022-09-01 19:31:01 +08:00
Robert Schulze
56eece40ec
Merge pull request #40736 from LevyCory/add-offset-to-formatDateTime
Add timezone offset support  to `formatDateTime`
2022-09-01 09:50:17 +02:00
Robert Schulze
912663b719
Revert "Move CatBoost evaluation into clickhouse-library-bridge" 2022-08-31 20:54:43 +02:00
Robert Schulze
ca01286028
Merge pull request #39629 from ClickHouse/catboost-bridge
Move CatBoost evaluation into clickhouse-library-bridge
2022-08-31 16:16:11 +02:00
Denny Crane
0d7cc82267
Update string-functions.md 2022-08-30 11:08:23 -03:00
Andrey Zvonov
93f9abf130
upd2 2022-08-30 14:41:40 +03:00
Andrey Zvonov
14adea8792
fix error in docs 2022-08-30 14:40:26 +03:00
Cory Levy
1e2eee7146 Remove unnecessary backslashes in markdown sql blocks 2022-08-29 16:31:31 -04:00
Robert Schulze
6b2b3c1eb3
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.

SQL syntax:

  SELECT
    catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
    ACTION AS target
  FROM amazon_train
  LIMIT 10

Required configuration:

  <catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>

*** Implementation Details ***

The internal protocol between the server and the library-bridge is
simple:

- HTTP GET on path "/extdict_ping":
  A ping, used during the handshake to check if the library-bridge runs.

- HTTP POST on path "extdict_request"
  (1) Send a "catboost_GetTreeCount" request from the server to the
      bridge, containing a library path (e.g /home/user/libcatboost.so) and
      a model path (e.g. /home/user/model.bin). Rirst, this unloads the
      catboost library handler associated to the model path (if it was
      loaded), then loads the catboost library handler associated to the
      model path, then executes GetTreeCount() on the library handler and
      finally sends the result back to the server. Step (1) is called once
      by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
      library path handler is unloaded in the beginning because it contains
      state which may no longer be valid if the user runs
      catboost("/path/to/model.bin", ...) more than once and if "model.bin"
      was updated in between.
  (2) Send "catboost_Evaluate" from the server to the bridge, containing
      the model path and the features to run the interference on. Step (2)
      is called multiple times (once per chunk) by the server from function
      FunctionCatBoostEvaluate::executeImpl(). The library handler for the
      given model path is expected to be already loaded by Step (1).

Fixes #27870
2022-08-29 20:26:45 +00:00
DanRoscigno
c4b8137d31 replace symlinks 2022-08-29 12:19:50 -04:00
Cory Levy
deb2a97020 Add docs 2022-08-28 22:28:34 -04:00
DanRoscigno
5b5fcc56aa add slugs 2022-08-28 10:53:34 -04:00
DanRoscigno
db1a3b717c add slugs 2022-08-28 09:58:27 -04:00
DanRoscigno
c4caa35cfd add frontmatter dashes 2022-08-28 09:53:52 -04:00
zvonand
e257f9d0cd update docs, tests + small fixes 2022-08-24 01:09:14 +03:00
zvonand
a9fd733871 updated docs 2022-08-24 01:09:14 +03:00
Andrey Zvonov
f40bd2194f
Update docs/en/sql-reference/functions/date-time-functions.md
Co-authored-by: Robert Schulze <robert@clickhouse.com>
2022-08-23 23:42:19 +03:00
Andrey Zvonov
b54a04d48f
Update docs/en/sql-reference/functions/date-time-functions.md
Co-authored-by: Robert Schulze <robert@clickhouse.com>
2022-08-23 23:42:01 +03:00
Andrey Zvonov
6c1fa6844f
Update docs/en/sql-reference/functions/date-time-functions.md
Co-authored-by: Robert Schulze <robert@clickhouse.com>
2022-08-23 23:41:19 +03:00
Andrey Zvonov
946223df20
Update docs/en/sql-reference/functions/date-time-functions.md
Co-authored-by: Robert Schulze <robert@clickhouse.com>
2022-08-23 23:41:01 +03:00
Andrey Zvonov
e4f00e2f15
Update docs 1
Co-authored-by: Robert Schulze <robert@clickhouse.com>
2022-08-23 23:40:34 +03:00
zvonand
d789dabc9d Fixed , updated docs 2022-08-22 17:36:56 +03:00
zvonand
963d838647 updated docs 2022-08-22 17:36:56 +03:00
Denny Crane
b6329293a9
Update date-time-functions.md 2022-08-21 16:45:13 -03:00
Denny Crane
c926d04c21
Update date-time-functions.md 2022-08-21 16:42:40 -03:00
Denny Crane
54374e22e6
Update date-time-functions.md 2022-08-19 15:09:40 -03:00
Robert Schulze
24615059fb
Merge pull request #40319 from ClickHouse/custom-error-code-in-throwif
Allow custom error code in throwIf
2022-08-19 09:49:35 +02:00
Robert Schulze
f121941916
feat: allow custom error code for SQL function throwIf() 2022-08-17 21:14:40 +00:00
Stephan
5252d41fdb
Provide missing documentation for arrayWithConstant
Documentation of `arrayWithConstant(l, const)`-function, as this was not part of the documentation so far.
2022-08-17 13:21:03 +02:00
Denny Crane
f574b49825
Update array-join.md 2022-08-15 16:45:07 -03:00
Denny Crane
f3b349fd18
Update array-join.md 2022-08-15 16:42:55 -03:00
Denny Crane
73f51a0f19 doc array-join 2022-08-15 16:40:48 -03:00
Robert Schulze
55ff546322
Fix typo 2022-08-11 20:42:59 +00:00
Dan Roscigno
35ee71a908
Merge pull request #40110 from ClickHouse/fix-docs-for-39287
Fix documentation of "modulo(a, b)"
2022-08-11 16:14:28 -04:00
Robert Schulze
aa42a42e0f
Fix documentation of "modulo(a, b)"
Fixes #39287

ClickHouse uses the same semantics for modulo on floats as Python, i.e.
  4.2 % 2.0 = 0.2
and not as previously documented:
  4.2 % 2.0 --> (drop decimal places) --> 4 % 2 = 0.

Fixed the documentation.
2022-08-11 08:23:43 +00:00
Alexey Milovidov
8374f31306
Merge pull request #39425 from arenadata/ADQM-485
Add support of dates from year 1900 to 2299 for Date32 and DateTime64
2022-08-11 05:01:53 +03:00
Alexey Milovidov
53097b3d65
Merge pull request #40015 from tbragin/master
Add parseDateTime64BestEffortUS* functions
2022-08-10 20:00:50 +03:00
Tanya Bragin
be51187f85 Update tests to use UTC, and docs to consistent order 2022-08-09 17:27:06 -07:00
Robert Schulze
76cc0cdc27
Remove redundancy in documentation of parseDateTimeBestEffortUS[OrNull|OrZero]()
The documentation of functions
- parseDateTimeBestEffortUS(),
- parseDateTimeBestEffortUSOrNull() and
- parseDateTimeBestEffortUSOrZero()
previously repeated the text of functions
- parseDateTimeBestEffort(),
- parseDateTimeBestEffortOrNull() and
- parseDateTimeBestEffortOrZero().

In the docs of the first function, the duplication  was removed and we
now emphasize the difference to the non-US version more clearly.

The docs of the second and third functions were just stripped of the
duplication. Chinese docs did not mention the US-date variants, hence
nothing was changed in them.
2022-08-09 16:54:06 +00:00
Robert Schulze
7c59331e0e
Make examples for function 'parseDateTimeBestEffort()' less ambiguous
"12/12" and "12-12" could be interpreted as MM/DD or DD/MM and therefore
were not great examples.
2022-08-09 08:56:05 +00:00
Tanya Bragin
eb61db3b67 initial changes to close #37492 2022-08-08 19:45:00 -07:00
Anton Popov
f94d4d4877
Merge branch 'master' into hash-functions-map 2022-08-02 13:26:54 +02:00
Anton Popov
2a841d0860 update docs for hash functions 2022-08-01 14:22:40 +00:00
Kruglov Pavel
dfdfabec94
Merge pull request #39218 from evillique/file_default_value
Add default argument to the function `file`
2022-08-01 13:04:19 +02:00
Robert Schulze
b52843d5fd
Merge pull request #37951 from zvonand/dt64_timeslots
Fix timeSlots for DateTime64
2022-07-30 20:49:05 +02:00
Roman Vasin
b462366415 Merge branch 'master' of github.com:ClickHouse/ClickHouse into ADQM-485 2022-07-25 17:55:47 +00:00
Alexey Milovidov
6fdcb009ff
Merge pull request #39533 from ClickHouse/now-in-block
Add function `nowInBlock`
2022-07-25 04:22:11 +03:00
Denny Crane
a1a1d357b4
Update string-replace-functions.md 2022-07-24 20:15:48 -03:00
Alexey Milovidov
0492f23d07 Add the docs 2022-07-24 23:11:54 +02:00
Andrey Zvonov
e473606cd1
Merge branch 'master' into zvonand-b58-datatype 2022-07-21 15:55:05 +02:00
Roman Vasin
e3192cf753 Correct docs to reflect new range 1900..2299 for Date32 and DateTime64; Cleanup code 2022-07-20 15:19:02 +00:00
Nikolay Degterinsky
c9ae525da7
Merge branch 'master' into file_default_value 2022-07-20 14:22:58 +02:00
Kruglov Pavel
12221cffc9
Merge pull request #39071 from jiahui-97/parse_timedelta
implementation of parseTimeDelta function
2022-07-19 12:39:56 +02:00
jiahui-97
e7af88b688 implementation of parseTimeDelta function
Co-authored-by: Kruglov Pavel <48961922+Avogar@users.noreply.github.com>
2022-07-19 09:33:02 +08:00
Andrey Zvonov
e0d1954fac
Merge branch 'master' into dt64_timeslots 2022-07-16 22:33:46 +02:00
zvonand
728219e640 updated due to review 2022-07-16 22:16:19 +02:00
zvonand
9ac7e41e07 updated docs 2022-07-16 19:45:33 +02:00
zvonand
d07a652883 merge 2022-07-16 19:05:07 +02:00
zvonand
4ab52b6873 added new DataType + fixes 2022-07-16 18:58:47 +02:00
Xoel Lopez Barata
10625e7c54
Fix typo
Replace "indicies" by "indices"
2022-07-14 17:41:48 +02:00
Vladimir C
b5011b5157
Merge pull request #39169 from FArthur-cmd/add_docs_for_dist_func
Documentation for array's distance functions
2022-07-14 12:36:17 +02:00
Nikolay Degterinsky
9ca2935d72 Add default argument to function file 2022-07-14 09:54:44 +00:00
Nikolay Degterinsky
e3a79520c1 Merge remote-tracking branch 'upstream/master' into translate 2022-07-13 11:42:40 +00:00
FArthur-cmd
411a08630c add docs 2022-07-13 10:29:13 +00:00
Nikolay Degterinsky
185978bfbe Add docs 2022-07-13 01:52:25 +00:00
Andrey Zvonov
7bfe155f09
Merge branch 'master' into dt64_timeslots 2022-07-12 10:26:38 +03:00
Nikolay Degterinsky
5c5c7ae6db Fix typos 2022-07-07 02:17:57 +00:00
DanRoscigno
95c3eff4c7 remove 404ing original article URL 2022-06-30 11:17:38 -04:00
Nikolay Degterinsky
384603a656
Merge pull request #38592 from ClibMouse/PR_38549_updates
Updated Geo Functions Index.md
2022-06-30 03:20:37 +02:00