Commit Graph

4758 Commits

Author SHA1 Message Date
zvonand
c329e382ab resolve conflicts 2023-11-29 16:03:07 +01:00
Han Fei
6733438adc Merge branch 'master' into hanfei/statistic 2023-11-28 16:35:00 +01:00
Robert Schulze
b56b48d2de
Update docs 2023-11-28 14:59:20 +00:00
zvonand
c306d21b54 merge master + resolve conflicts 2023-11-28 15:51:21 +01:00
Kruglov Pavel
06b9f8129f
Merge pull request #57298 from nellicus/patch-2
replace incorrect field name in coalesce example
2023-11-28 14:59:01 +01:00
Raúl Marín
48fff248e5 Merge remote-tracking branch 'blessed/master' into fix_respect_nulls 2023-11-28 12:28:36 +01:00
Raúl Marín
1a3f4032b1 Fix doc reference 2023-11-28 12:27:35 +01:00
Antonio Andelic
a61f32867e
Merge pull request #56488 from lingtaolf/feature/getHTTPHeader
add function getClientHTTPHeader
2023-11-28 11:41:56 +01:00
Robert Schulze
1448276bae
Merge pull request #55872 from azat/hour-no-leading-zero
Add ability to parse hours/months without leading zeros
2023-11-28 11:07:00 +01:00
robot-ch-test-poll3
e24b9e5ca2
Merge pull request #57276 from phallstrom/patch-1
Update URL to related content about LowCardinality
2023-11-28 10:59:14 +01:00
Antonio Bonuccelli
612c6be204
replace incorrect field name in coalesce example
field in the example should be `telegram` (not `icq`)
2023-11-28 10:20:02 +01:00
Ubuntu
a85cf758b0 Fuzz JSON table function
Create a table function `fuzzJSON`

An example query:
```
CREATE NAMED COLLECTION json_fuzzer AS json_str={};

SELECT *
FROM fuzzJSON(json_fuzzer, json_str = '{"students" : ["Alice", "Bob"]}', random_seed = 666, max_output_length = 128, probability = 0.9)
LIMIT 3

Query id: 7f802052-efb0-41b4-87fa-03b7dd290e9d

┌─json──────────────────────────────────────────────────────────────────────────────────┐
│ {"ade0yX":[9200406572736542991, true, "sm"]}                                          │
│ {"students":["Alice", "eSN3WO#a6NYTBe0$#OWwyIQ"], "cVoP2BuQugQ":17384271928263249403} │
│ {"students":["Alice", "Bob", "T+-k4+PJGkL%XRRaF2BoeN@&A"]}                            │
└───────────────────────────────────────────────────────────────────────────────────────┘
```

Next step:
* Generate invalid string

Fixes #35962

add Object('json')

use named collection
2023-11-28 01:58:24 +00:00
slvrtrn
497e5abc47 Add a doc entry. 2023-11-28 00:09:36 +01:00
Philip Hallstrom
6d7f160bcc
Update URL to related content about LowCardinality 2023-11-27 15:05:59 -08:00
Han Fei
7bd2ea3add Merge branch 'master' into hanfei/statistic 2023-11-27 23:31:23 +01:00
Raúl Marín
010a6074f8 Try to fix docs 2023-11-27 23:29:03 +01:00
Raúl Marín
3395c7c745 Provide a custom implementation for respect_null aggregations 2023-11-27 19:29:20 +01:00
Robert Schulze
4c236b34f0
Another small cleanup 2023-11-27 11:09:07 +00:00
Kruglov Pavel
b10e46b2bc
Merge pull request #57126 from Avogar/size-virtual-column
Add _size virtual column to s3/file/hdfs/url/azureBlobStorage engines
2023-11-27 10:12:18 +01:00
Robert Schulze
a8671049e1
Various smaller fixups 2023-11-27 08:49:54 +00:00
Nikolay Degterinsky
de0876ed68
Merge pull request #57234 from evillique/fix-docs-anchors
Fix docs
2023-11-27 01:22:43 +01:00
Nikolay Degterinsky
faa3a43803 Fix docs 2023-11-26 19:21:39 +00:00
Azat Khuzhin
315906b380 Add ability to parse hours/months without leading zeros
Recently I was looking into one dataset, that has such format.

So I've changed the meaning of the following formatters for
formatDateTime()/parseDateTime(), like in other implementations [1]:

- %l/%k - to handle hours without leading zeros
- %c - handle months without leading zeros

  [1]: https://www.php.net/strftime

And since this is kind of behaviour change (even though it is unlikely
will break something), there is a new setting to control this new
behaviour:
- parsedatetime_enable_format_without_leading_zeros=1 (default ON)
- formatdatetime_enable_format_without_leading_zeros=0 (default OFF, since this could change users output)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-11-26 19:10:17 +01:00
Robert Schulze
4088ec0eac
Merge pull request #57199 from rschu1ze/docs-math-funcs
Docs: Improve math function docs
2023-11-25 21:22:53 +01:00
Robert Schulze
2e7c16e138
Fix broken links 2023-11-25 19:47:32 +00:00
Kseniia Sumarokova
ea7c68febe
Merge pull request #56705 from ClickHouse/docs-s3-gcs-improvements
[Docs] Add details to S3 and GCS table functions
2023-11-24 16:17:40 +01:00
Robert Schulze
75ceaf6200
Docs: Improve math function docs 2023-11-24 14:31:39 +00:00
Yarik Briukhovetskyi
69205769d0
Merge branch 'ClickHouse:master' into group_sorted_array_function 2023-11-23 20:23:47 +01:00
Tomáš Hromada
956a8add35
Update other-functions.md
Fix `initializeAgregation` typo
2023-11-23 15:57:19 +01:00
Kruglov Pavel
b84e3cf683
Merge branch 'master' into size-virtual-column 2023-11-22 19:25:00 +01:00
avogar
6b64efb55a Add docs 2023-11-22 18:21:30 +00:00
zvonand
5153798aeb Introduced fileCluster table function
Added fileCluster function
Added test and docs
2023-11-22 15:06:04 +01:00
Bhavna Jindal
a324bd2780 added macro to enable/disable pocketfft lib 2023-11-21 06:52:47 -08:00
Bhavna Jindal
baaa15b5b0 added macro to enable/disable pocketfft lib 2023-11-21 06:52:47 -08:00
Bhavna Jindal
ad99cb902f changed method name, updated pocketfft repo reference 2023-11-21 06:52:47 -08:00
Bhavna Jindal
e3826786f1 Added testcases, documentation and addressed comments 2023-11-21 06:52:47 -08:00
凌涛
e86aa433c5
Merge branch 'master' into feature/getHTTPHeader 2023-11-21 19:48:47 +08:00
凌涛
c5f16725ec add function getClientHTTPHeader 2023-11-21 11:12:01 +08:00
slvrtrn
9f96b58979 Update docs, add more tests. 2023-11-20 16:22:47 +01:00
robot-clickhouse-ci-1
4bf3845fbf
Merge pull request #56962 from ClickHouse/docs-count-distinct-perf-tip
[Docs] Add perf tip for COUNT(DISTINCT expr)
2023-11-19 20:55:15 +01:00
Justin de Guzman
9d3c62ec41
[Docs] Recommend against OPTIMIZE FINAL in OPTIMIZE page 2023-11-18 18:51:35 -08:00
Justin de Guzman
a894671e8a
[Docs] Add perf tip for COUNT(DISTINCT expr) 2023-11-18 17:53:32 -08:00
Robert Schulze
4353507724
Merge pull request #56540 from slvrtrn/concat-arbitrary-types
Support arguments of arbitary type in function `concat()`
2023-11-18 17:21:03 +01:00
slvrtrn
d504ad6e99 Merge remote-tracking branch 'origin' into concat-arbitrary-types 2023-11-17 13:45:12 +01:00
Denny Crane
3765a1c77b
Update random-functions.md 2023-11-16 16:07:49 -04:00
Robert Schulze
0b45e4bfee
Merge remote-tracking branch 'ClickHouse/master' into concat-arbitrary-types 2023-11-15 15:44:36 +00:00
Robert Schulze
f21dd37d18
Some fixups 2023-11-15 15:43:41 +00:00
Robert Schulze
eb08bba607 Update docs 2023-11-15 16:37:24 +01:00
slvrtrn
c7cd4fa972 Update concat docs 2023-11-15 14:57:34 +01:00
Ethan Shea
37dc535d78
Remove tabs 2023-11-13 16:16:07 -05:00
Ethan Shea
207c4c50cb
Improve legibility of window functions list
Before, markdown was rendering these as one big paragraph.
2023-11-13 16:12:05 -05:00
justindeguzman
bdb624f20f [Docs] Add details on S3 table function memory usage/streaming 2023-11-13 13:03:25 -08:00
justindeguzman
de42b7f9e0 [Docs] Add details to S3 and GCS table functions 2023-11-13 12:57:05 -08:00
Robert Schulze
361a3ca11f
Merge pull request #56519 from rschu1ze/qc-ignore-nondeterministic-queries
Query cache: Allow to ignore non-deterministic queries
2023-11-13 21:32:44 +01:00
Denny Crane
c93efc929a fix typo in the doc 2023-11-13 16:54:21 +03:00
Kseniia Sumarokova
b7915e6983
Merge pull request #55782 from aiven-sal/aiven-sal/namedcoll_pr
NamedCollections: prevent fields overriding in functions' calls
2023-11-13 09:07:09 +01:00
rondo_1895
04b75a3ac9
Update except.md
Symbol Error
2023-11-13 11:37:19 +08:00
Robert Schulze
bea529c9a1
Try to stabilize test results 2023-11-12 15:11:38 +00:00
Robert Schulze
a511f41438
Docs: Fix overly long quote which pushed the right pane out of view 2023-11-12 10:51:22 +00:00
Shri Bodas
bc68fcf759 canonicalRand is replaced by randCanonical 2023-11-11 10:11:47 -08:00
Robert Schulze
82f5e92bf5
Update docs of fromUnixTimeStamp[InJodaSyntax]() 2023-11-10 11:11:42 +00:00
Salvatore Mesoraca
7d206cbc3c
NamedCollections: prevent fields overriding in functions call
Add syntax in SQL and XML to mark specific fields to allow
override or not.
Also add a new setting to control the default behaviour when
overriding support is not specified.
2023-11-09 17:25:16 +01:00
Robert Schulze
86b8ebf71c
Merge remote-tracking branch 'rschu1ze/master' into arrayRandomSample-improved 2023-11-07 15:25:29 +00:00
Robert Schulze
842cc36598
Update docs 2023-11-07 10:25:27 +00:00
Robert Schulze
54b5d573d7
Merge remote-tracking branch 'rschu1ze/master' into idatatype-convenience-functions 2023-11-07 10:18:30 +00:00
Robert Schulze
e22c673299
Revert "Revert "Add function arrayRandomSample()""
This reverts commit e47e50e30d.
2023-11-07 10:16:13 +00:00
robot-ch-test-poll1
845a235ad8
Merge pull request #56237 from ClickHouse/justindeguzman-patch-1
[Docs] Specify IAM role necessary for GCS table function
2023-11-07 01:20:06 +01:00
Alexey Milovidov
e47e50e30d
Revert "Add function arrayRandomSample()" 2023-11-07 02:49:11 +03:00
Robert Schulze
8846cc6770
Merge pull request #56327 from rschu1ze/parseQueryOrNull
Improve `formatQuery()`
2023-11-06 22:48:16 +01:00
Robert Schulze
5cfcf98d4a
Merge pull request #56277 from rschu1ze/show-columns-connection-type
Make type names in `SHOW COLUMNS` dependent on connection type
2023-11-06 16:24:27 +01:00
Robert Schulze
8896134531
Cleanup IDataType convenience functions 2023-11-06 13:25:32 +00:00
Robert Schulze
ae1dcb5254
Move documentation of string similarity functions to better location 2023-11-06 13:25:08 +00:00
vdimir
3b775dee53
Merge pull request #55126 from ClickHouse/vdimir/jaccard_index_perf
Add function jaccardIndex back with better performance
2023-11-06 10:39:02 +01:00
Robert Schulze
049c811a38
Make type names in SHOW COLUMNS dependent only on connection type 2023-11-05 21:20:20 +00:00
Robert Schulze
8a64066ee1
+ formatQueryOrNull() 2023-11-05 20:24:23 +00:00
Robert Schulze
480e284db1
Merge pull request #56088 from jh0x/feature-from-days-zero
Add function `fromDaysSinceYearZero()`
2023-11-04 00:17:04 +01:00
Nikita Mikhaylov
9666549d15
Merge pull request #56280 from ClickHouse/melvynator-patch-2
Update detach.md
2023-11-03 14:45:04 +01:00
vdimir
55a1baf029
Merge pull request #56022 from ClickHouse/vdimir/check_all_tables
Add CHECK query for all the tables
2023-11-03 10:10:42 +01:00
Dmitry Novik
2db74997a8
Fix typo 2023-11-03 01:34:07 +01:00
Dmitry Novik
f6733ac7ab
Add documentation for ATTACH DATABASE 2023-11-03 01:21:11 +01:00
Dmitry Novik
49d553369e
Add info about DETACH DATABASE 2023-11-03 01:12:58 +01:00
Peignon Melvyn
5d939f6df0
Update detach.md
Added detach database
2023-11-03 00:23:56 +01:00
Robert Schulze
910477b1bd
Add FROM_DAYS() alias 2023-11-02 21:43:17 +00:00
Robert Schulze
0b587d3cea
Remove overflow setting
Reason 1:
- https://github.com/ClickHouse/ClickHouse/pull/55696#issuecomment-1774130793
  Basically: Setting `date_time_overflow_behavior` applies only to
  conversions between Date <-> Date32 <-> DateTime <-> DateTime64 but
  it does not control the behavior of functions.

Reason 2:
- It's overkill (even if well-intented overkill). ClickHouse generally
  has a garbage in - garbate out philosophy, so that should be the
  default behavior which corresponds to behavior "ignore". Moreover,
  function 'fromDaysSinceYearZero()' mostly exists for compatibility /
  parity with MySQL and MySQL takes the same stance.
2023-11-02 21:43:17 +00:00
Robert Schulze
492af27feb
Update docs 2023-11-02 20:56:01 +00:00
robot-clickhouse
13adae0e42
Merge pull request #56244 from ne1r0n/patch-4
[Docs] fix typo
2023-11-02 14:59:19 +01:00
Nikita Taranov
b078443826
Fix typo in largestTriangleThreeBuckets.md (#56263) 2023-11-02 14:45:13 +01:00
vdimir
14ba4696f9
Rename jaccardIndex -> stringJaccardIndex, upd doc 2023-11-02 11:37:50 +00:00
vdimir
1f936d1966
upd byteHammingDistance doc 2023-11-02 11:37:50 +00:00
Alexander Chashnikov
3d88614e6b
fix typo
asteriks -> asterisk
2023-11-02 10:56:17 +02:00
Justin de Guzman
782d67766d
[Docs] Specify IAM role necessary for GCS table function 2023-11-01 21:56:20 -07:00
Joanna Hulboj
757dc8b944 Fix typos, add to/fromDaysSinceYearZero to fuzz 2023-11-01 17:48:58 +00:00
Azat Khuzhin
c25d6cd624
Rename directory monitor concept into background INSERT (#55978)
* Limit log frequence for "Skipping send data over distributed table" message

After SYSTEM STOP DISTRIBUTED SENDS it will constantly print this
message.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename directory monitor concept into async INSERT

Rename the following query settings (with preserving backward
compatiblity, by keeping old name as an alias):
- distributed_directory_monitor_sleep_time_ms -> distributed_async_insert_sleep_time_ms
- distributed_directory_monitor_max_sleep_time_ms -> distributed_async_insert_max_sleep_time_ms
- distributed_directory_monitor_batch -> distributed_async_insert_batch_inserts
- distributed_directory_monitor_split_batch_on_failure -> distributed_async_insert_split_batch_on_failure

Rename the following table settings (with preserving backward
compatiblity, by keeping old name as an alias):
- monitor_batch_inserts -> async_insert_batch
- monitor_split_batch_on_failure -> async_insert_split_batch_on_failure
- directory_monitor_sleep_time_ms -> async_insert_sleep_time_ms
- directory_monitor_max_sleep_time_ms -> async_insert_max_sleep_time_ms

And also update all the references:

    $ gg -e directory_monitor_ -e monitor_ tests docs | cut -d: -f1 | sort -u | xargs sed -e 's/distributed_directory_monitor_sleep_time_ms/distributed_async_insert_sleep_time_ms/g' -e 's/distributed_directory_monitor_max_sleep_time_ms/distributed_async_insert_max_sleep_time_ms/g' -e 's/distributed_directory_monitor_batch_inserts/distributed_async_insert_batch/g' -e 's/distributed_directory_monitor_split_batch_on_failure/distributed_async_insert_split_batch_on_failure/g' -e 's/monitor_batch_inserts/async_insert_batch/g' -e 's/monitor_split_batch_on_failure/async_insert_split_batch_on_failure/g' -e 's/monitor_sleep_time_ms/async_insert_sleep_time_ms/g' -e 's/monitor_max_sleep_time_ms/async_insert_max_sleep_time_ms/g' -i

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Rename async_insert for Distributed into background_insert

This will avoid amigibuity between general async INSERT's and INSERT
into Distributed, which are indeed background, so new term express it
even better.

Mostly done with:

    $ git di HEAD^ --name-only | xargs sed -i -e 's/distributed_async_insert/distributed_background_insert/g' -e 's/async_insert_batch/background_insert_batch/g' -e 's/async_insert_split_batch_on_failure/background_insert_split_batch_on_failure/g' -e 's/async_insert_sleep_time_ms/background_insert_sleep_time_ms/g' -e 's/async_insert_max_sleep_time_ms/background_insert_max_sleep_time_ms/g'

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

* Mark 02417_opentelemetry_insert_on_distributed_table as long

CI: https://s3.amazonaws.com/clickhouse-test-reports/55978/7a6abb03a0b507e29e999cb7e04f246a119c6f28/stateless_tests_flaky_check__asan_.html
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

---------

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-11-01 15:09:39 +01:00
Nikolay Degterinsky
cd779d4f50
Merge pull request #55856 from ClickHouse/rewrite-delete-docs
Rewrite DELETE docs
2023-11-01 00:10:01 +01:00
Joanna Hulboj
e5b4d7fac7 Fix failing checks 2023-10-31 21:00:11 +00:00
Robert Schulze
13b2946ae2
Merge pull request #55960 from rschu1ze/adddate-stringargs
Support string-encoded date arguments in `addDate()`
2023-10-31 15:37:34 +01:00
Raúl Marín
f3a3ec53f6
Merge pull request #56124 from den-crane/patch-64
Beautify `show merges`
2023-10-31 11:02:10 +01:00
Denny Crane
3535adf2af
Update show.md 2023-10-30 09:53:23 -03:00
Han Fei
19cdc6dfe2 Merge branch 'master' into hanfei/statistic 2023-10-30 04:53:18 +01:00
Alexey Milovidov
3631e476eb
Merge pull request #54947 from amosbird/minmax-combinator
Introduce -ArgMin/-ArgMax combinators.
2023-10-30 03:35:28 +01:00