Commit Graph

109467 Commits

Author SHA1 Message Date
Alexey Milovidov
134e50ddc4
Merge branch 'master' into input_format_null_as_default-improvement 2023-02-23 10:23:06 +03:00
Alexey Milovidov
4de5399768
Merge pull request #46727 from ClickHouse/Avogar-patch-2
Better exception message during Tuple JSON deserialization
2023-02-23 10:13:18 +03:00
Alexey Milovidov
e1f4303516
Merge pull request #46760 from ClickHouse/changelog-23.2
Add changelog for 23.2
2023-02-23 10:03:05 +03:00
Alexey Milovidov
cabb8fb81b Add changelog for 23.2 2023-02-23 08:02:37 +01:00
Alexey Milovidov
cb2ec24ea9 Add changelog for 23.2 2023-02-23 08:02:14 +01:00
Alexey Milovidov
f2509ba9c6 Add changelog for 23.2 2023-02-23 08:00:17 +01:00
Alexey Milovidov
7d837eea86
Update FieldFromAST.cpp 2023-02-23 08:15:14 +03:00
Alexey Milovidov
164c0c9927
Merge pull request #46730 from ClickHouse/evillique-patch-1
Ask for the password in clickhouse-client interactively
2023-02-23 08:05:08 +03:00
Alexey Milovidov
e3f24b2b68
Merge pull request #46744 from ClickHouse/remove_unused_field
Remove unused MergeTreeReadTask::remove_prewhere_column
2023-02-23 08:02:54 +03:00
ducle.canh
4ab568b91a minor fix test format 2023-02-23 09:36:20 +08:00
ducle.canh
0aa72624a6 Merge branch 'master' into ch_fix_aggregate_arithmetic 2023-02-23 09:35:45 +08:00
HarryLeeIBM
6fa908ff10 Use latest libunwind for s390x 2023-02-22 17:24:15 -08:00
pufit
c4646708a6 ru documentation 2023-02-22 18:54:39 -05:00
Mikhail f. Shiryaev
904e6ae769
On out-of-space at returns error, we must terminate still 2023-02-23 00:45:51 +01:00
pufit
b878ef9f01 en documentation 2023-02-22 18:28:30 -05:00
pufit
0fcda19d20 Fix description typo 2023-02-22 18:27:23 -05:00
HarryLeeIBM
776d4d9bb4 More refactoring 2023-02-22 15:19:25 -08:00
robot-clickhouse-ci-2
e3e5d83f96
Merge pull request #46745 from ClickHouse/Avogar-patch-3
Update docs about format table function
2023-02-22 23:40:10 +01:00
Igor Nikonov
271b72abf4
Merge pull request #46642 from ClickHouse/remove_redundant_sorting_fix
Fix: remove redundant sorting optimization
2023-02-22 23:33:15 +01:00
Anton Popov
d5864fa88e allow to fallback from async insert in case of large amount of data 2023-02-22 21:59:24 +00:00
Alexey Milovidov
5154b04cfb
Merge pull request #46732 from ClickHouse/fix-sonarcloud-job
Fix SonarCloud Job
2023-02-23 00:31:54 +03:00
Jiebin Sun
d6710d9b34 Align all the SSE4.1 requirement and use needle_size
Align all the SSE4.1 requirement from StringSearcher. Use needle_size
in while loop to make the code clean.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 16:15:26 -05:00
HarryLeeIBM
e979a86604 Minor change for adding constexpr 2023-02-22 12:50:46 -08:00
HarryLeeIBM
18b93fc212 More refactoring for better style 2023-02-22 12:41:00 -08:00
Robert Schulze
81bf43157f
Allow configuration of Kafka topics with periods
The Kafka table engine allows global configuration and per-Kafka-topic
configuration. The latter uses syntax <kafka_TOPIC>, e.g. for topic
"football":

  <kafka_football>
      <retry_backoff_ms>250</retry_backoff_ms>
      <fetch_min_bytes>100000</fetch_min_bytes>
  </kafka_football>

Some users had to find out the hard way that such configuration doesn't
take effect if the topic name contains a period, e.g. "sports.football".
The reason is that ClickHouse configuration framework already uses
periods as level separators to descend the configuration hierarchy.
(Besides that, per-topic configuration at the same level as global
configuration could be considered ugly.)

Note that Kafka topics may contain characters "a-zA-Z0-9._-" (*) and
a tree-like topic organization using periods is quite common in
practice.

This PR deprecates the existing per-topic configuration syntax (but
continues to support it for backward compat) and introduces a new
per-topic configuration syntax below the global Kafka configuration of
the form:

<kafka>
   <topic name="football">
       <retry_backoff_ms>250</retry_backoff_ms>
       <fetch_min_bytes>100000</fetch_min_bytes>
   </topic>
</kafka>

The period restriction doesn't apply to XML attributes, so <topic
name="sports.football"> will work. Also, everything Kafka-related is
below <kafka>.

Considered but rejected alternatives:
- Extending Poco ConfigurationView with custom separators (e.g."/"
  instead of "."). Won't work easily because ConfigurationView only
  builds a path but defers descending the configuration tree to the
  normal configuration classes.
- Reloading the configuration file in StorageKafka (instead of reading
  the loaded file) but with a custom separator. This mode is supported
  by XML configuration. Too ugly and error-prone since the true
  configuration is composed from multiple configuration files.

(*) https://stackoverflow.com/a/37067544
2023-02-22 20:35:09 +00:00
pufit
1157bf2d9f Removed unnecessary ifs 2023-02-22 15:15:09 -05:00
Yakov Olkhovskiy
fadbeb8ebd T64 codec support for IPv4 2023-02-22 19:25:48 +00:00
Yakov Olkhovskiy
99095446af review suggestions 2023-02-22 17:22:13 +00:00
Jiebin Sun
1f62135ba7 Make the optimized SIMD StringSearcher clean
This patch has revised the name of value and added comments to make
the SIMD StringSearcher clean and easy to understand based on pull
request 46289.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 12:18:21 -05:00
Jiebin Sun
d220e7f4fc Optimize the SIMD StringSearcher if needle_size is large
This patch offers an additional optimization when the needle_size is
large. If the needle_size is larger than the haystack_size, there is
no need to search any more.

The optimized SIMD StringSearcher has led at most 41.7% than Volnitsky
algorithm when the needle_size is less than 21, and fallen behind only
about 1% even when the needle_size is bigger than 50, which is not
considered as a common case.

Test platform: ICX server
Test query: SELECT COUNT(*) FROM hits WHERE URL LIKE '%{Needle}%';

Needle_size	opt/baseline
5		141.7%
6		129.4%
8		118.5%
9		112.3%
10		107.4%
14		103.4%
20		100.2%
21		100.7%
51		99.0%

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 11:58:17 -05:00
Jiebin Sun
f5a6a86dec Optimize the SIMD StringSearcher by searching first two chars
This patch offers the optimized SIMD StringSearcher by searching the first
and second chars together rather than only the first char, which will result
in big performance gain. The patch also provides a quick path when the needle
size is 1.

With this patch, I have tested the 43 queries in clickbench on ICX server.
Query 20 has got 35% performance gain. Other StringSearcher related queries
have got around 10% performance improvement. And the overall geomean of all
the queries has got 4.1% performance gain.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 11:55:30 -05:00
Alexander Tokmakov
e660c0838c fix multipart requests 2023-02-22 17:54:35 +01:00
Kruglov Pavel
0e7143070e
Update docs about format table function 2023-02-22 17:51:29 +01:00
Alexander Gololobov
67dcd9694c Remove unused MergeTreeReadTask::remove_prewhere_column 2023-02-22 17:49:22 +01:00
avogar
50caa3d66c Update docs 2023-02-22 16:41:49 +00:00
Julio Jimenez
d0bd8877ce
Merge branch 'master' into fix-sonarcloud-job 2023-02-22 11:41:24 -05:00
avogar
e0931dbdbe Enable input_format_json_ignore_unknown_keys_in_named_tuple by default 2023-02-22 16:40:53 +00:00
avogar
5af6ac534e Use smaller test file 2023-02-22 15:51:47 +00:00
Nikolay Degterinsky
af992ca2db
Better 2023-02-22 16:51:36 +01:00
pufit
8e7533fa57
Merge pull request #46564 from AVMusorin/update-time-distribution-queue
Added `last_exception_time` column into distribution_queue table
2023-02-22 10:43:35 -05:00
Julio Jimenez
4f31e59dcd
Fix SonarCloud Job
Signed-off-by: Julio Jimenez <julio@clickhouse.com>
2023-02-22 10:34:27 -05:00
avogar
638b28cd85 Better test file 2023-02-22 15:21:06 +00:00
avogar
986dd72870 Fix possible clickhouse-local abort on JSONEachRow schema inference 2023-02-22 15:18:13 +00:00
Nikolay Degterinsky
cdbff57e6c
Ask for password interactively 2023-02-22 15:58:06 +01:00
Maksim Kita
40d2798cb4 Analyzer AutoFinalOnQueryPass fix 2023-02-22 15:51:13 +01:00
Alexander Gololobov
b0427c2e3c
Merge pull request #46660 from ClickHouse/fix_backup_test
Fix integration test: terminate old version without wait
2023-02-22 15:20:26 +01:00
Azat Khuzhin
9ab4944b9e Handle input_format_null_as_default for nested types
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-02-22 15:15:52 +01:00
Antonio Andelic
7f5fb77ed5
Increase table retries in cluster copier tests (#46590) 2023-02-22 15:09:48 +01:00
Kseniia Sumarokova
bec094cd79
Merge pull request #46712 from kssenii/add-iceberg-doc
Add iceberg engine doc
2023-02-22 14:49:03 +01:00
Robert Schulze
4fd4e77737
Poco: POCO_HAVE_INT64 is always defined 2023-02-22 13:48:29 +00:00