Commit Graph

109207 Commits

Author SHA1 Message Date
Robert Schulze
6aa4e93284
Merge branch 'master' into StringSearcher 2023-02-23 11:54:05 +01:00
Kseniia Sumarokova
7e064ccbd4
Merge pull request #46667 from kssenii/hide-named-collections-under-setting
Allow to hide only values from system.named_collections while making collection visible
2023-02-23 11:44:50 +01:00
Kseniia Sumarokova
30a8eb0c2f
Merge pull request #46670 from kssenii/hide-disk-setting
Hide arguments of custom disk merge tree setting
2023-02-23 11:42:53 +01:00
Robert Schulze
7d34822c7c
Merge branch 'master' into hlee-s390x-xxhash 2023-02-23 11:29:26 +01:00
Robert Schulze
052010592c
Poco: Remove some dead code 2023-02-23 10:13:07 +00:00
Robert Schulze
d2cd249422
Merge pull request #46728 from ClickHouse/rs/poco-have-int64
Poco: POCO_HAVE_INT64 is always defined
2023-02-23 11:02:43 +01:00
Mikhail f. Shiryaev
f2b1654a44
Add overwriting failed reports as pending on rerun 2023-02-23 10:38:22 +01:00
Mikhail f. Shiryaev
e290ceec52
Reorder imports 2023-02-23 10:38:21 +01:00
Mikhail f. Shiryaev
33a2e2b3d8
Set default per_page=100 for GitHub 2023-02-23 10:38:09 +01:00
Ilya Yatsishin
f64caafaf9
Update CHANGELOG.md 2023-02-23 10:34:23 +01:00
Tyler Hannan
9054eb49ec
removing dated event 2023-02-23 10:06:18 +01:00
Alexey Milovidov
0db943e596
Merge pull request #46687 from ClickHouse/tavplubix-patch-5
Disable flaky test_ttl_move_memory_usage.py
2023-02-23 10:27:25 +03:00
Alexey Milovidov
c0bc549e77
Merge pull request #46709 from azat/bg-pool-no-query
BackgroundSchedulePool should not have any query context
2023-02-23 10:24:46 +03:00
Alexey Milovidov
134e50ddc4
Merge branch 'master' into input_format_null_as_default-improvement 2023-02-23 10:23:06 +03:00
Alexey Milovidov
4de5399768
Merge pull request #46727 from ClickHouse/Avogar-patch-2
Better exception message during Tuple JSON deserialization
2023-02-23 10:13:18 +03:00
Alexey Milovidov
e1f4303516
Merge pull request #46760 from ClickHouse/changelog-23.2
Add changelog for 23.2
2023-02-23 10:03:05 +03:00
Alexey Milovidov
cabb8fb81b Add changelog for 23.2 2023-02-23 08:02:37 +01:00
Alexey Milovidov
cb2ec24ea9 Add changelog for 23.2 2023-02-23 08:02:14 +01:00
Alexey Milovidov
f2509ba9c6 Add changelog for 23.2 2023-02-23 08:00:17 +01:00
Alexey Milovidov
7d837eea86
Update FieldFromAST.cpp 2023-02-23 08:15:14 +03:00
Alexey Milovidov
164c0c9927
Merge pull request #46730 from ClickHouse/evillique-patch-1
Ask for the password in clickhouse-client interactively
2023-02-23 08:05:08 +03:00
Alexey Milovidov
e3f24b2b68
Merge pull request #46744 from ClickHouse/remove_unused_field
Remove unused MergeTreeReadTask::remove_prewhere_column
2023-02-23 08:02:54 +03:00
ducle.canh
4ab568b91a minor fix test format 2023-02-23 09:36:20 +08:00
ducle.canh
0aa72624a6 Merge branch 'master' into ch_fix_aggregate_arithmetic 2023-02-23 09:35:45 +08:00
HarryLeeIBM
6fa908ff10 Use latest libunwind for s390x 2023-02-22 17:24:15 -08:00
pufit
c4646708a6 ru documentation 2023-02-22 18:54:39 -05:00
Mikhail f. Shiryaev
904e6ae769
On out-of-space at returns error, we must terminate still 2023-02-23 00:45:51 +01:00
pufit
b878ef9f01 en documentation 2023-02-22 18:28:30 -05:00
pufit
0fcda19d20 Fix description typo 2023-02-22 18:27:23 -05:00
HarryLeeIBM
776d4d9bb4 More refactoring 2023-02-22 15:19:25 -08:00
robot-clickhouse-ci-2
e3e5d83f96
Merge pull request #46745 from ClickHouse/Avogar-patch-3
Update docs about format table function
2023-02-22 23:40:10 +01:00
Igor Nikonov
271b72abf4
Merge pull request #46642 from ClickHouse/remove_redundant_sorting_fix
Fix: remove redundant sorting optimization
2023-02-22 23:33:15 +01:00
Anton Popov
d5864fa88e allow to fallback from async insert in case of large amount of data 2023-02-22 21:59:24 +00:00
Alexey Milovidov
5154b04cfb
Merge pull request #46732 from ClickHouse/fix-sonarcloud-job
Fix SonarCloud Job
2023-02-23 00:31:54 +03:00
Jiebin Sun
d6710d9b34 Align all the SSE4.1 requirement and use needle_size
Align all the SSE4.1 requirement from StringSearcher. Use needle_size
in while loop to make the code clean.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 16:15:26 -05:00
HarryLeeIBM
e979a86604 Minor change for adding constexpr 2023-02-22 12:50:46 -08:00
HarryLeeIBM
18b93fc212 More refactoring for better style 2023-02-22 12:41:00 -08:00
Robert Schulze
81bf43157f
Allow configuration of Kafka topics with periods
The Kafka table engine allows global configuration and per-Kafka-topic
configuration. The latter uses syntax <kafka_TOPIC>, e.g. for topic
"football":

  <kafka_football>
      <retry_backoff_ms>250</retry_backoff_ms>
      <fetch_min_bytes>100000</fetch_min_bytes>
  </kafka_football>

Some users had to find out the hard way that such configuration doesn't
take effect if the topic name contains a period, e.g. "sports.football".
The reason is that ClickHouse configuration framework already uses
periods as level separators to descend the configuration hierarchy.
(Besides that, per-topic configuration at the same level as global
configuration could be considered ugly.)

Note that Kafka topics may contain characters "a-zA-Z0-9._-" (*) and
a tree-like topic organization using periods is quite common in
practice.

This PR deprecates the existing per-topic configuration syntax (but
continues to support it for backward compat) and introduces a new
per-topic configuration syntax below the global Kafka configuration of
the form:

<kafka>
   <topic name="football">
       <retry_backoff_ms>250</retry_backoff_ms>
       <fetch_min_bytes>100000</fetch_min_bytes>
   </topic>
</kafka>

The period restriction doesn't apply to XML attributes, so <topic
name="sports.football"> will work. Also, everything Kafka-related is
below <kafka>.

Considered but rejected alternatives:
- Extending Poco ConfigurationView with custom separators (e.g."/"
  instead of "."). Won't work easily because ConfigurationView only
  builds a path but defers descending the configuration tree to the
  normal configuration classes.
- Reloading the configuration file in StorageKafka (instead of reading
  the loaded file) but with a custom separator. This mode is supported
  by XML configuration. Too ugly and error-prone since the true
  configuration is composed from multiple configuration files.

(*) https://stackoverflow.com/a/37067544
2023-02-22 20:35:09 +00:00
pufit
1157bf2d9f Removed unnecessary ifs 2023-02-22 15:15:09 -05:00
Yakov Olkhovskiy
fadbeb8ebd T64 codec support for IPv4 2023-02-22 19:25:48 +00:00
Yakov Olkhovskiy
99095446af review suggestions 2023-02-22 17:22:13 +00:00
Jiebin Sun
1f62135ba7 Make the optimized SIMD StringSearcher clean
This patch has revised the name of value and added comments to make
the SIMD StringSearcher clean and easy to understand based on pull
request 46289.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 12:18:21 -05:00
Jiebin Sun
d220e7f4fc Optimize the SIMD StringSearcher if needle_size is large
This patch offers an additional optimization when the needle_size is
large. If the needle_size is larger than the haystack_size, there is
no need to search any more.

The optimized SIMD StringSearcher has led at most 41.7% than Volnitsky
algorithm when the needle_size is less than 21, and fallen behind only
about 1% even when the needle_size is bigger than 50, which is not
considered as a common case.

Test platform: ICX server
Test query: SELECT COUNT(*) FROM hits WHERE URL LIKE '%{Needle}%';

Needle_size	opt/baseline
5		141.7%
6		129.4%
8		118.5%
9		112.3%
10		107.4%
14		103.4%
20		100.2%
21		100.7%
51		99.0%

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 11:58:17 -05:00
Jiebin Sun
f5a6a86dec Optimize the SIMD StringSearcher by searching first two chars
This patch offers the optimized SIMD StringSearcher by searching the first
and second chars together rather than only the first char, which will result
in big performance gain. The patch also provides a quick path when the needle
size is 1.

With this patch, I have tested the 43 queries in clickbench on ICX server.
Query 20 has got 35% performance gain. Other StringSearcher related queries
have got around 10% performance improvement. And the overall geomean of all
the queries has got 4.1% performance gain.

Signed-off-by: Jiebin Sun <jiebin.sun@intel.com>
2023-02-22 11:55:30 -05:00
Kruglov Pavel
0e7143070e
Update docs about format table function 2023-02-22 17:51:29 +01:00
Alexander Gololobov
67dcd9694c Remove unused MergeTreeReadTask::remove_prewhere_column 2023-02-22 17:49:22 +01:00
avogar
50caa3d66c Update docs 2023-02-22 16:41:49 +00:00
Julio Jimenez
d0bd8877ce
Merge branch 'master' into fix-sonarcloud-job 2023-02-22 11:41:24 -05:00
avogar
e0931dbdbe Enable input_format_json_ignore_unknown_keys_in_named_tuple by default 2023-02-22 16:40:53 +00:00
avogar
5af6ac534e Use smaller test file 2023-02-22 15:51:47 +00:00