Commit Graph

143976 Commits

Author SHA1 Message Date
Nikita Mikhaylov
efbf40ad28
Merge pull request #64846 from rschu1ze/simplify-inverted-indexes
Simplify handling of old 'inverted' indexes
2024-06-05 20:04:16 +00:00
pufit
e7b7c3aebe Update query before replication 2024-06-05 16:00:08 -04:00
Nikita Mikhaylov
f42452d51e Add settings to changes history 2024-06-05 21:57:23 +02:00
Nikita Mikhaylov
106c1529ed Introduce an alias 2024-06-05 21:14:26 +02:00
wudidapaopao
c96800e96d
Merge branch 'ClickHouse:master' into fix_integer_accurate_cast 2024-06-06 03:06:09 +08:00
Nikita Mikhaylov
eb8520758a Done 2024-06-05 20:51:07 +02:00
Ali
007a8ca32e
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:45 +03:30
Ali
844a94383b
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:36 +03:30
Ali
8aea0c3fec
Omitted extra spaces
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:19 +03:30
alesapin
6ccd6ee77f
Merge pull request #64843 from ClickHouse/upload_blob_storage_log
Upload blob_storage_log from stateless tests
2024-06-05 18:26:03 +00:00
Ali
082efe2668
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:28 +03:30
Ali
0316fa701f
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:23 +03:30
Ali
927720a020
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:13 +03:30
xogoodnow
46f14046e4 Added example on how to create named collection for kafka using ddl 2024-06-05 21:34:53 +03:30
Alexey Milovidov
bfb2cc3793
Update 02271_fix_column_matcher_and_column_transformer.sql 2024-06-05 19:42:11 +02:00
Azat Khuzhin
6cfd5b2165 Fix possible assertion when size of precommitted_parts <= precommitted_parts_need_rename
CI founds [1]:

    Logical error: 'precommitted_parts.size() >= precommitted_parts_need_rename.size()'

  [1]: https://s3.amazonaws.com/clickhouse-test-reports/61973/5c1e6a3e956917bdbb7eaa467934e5b75f17a923/stateless_tests__tsan__s3_storage__[5_5].html

The problem is that after precommitted_parts cleaned from detached parts
it may be less then precommitted_parts_need_rename, so to avoid this,
let's just copy it to a new container.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
6f522c1d61 Do not remove detached parts in Transaction::rollback
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
3675c27fe9 Require explicit rename of parts in transaction
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
ca2c720d0e Avoid race between cleanup thread and renameMergedTemporaryPart()
The problem was that with this patch set renameMergedTemporaryPart() is
called without temporary_directory_lock holded (in MergeTask), since it
is reseted just before calling renameMergedTemporaryPart(), and this can
be seen in logs:

    2024.03.29 19:56:42.126919 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Trace> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95) (MergerMutator): Merged 50 parts: [-8_0_0_0_2, -8_138_138_0] -> -8_0_138_2_2
    2024.03.29 19:56:42.127034 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Debug> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Committing part -8_0_138_2_2 to zookeeper
    2024.03.29 19:56:42.128462 [ 884 ] {} <Warning> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Removing temporary directory /var/lib/clickhouse/store/ea7/ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95/tmp_merge_-8_0_138_2_2/
    2024.03.29 19:56:42.128647 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Debug> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Part -8_0_138_2_2 committed to zookeeper
    ...
    2024.03.29 19:56:54.586084 [ 57841 ] {bf240267-0620-4294-afc1-479c58e6be89} <Error> executeQuery: std::exception. Code: 1001, type: std::__1::__fs::filesystem::filesystem_error, e.what() = filesystem error: in file_size: No such file or directory ["/var/lib/clickhouse/store/ea7/ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95/-8_0_138_2_2/data.cmrk3"]

This should fix failures of 00993_system_parts_race_condition_drop_zookeeper in [1].

  [1]: https://s3.amazonaws.com/clickhouse-test-reports/61973/f6f826c85dd5b7bb8db16286fd10dcf441a440f7/stateless_tests__coverage__[4_6].html

Though now it looks hackish...

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
b41d08a2b6 Use renameParts() explicitly to avoid leaving parts in detached
Since there is an assertion that does not allows to remove detached
parts during cleanup, which sounds good in general, but breaks this new
code.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
ee546fa00a Fix replacing parts with empty
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
6c3db34aae Remove unused locked_parts from MergeTreeData::Transaction
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
78088ce59a Reduce lock contention for MergeTree tables (by renaming parts without holding lock)
Under heavy load, or not so heavy but with fsync_part_directory=1,
time that renameTo() holds DataPartsLock will be increased, and this
will affect almost every operation with this table.

On one of production clusters I saw ~60 seconds with
fsync_part_directory=1.

Move the renameTo() out from the critical section.

v2: instead of using DataPartsLock.lock.lock()/unlock() move the renameTo() into MergeTreeData::Transaction::commit()
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:23 +02:00
Azat Khuzhin
66a2962cce Add reason into "Part {} is broken and need manual correction" message
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:23 +02:00
alesapin
966b08f192 Fix segfault for a very tricky case 2024-06-05 19:26:20 +02:00
Nikolai Kochetov
8bbf46ce2f Trying to fix a test. 2024-06-05 17:15:48 +00:00
xogoodnow
d99f8b4acc Added Named collections for accessing Kafka 2024-06-05 20:35:22 +03:30
Alexey Milovidov
0a67eb868e
Merge pull request #57098 from ClickHouse/analyzer-fuzz-2
Analyzer fuzzer 2
2024-06-05 16:52:41 +00:00
kssenii
5ec3699a19 Fix test 2024-06-05 18:36:13 +02:00
Sergei Trifonov
9c7a8727d0
Merge pull request #64244 from kirillgarbar/check_zk_path
Prevent conversion to Replicated if zookeeper path already exists
2024-06-05 16:36:10 +00:00
Nikita Mikhaylov
b2144b45d0
Update 10_question.yaml 2024-06-05 18:00:03 +02:00
Alexey Milovidov
9324784ceb
Update settings.md 2024-06-05 17:54:44 +02:00
Nikolai Kochetov
ee5100171e Avoid duplicating names for new inputs of ActionsDAG::split result. 2024-06-05 15:50:27 +00:00
xogoodnow
5aec5ea8ef Added mlock_executable parameter 2024-06-05 19:15:00 +03:30
xogoodnow
3cd699fc80 Added listen_try and listen_reuse_port parameters 2024-06-05 18:48:45 +03:30
Nikita Mikhaylov
4b27d38fe6
Update 10_question.yaml 2024-06-05 17:13:30 +02:00
Nikita Mikhaylov
e29185f05e
Replace Markdown with YAML 2024-06-05 17:02:09 +02:00
vdimir
5251febf03
Merge pull request #64393 from ClickHouse/vdimir/blob_storage_log_fix_recursion
Prevent recursive logging in blob_storage_log
2024-06-05 14:28:01 +00:00
kssenii
98b7805697 Fix unused field 2024-06-05 16:19:17 +02:00
Mikhail Gorshkov
7ce67265c7 PR post-review fixes 2024-06-05 14:06:31 +00:00
Robert Schulze
8bdd291049
Simplify handling of old 'inverted' indexes 2024-06-05 13:27:07 +00:00
kssenii
2a30c77346 Fix compatibility 2024-06-05 15:21:51 +02:00
Mikhail Gorshkov
83901b82c9 Merge branch 'master' of github.com:ClickHouse/ClickHouse into ADQM-1892 2024-06-05 13:13:06 +00:00
Anton Popov
5e9a41bd9f fix untacked memory in MemoryTrackerSwitcher 2024-06-05 13:10:37 +00:00
alesapin
1c346d5c2e Bump 2024-06-05 15:04:48 +02:00
alesapin
59784a4cf2 Upload blob_storage_log from stateless tests 2024-06-05 15:04:15 +02:00
Han Fei
ac430bb01d
Merge pull request #59357 from hanfei1991/hanfei/stats_uniq
Introduce statistics of type "number of distinct values"
2024-06-05 12:56:52 +00:00
Azat Khuzhin
11905682a9 Check python code with flake8
Recently assert-on-tuple had been introduced in tests [1], let's prevent
this.

  [1]: https://github.com/ClickHouse/ClickHouse/pull/56367#discussion_r1437098533

v2: pin flake8 to 4.0.1 (instead of originally 6.1) due to other dependencies, hope that it will find such errors
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 14:46:38 +02:00
Azat Khuzhin
b2535d7f50 Fix invalid escape sequence warnings
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 14:46:38 +02:00
Azat Khuzhin
a474816fc7 Add missing botocore import into clickhouse_backupview.py
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 14:46:38 +02:00