Commit Graph

144120 Commits

Author SHA1 Message Date
Alexey Milovidov
d561f50965
Merge pull request #64865 from ClickHouse/melvynator-patch-2
Added settings about lightweight deletes
2024-06-06 03:32:39 +02:00
Peignon Melvyn
33cd9c274c
Update settings.md 2024-06-06 03:03:19 +02:00
Alexey Milovidov
9ab06931d2
Merge pull request #64845 from ClickHouse/follow-up-s3-queue
Follow-up to #64349
2024-06-06 02:56:58 +02:00
Alexey Milovidov
891c869995
Merge pull request #64850 from ClickHouse/nikitamikhaylov-patch-1
Replace Markdown with YAML for ISSUE_TEMPLATE
2024-06-06 02:23:42 +02:00
Alexey Milovidov
4dac9dacc3
Merge pull request #64854 from xogoodnow/mlock-parameter
Added mlock_executable parameter
2024-06-06 02:21:40 +02:00
Alexey Milovidov
b8fe70ba3d
Merge pull request #64863 from xogoodnow/aspelling-dic-add
added mlock and mlockall to aspell-dict to be ignored
2024-06-06 02:21:24 +02:00
Alexey Milovidov
81c647e680
Merge pull request #64844 from CurtizJ/fix-memory-tracker
Fix untracked memory in `MemoryTrackerSwitcher`
2024-06-06 02:18:43 +02:00
Alexey Milovidov
dc9dbf1cb5
Merge pull request #64852 from xogoodnow/add-config-parameters
Added listen_try and  listen_reuse_port parameters
2024-06-06 02:17:38 +02:00
Alexey Milovidov
a8ec97e370
Merge pull request #57101 from ClickHouse/analyzer-fuzz-4
Analyzer fuzzer 4
2024-06-06 02:07:10 +02:00
Alexey Milovidov
9d8bc4d54d
Merge pull request #59427 from seandhaynes/master
Use scheduleOrThrow in MergeTree data selection and initialization to avoid deadlocks
2024-06-06 02:06:22 +02:00
xogoodnow
c55e79f283 Added words at the right place 2024-06-06 02:19:15 +03:30
xogoodnow
6ff11f54d0 added mlock and mlockall 2024-06-06 02:10:19 +03:30
Nikita Fomichev
e6bf468e56 stateless tests: add missing order by 2024-06-05 22:58:19 +02:00
Nikita Fomichev
103a448e71 stateless tests: complete parallel_parsing test 2024-06-05 22:58:19 +02:00
Nikita Fomichev
d59ccbdb50 stateless tests: Add test for unequal JOINs 2024-06-05 22:58:19 +02:00
Nikita Fomichev
54bc5d12ef Add randomization of new settings in clickhouse-test
prefer_external_sort_block_bytes
cross_join_min_rows_to_compress
cross_join_min_bytes_to_compress
max_parsing_threads
min_external_table_block_size_bytes
2024-06-05 22:57:10 +02:00
Nikita Fomichev
42097398b3 Do not remove trailing space in *.reference test files (by modern IDEs) by adding it to .editorconfig 2024-06-05 22:55:59 +02:00
Alexey Milovidov
8863736459 Fix style 2024-06-05 22:53:08 +02:00
Nikita Mikhaylov
e35d8c29ea
Update 10_question.yaml 2024-06-05 22:47:05 +02:00
Nikita Mikhaylov
0d50dd302b Bump 2024-06-05 22:25:27 +02:00
Nikita Mikhaylov
efbf40ad28
Merge pull request #64846 from rschu1ze/simplify-inverted-indexes
Simplify handling of old 'inverted' indexes
2024-06-05 20:04:16 +00:00
pufit
e7b7c3aebe Update query before replication 2024-06-05 16:00:08 -04:00
Nikita Mikhaylov
f42452d51e Add settings to changes history 2024-06-05 21:57:23 +02:00
Nikita Mikhaylov
106c1529ed Introduce an alias 2024-06-05 21:14:26 +02:00
wudidapaopao
c96800e96d
Merge branch 'ClickHouse:master' into fix_integer_accurate_cast 2024-06-06 03:06:09 +08:00
Nikita Mikhaylov
eb8520758a Done 2024-06-05 20:51:07 +02:00
Ali
007a8ca32e
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:45 +03:30
Ali
844a94383b
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:36 +03:30
Ali
8aea0c3fec
Omitted extra spaces
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 22:05:19 +03:30
alesapin
6ccd6ee77f
Merge pull request #64843 from ClickHouse/upload_blob_storage_log
Upload blob_storage_log from stateless tests
2024-06-05 18:26:03 +00:00
Ali
082efe2668
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:28 +03:30
Ali
0316fa701f
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:23 +03:30
Ali
927720a020
Update docs/en/operations/named-collections.md
Co-authored-by: Nikolay Degterinsky <43110995+evillique@users.noreply.github.com>
2024-06-05 21:40:13 +03:30
xogoodnow
46f14046e4 Added example on how to create named collection for kafka using ddl 2024-06-05 21:34:53 +03:30
Alexey Milovidov
bfb2cc3793
Update 02271_fix_column_matcher_and_column_transformer.sql 2024-06-05 19:42:11 +02:00
Azat Khuzhin
6cfd5b2165 Fix possible assertion when size of precommitted_parts <= precommitted_parts_need_rename
CI founds [1]:

    Logical error: 'precommitted_parts.size() >= precommitted_parts_need_rename.size()'

  [1]: https://s3.amazonaws.com/clickhouse-test-reports/61973/5c1e6a3e956917bdbb7eaa467934e5b75f17a923/stateless_tests__tsan__s3_storage__[5_5].html

The problem is that after precommitted_parts cleaned from detached parts
it may be less then precommitted_parts_need_rename, so to avoid this,
let's just copy it to a new container.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
6f522c1d61 Do not remove detached parts in Transaction::rollback
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
3675c27fe9 Require explicit rename of parts in transaction
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
ca2c720d0e Avoid race between cleanup thread and renameMergedTemporaryPart()
The problem was that with this patch set renameMergedTemporaryPart() is
called without temporary_directory_lock holded (in MergeTask), since it
is reseted just before calling renameMergedTemporaryPart(), and this can
be seen in logs:

    2024.03.29 19:56:42.126919 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Trace> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95) (MergerMutator): Merged 50 parts: [-8_0_0_0_2, -8_138_138_0] -> -8_0_138_2_2
    2024.03.29 19:56:42.127034 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Debug> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Committing part -8_0_138_2_2 to zookeeper
    2024.03.29 19:56:42.128462 [ 884 ] {} <Warning> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Removing temporary directory /var/lib/clickhouse/store/ea7/ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95/tmp_merge_-8_0_138_2_2/
    2024.03.29 19:56:42.128647 [ 1341 ] {ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95::-8_0_138_2_2} <Debug> test_btnct5cr.alter_table_0 (ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95): Part -8_0_138_2_2 committed to zookeeper
    ...
    2024.03.29 19:56:54.586084 [ 57841 ] {bf240267-0620-4294-afc1-479c58e6be89} <Error> executeQuery: std::exception. Code: 1001, type: std::__1::__fs::filesystem::filesystem_error, e.what() = filesystem error: in file_size: No such file or directory ["/var/lib/clickhouse/store/ea7/ea7a3fd2-cf47-4ec7-91a5-51c69fba1b95/-8_0_138_2_2/data.cmrk3"]

This should fix failures of 00993_system_parts_race_condition_drop_zookeeper in [1].

  [1]: https://s3.amazonaws.com/clickhouse-test-reports/61973/f6f826c85dd5b7bb8db16286fd10dcf441a440f7/stateless_tests__coverage__[4_6].html

Though now it looks hackish...

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
b41d08a2b6 Use renameParts() explicitly to avoid leaving parts in detached
Since there is an assertion that does not allows to remove detached
parts during cleanup, which sounds good in general, but breaks this new
code.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
ee546fa00a Fix replacing parts with empty
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
6c3db34aae Remove unused locked_parts from MergeTreeData::Transaction
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:24 +02:00
Azat Khuzhin
78088ce59a Reduce lock contention for MergeTree tables (by renaming parts without holding lock)
Under heavy load, or not so heavy but with fsync_part_directory=1,
time that renameTo() holds DataPartsLock will be increased, and this
will affect almost every operation with this table.

On one of production clusters I saw ~60 seconds with
fsync_part_directory=1.

Move the renameTo() out from the critical section.

v2: instead of using DataPartsLock.lock.lock()/unlock() move the renameTo() into MergeTreeData::Transaction::commit()
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:23 +02:00
Azat Khuzhin
66a2962cce Add reason into "Part {} is broken and need manual correction" message
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-05 19:38:23 +02:00
alesapin
966b08f192 Fix segfault for a very tricky case 2024-06-05 19:26:20 +02:00
Nikolai Kochetov
8bbf46ce2f Trying to fix a test. 2024-06-05 17:15:48 +00:00
xogoodnow
d99f8b4acc Added Named collections for accessing Kafka 2024-06-05 20:35:22 +03:30
Alexey Milovidov
0a67eb868e
Merge pull request #57098 from ClickHouse/analyzer-fuzz-2
Analyzer fuzzer 2
2024-06-05 16:52:41 +00:00
kssenii
5ec3699a19 Fix test 2024-06-05 18:36:13 +02:00
Sergei Trifonov
9c7a8727d0
Merge pull request #64244 from kirillgarbar/check_zk_path
Prevent conversion to Replicated if zookeeper path already exists
2024-06-05 16:36:10 +00:00