Commit Graph

343 Commits

Author SHA1 Message Date
Alexey Milovidov
17e2189199 Fix backward compatibility check 2022-11-21 12:51:51 +01:00
alesapin
2dbf07f58a Splitting checks 2022-11-18 17:38:17 +01:00
Igor Nikonov
b12ebab65c Enable keeper fault injection and retries for insert queries in
funcntional tests
2022-11-11 18:07:15 +00:00
Alexander Tokmakov
acbad99b24 remove abandonable_lock part 2 2022-11-02 14:28:49 +01:00
Alexey Milovidov
4f582e19a8
Merge pull request #42479 from ClickHouse/Avogar-patch-1
Add error "Destination table is myself" to exception list in BC check
2022-10-25 04:55:12 +02:00
Alexey Milovidov
c4fd827442
Merge pull request #42483 from ClickHouse/Avogar-patch-2
Get current clickhouse version without sending query in BC check
2022-10-25 04:54:24 +02:00
Azat Khuzhin
f0aed68893 ci/stress: fix memory limits overrides for hung check
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-10-22 20:53:18 +02:00
Alexander Tokmakov
f958c155f3
Merge branch 'master' into fixes_for_replicated_merge_tree2 2022-10-19 17:25:24 +03:00
Antonio Andelic
10b176fe19
Merge pull request #41854 from azat/tests/stress/fix-dbg-sym-bc
tests: fix debug symbols (and possible crashes) for backward compatiblity check
2022-10-19 16:18:41 +02:00
Kruglov Pavel
2ca0d37e83
Get current clickhouse version without sending query in BC check 2022-10-19 13:25:03 +02:00
Kruglov Pavel
632965212f
Add error "Destination table is myself" to exception list in BC check 2022-10-19 12:52:32 +02:00
Alexander Tokmakov
4211605093 disable zk fault injections in bc check 2022-10-03 13:13:29 +02:00
Alexander Tokmakov
8741785a72 temporarily disable ThreadFuzzer with tsan 2022-09-29 18:08:35 +02:00
Azat Khuzhin
9b20468198 tests: fix debug symbols (and possible crashes) for backward compatiblity check
stress running previous version of the server w/o correct debug symbols
right now, since nobody restore clickhouse.debug file, and this can lead
to the following issues, like in [1]:
- incorrect stack traces
- gdb crashes
- clickhouse crashes, due to non-robust internal DWARF parser (probably)

  [1]: https://s3.amazonaws.com/clickhouse-test-reports/41730/8cc53a48ae99a765085f44a75fa49314d1f1cc7d/stress_test__ubsan_.html

Right now I decided not to rework the script to make it less error
prone, but simply fix the problem.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-09-27 22:50:28 +02:00
alesapin
69f3a66538 Keep the most important log in stress tests 2022-09-27 11:16:10 +02:00
Antonio Andelic
97cf045203
Merge pull request #41721 from ClickHouse/collect-correctly-logs-in-stress-test
Collect logs in Stress test using clickhouse-local
2022-09-27 08:43:44 +02:00
Antonio Andelic
eb78761a7e Collect necessary 2022-09-26 16:30:01 +00:00
Antonio Andelic
6f4a636e8f Remove wildcard 2022-09-26 11:21:53 +00:00
Antonio Andelic
8fde8b2c56 Try with multiple calls 2022-09-26 11:03:24 +00:00
Antonio Andelic
c60d9db687
Merge branch 'master' into ignore-attach-thread-keeper-errors 2022-09-26 08:38:48 +02:00
Antonio Andelic
5ff1bcd553
Merge branch 'master' into collect-correctly-logs-in-stress-test 2022-09-26 08:38:38 +02:00
alesapin
06e0f554d8 Fix fetch to local disk 2022-09-23 16:46:53 +02:00
Antonio Andelic
1d93c56d1a Collect logs using clickhouse-local 2022-09-23 10:54:16 +00:00
Antonio Andelic
a17a3e1de1 Ignore Keeper hardware errors 2022-09-23 08:23:57 +00:00
kssenii
46f74aaba9 Update stress/run.sh 2022-09-12 20:10:35 +02:00
Alexander Tokmakov
e77b9e4d0c
Merge pull request #40775 from azat/ci/core-dumps-rework
Rework core collecting on CI (eliminate gcore usage)
2022-09-09 20:20:10 +03:00
Alexey Milovidov
7776512b04
Merge pull request #41002 from azat/ci/fix-oom-check
ci/stress: clear dmesg before run to fix "OOM in dmesg" check
2022-09-06 06:41:36 +03:00
Alexander Tokmakov
b264be3c63
Merge branch 'master' into zookeeper_client_fault_injection 2022-09-05 22:13:09 +03:00
Azat Khuzhin
2724b67537 ci/stress: clear dmesg before run to fix "OOM in dmesg" check
CI: https://s3.amazonaws.com/clickhouse-test-reports/40772/afa137ae2b6108e72c2d6e43556a04548afa2ea9/stress_test__ubsan_.html
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-09-05 15:51:36 +02:00
Azat Khuzhin
25e3bebd9d Rework core collecting on CI (eliminate gcore usage)
gcore is a gdb command, that internally uses gdb to dump the core.

However with proper configuration of limits (core_dump.size_limit) it
should not be required, althought some issues is possible:
- non standard kernel.core_pattern
- sanitizers

So yes, gcore is more "universal" (you don't need to configure any
`kernel_pattern`), but it is ad-hoc, and it has drawbacks -
**it does not work when gdb fails**. For example gdb may fail with
`Dwarf Error: DW_FORM_strx1 found in non-DWO CU` in case of DWARF-5 [1].

  [1]: https://github.com/ClickHouse/ClickHouse/pull/40772#issuecomment-1236331323.

Let's try to switch to more native way.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-09-04 22:07:16 +02:00
Alexander Tokmakov
8bdb589c2b Merge branch 'master' into zookeeper_client_fault_injection 2022-08-29 13:34:57 +02:00
alesapin
133ca01447 Merge branch 'master' into stress_s3 2022-08-29 11:25:28 +02:00
Azat Khuzhin
ebc61a36e0 tests/stress: improve OOM detection (add separate check by dmesg)
Right now if you will look at the OOM errors:
- OOM killer (or signal 9) in clickhouse-server.log
- Backward compatibility check: OOM messages in clickhouse-server.log

Most of them are not real, but just clickhouse server got KILLed by
clickhouse stop, #40678 may imporove the situation, but to definitely
sure that there was OOM let's look at dmesg.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-08-27 12:46:58 +02:00
Azat Khuzhin
3b519c5d44 tests/stress: capture stacktrace of server hungs if pid was removed already
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-08-26 22:06:32 +03:00
alesapin
704c4b2c5b Stop thread fuzzer on shutdown 2022-08-26 11:54:54 +02:00
alesapin
3ff6489fae Merge branch 'master' into stress_s3 2022-08-25 13:14:58 +02:00
alesapin
ad692f732a Merge branch 'master' into stress_s3 2022-08-25 13:13:30 +02:00
alesapin
35f9815b8e Fix backward comp check 2022-08-24 14:43:02 +02:00
Azat Khuzhin
50bddc43dc tests/stress: ignore NETLINK_ERROR from checkPermissionsImpl
Since now with --privileged it has CAP_SYS_ADMIN and tries to
communicate via netlink.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-08-19 14:07:06 +02:00
Alexander Tokmakov
3d253ec51b
Merge branch 'master' into zookeeper_client_fault_injection 2022-08-18 21:23:50 +03:00
alesapin
922818bbd9 Merge branch 'stress_s3' of github.com:ClickHouse/ClickHouse into stress_s3 2022-08-18 14:46:50 +02:00
alesapin
932ea146f5 Merge branch 'master' into stress_s3 2022-08-18 13:14:47 +02:00
alesapin
86b1e33eed Disable cache on writes 2022-08-17 19:00:53 +02:00
alesapin
600d22851f Grep dangerous S3 errors 2022-08-17 12:43:11 +02:00
alesapin
0433b801d2 Configure properly 2022-08-17 12:27:15 +02:00
Alexander Tokmakov
ae000e9125
Merge branch 'master' into zookeeper_client_fault_injection 2022-08-17 12:48:54 +03:00
alesapin
1ec6627a70 Fix tables creation 2022-08-16 18:28:17 +02:00
kssenii
eb26b219b9 Merge master 2022-08-16 00:56:27 +02:00
Alexander Tokmakov
589c3408d2
Merge pull request #40234 from ClickHouse/better_message_on_restore_covered
Better error message when restoring covered parts
2022-08-15 22:01:48 +03:00
alesapin
243bd492fa Trying to fix it 2022-08-15 20:55:11 +02:00