Commit Graph

52288 Commits

Author SHA1 Message Date
qianmoQ
95a203c8ca
Fixed a problem with the translation of the document
Fixed a problem with the translation of the document
2020-11-20 15:21:36 +08:00
alesapin
bd5d2d89ad
Merge pull request #17201 from ClickHouse/fix_read_in_order_test
Trying to make read_in_order_many_parts more stable
2020-11-20 10:10:17 +03:00
alesapin
253a5cf82a
Merge pull request #17208 from ClickHouse/less_verbose_logging
Less verbose logging when fetch is impossible
2020-11-20 10:09:19 +03:00
Kruglov Pavel
51bcd286f2
Merge pull request #16845 from azat/cutToFirstSignificantSubdomainWithWWW
Add cutToFirstSignificantSubdomainWithWWW()
2020-11-20 01:44:57 +03:00
Pavel Kruglov
4c30857759 Minor change 2020-11-20 01:22:40 +03:00
Azat Khuzhin
5297417050 Fix Merge(Distributed()) with JOIN 2020-11-20 00:07:30 +03:00
alexey-milovidov
4b7a7ca12a
Merge pull request #17210 from den-crane/patch-5
Doc. Now FROM FINAL is parallelized
2020-11-19 23:18:46 +03:00
a.palagashvili
1d09a67e9f empty commit for force run 2020-11-19 23:06:41 +03:00
Azat Khuzhin
35231662b3 Improve performance of AggregatingMergeTree w/ SimpleAggregateFunction(String)
While reading from AggregatingMergeTree with
SimpleAggregateFunction(String) in primary key and
optimize_aggregation_in_order perf top shows:

    Samples: 1M of event 'cycles', 4000 Hz, Event count (approx.): 287759760270 lost: 0/0 drop: 0/0
      Children      Self  Shared Object         Symbol
    +   12.64%    11.39%  clickhouse            [.] memcpy
    +    9.08%     0.23%  [unknown]             [.] 0000000000000000
    +    8.45%     8.40%  clickhouse            [.] ProfileEvents::increment    # <-- this, and in debug it has not 0.08x overhead, but 5.8x overhead
    +    7.68%     7.67%  clickhouse            [.] LZ4_compress_fast_extState
    +    5.29%     5.22%  clickhouse            [.] DB::IAggregateFunctionHelper<DB::AggregateFunctionNullUnary<true, true> >::addFree

The reason is obvious, ProfileEvents is atomic counters (and also they
are nested):

<details>

```
    Samples: 7M of event 'cycles', 4000 Hz, Event count (approx.): 450726149337
    ProfileEvents::increment  /usr/bin/clickhouse [Percent: local period]
    Percent│
           │
           │
           │    Disassembly of section .text:
           │
           │    00000000078d8900 <ProfileEvents::increment(unsigned long, unsigned long)@@Base>:
           │    ProfileEvents::increment(unsigned long, unsigned long):
      0.17 │      push  %rbp
      0.00 │      mov   %rsi,%rbp
      0.04 │      push  %rbx
      0.20 │      mov   %rdi,%rbx
      0.17 │      sub   $0x8,%rsp
      0.26 │    → callq DB::CurrentThread::getProfileEvents
           │    ProfileEvents::Counters::increment(unsigned long, unsigned long):
      0.00 │      lea   0x0(,%rbx,8),%rdi
      0.05 │      nop
           │    unsigned long std::__1::__cxx_atomic_fetch_add<unsigned long, unsigned long>(std::__1::__cxx_atomic_base_impl<unsigned long>*, unsigned long, std::__1::memory_order):
      1.02 │      mov   (%rax),%rdx
     97.04 │      lock  add   %rbp,(%rdx,%rdi,1)
           │    ProfileEvents::Counters::increment(unsigned long, unsigned long):
      0.21 │      mov   0x10(%rax),%rax
      0.04 │      test  %rax,%rax
      0.00 │    → jne   78d8920 <ProfileEvents::increment(unsigned long, unsigned long)@@Base+0x20>
           │    ProfileEvents::increment(unsigned long, unsigned long):
      0.38 │      add   $0x8,%rsp
      0.00 │      pop   %rbx
      0.04 │      pop   %rbp
      0.38 │    ← retq
```

</details>

These ProfileEvents was ArenaAllocChunks (it shows ~1.5M events per
second), and the reason is that the table has
SimpleAggregateFunction(String) in PK, which requires Arena.
But most of the time there Arena wasn't even used, so avoid this cost by
re-creating Arena only if it was "used" (i.e. has new chunks).

Another possibility is to avoid populating Arena::head in ctor, but this
will make the Arena code more complex, so for now this was preferred.

Also as a long-term solution it worth looking at implementing them via
RCU (to move the extra overhead out from the write code path into read
side).
2020-11-19 23:06:12 +03:00
Denny Crane
ceb42affdc
final_parallel 2020-11-19 15:56:00 -04:00
Denny Crane
1a467930a7
final_parallel 2020-11-19 15:52:52 -04:00
sevirov
75f04fac40
DOCSUP-4162: Document the system.replicated_fetches system table (#16900)
* Create replicated_fetches.md

Задокументировал системную таблицу system.replicated_fetches.

* Edit and translate replicated_fetches.md

Поправил английскую версию и перевел на русский язык.

* Update replicated_fetches.md

Внес поправки.

* Update replicated_fetches.md

Внес изменения в русскую версию.

Co-authored-by: Dmitriy <sevirov@yandex-team.ru>
2020-11-19 21:12:22 +03:00
alexey-milovidov
45c6483ea4
Merge pull request #16929 from otrazhenia/evgsudarikova-DOCSUP-2958
DOCSUP-2958: Documented the output_format_tsv_null_representation setting
2020-11-19 21:07:38 +03:00
alexey-milovidov
3610074a83
Update settings.md 2020-11-19 21:07:15 +03:00
alexey-milovidov
50a7bc2a8a
Update settings.md 2020-11-19 21:06:58 +03:00
Alexander Kuzmenkov
0d43e4c448 fixup 2020-11-19 20:16:45 +03:00
alesapin
3f01096c86 Less verbose logging when fetch is impossible 2020-11-19 20:03:20 +03:00
Alexander Kuzmenkov
5cd23e269d cleanup 2020-11-19 19:55:56 +03:00
alesapin
a15b8afa73 Don't add tons of client coverage files in stateful tests with coverage 2020-11-19 19:07:36 +03:00
Alexander Kuzmenkov
87a306ed23 Enable OpenTelemetry tracing in functional tests 2020-11-19 18:58:33 +03:00
vdimir
36544a45b7
Merge remote-tracking branch 'upstream/master' into ip-dict-no-trie 2020-11-19 18:56:24 +03:00
Alexander Kuzmenkov
6cb378e072 cleanup 2020-11-19 18:52:11 +03:00
alesapin
2623d35f68
Merge pull request #17120 from ClickHouse/fix_granularity_on_block_borders
Fix index granularity calculation on block borders
2020-11-19 18:36:07 +03:00
Alexander Kuzmenkov
7fa779c2ef Remove escaping from toString(std::string)
This is just confusing, I'd expect it to be an identity. It looks
especially weird when you do something like `toString(Field("don't escape
me"))`. Let's see which tests are going to fail.
2020-11-19 18:26:44 +03:00
Nico Piderman
f1b8906418
Minor grammar correction in error messages
Noticed this when I got an error loading some JSONEachRow data.
'while read the value of' -> 'while reading the value of'.
2020-11-19 15:44:58 +01:00
Mikhail Filimonov
e7266625e8
Update cctz to the latest master, update tzdb to 2020d. 2020-11-19 14:52:44 +01:00
Nicolae Vartolomei
b6a330de77 Allow running subset of tests in fasttest 2020-11-19 13:18:03 +00:00
Nicolae Vartolomei
746f8e45f5 All new parts must have uuids 2020-11-19 13:18:03 +00:00
Nicolae Vartolomei
425dc4b11b Add unique identifiers IMergeTreeDataPart structure
For now uuids are not generated at all, they are present only if the
part is updated manually (as you can see in the integration test).

The only place where they can be seen today by an end user is in
`system.parts` table. I was looking for hiding this column behind an
option but couldn't find an easy way to do that.

Likely this is also required for WAL, but need to think how not to break
compatibility.

Relates to #13574, https://github.com/ClickHouse/ClickHouse/issues/13574

Next 1: In the upcoming PR the plan is to integrate de-duplication based on
these fingerprints in the query pipeline.

Next 2: We'll enable automatic generation of uuids and come up with a
way for conditionally sending uuids when processing distributed queries
only when part movement is in progress.
2020-11-19 13:14:25 +00:00
Nikita Mikhaylov
0d78002a22
Merge pull request #17179 from sundy-li/update-rocksdb-version
bump up rocksdb version to  v6.14.5
2020-11-19 15:32:46 +03:00
Alexander Kuzmenkov
f326536ef0 fixup 2020-11-19 15:30:44 +03:00
Alexander Kuzmenkov
bcf15170ed Merge remote-tracking branch 'origin/master' into HEAD 2020-11-19 15:29:51 +03:00
Alexander Kuzmenkov
24293ccb30 Merge remote-tracking branch 'origin/master' into HEAD 2020-11-19 15:28:37 +03:00
Alexander Kuzmenkov
f2b4c15ab0
Merge pull request #17152 from ClickHouse/aku/json-array
Write rows as JSON array in JSONEachRow output format
2020-11-19 15:27:31 +03:00
alesapin
87f7e0b399 More compatible watches in TestKeeper 2020-11-19 15:04:09 +03:00
alesapin
cdceafdd89 Trying to make read_in_order_many_parts more stable 2020-11-19 13:25:39 +03:00
tavplubix
8b6fbea199
trigger CI 2020-11-19 12:47:24 +03:00
a.palagashvili
0c4acef558 added zstd to docker image 2020-11-19 12:23:05 +03:00
Nikolai Kochetov
8a9792cd30
Merge branch 'master' into pf2 2020-11-19 12:01:15 +03:00
robot-clickhouse
e88c2c701b Update version_date.tsv after release 20.6.10.2 2020-11-19 11:26:58 +03:00
Peng Jian
fd0705c93d add doc 2020-11-19 15:44:47 +08:00
Peng Jian
10cefe4f51 fix code style 2020-11-19 15:44:47 +08:00
Peng Jian
3c86c8b3c9 fix test cases 2020-11-19 15:44:47 +08:00
Peng Jian
091f7065cd fix build 2020-11-19 15:44:47 +08:00
Peng Jian
2df0c3e0b0 Add helper function for extracting zookeeper name and path. 2020-11-19 15:44:47 +08:00
Peng Jian
d8ae52118b add test 2020-11-19 15:44:47 +08:00
Peng Jian
a0683ce460 Support mulitple ZooKeeper clusters 2020-11-19 15:44:47 +08:00
alesapin
978e577b0f
Merge pull request #16424 from filimonov/execute_merges_on_single_replica_v2
execute_merges_on_single_replica
2020-11-19 10:15:52 +03:00
Ivan
b711331e2c
Update visibleWidth.cpp 2020-11-19 10:14:23 +03:00
zhang2014
9df9691ccb Try fix integration test 2020-11-19 13:01:23 +08:00