ClickHouse/tests
Aleksei Semiglazov 921518db0a CLICKHOUSE-606: query deduplication based on parts' UUID
* add the query data deduplication excluding duplicated parts in MergeTree family engines.

query deduplication is based on parts' UUID which should be enabled first with merge_tree setting
assign_part_uuids=1

allow_experimental_query_deduplication setting is to enable part deduplication, default ot false.

data part UUID is a mechanism of giving a data part a unique identifier.
Having UUID and deduplication mechanism provides a potential of moving parts
between shards preserving data consistency on a read path:
duplicated UUIDs will cause root executor to retry query against on of the replica explicitly
asking to exclude encountered duplicated fingerprints during a distributed query execution.

NOTE: this implementation don't provide any knobs to lock part and hence its UUID. Any mutations/merge will
update part's UUID.

* add _part_uuid virtual column, allowing to use UUIDs in predicates.

Signed-off-by: Aleksei Semiglazov <asemiglazov@cloudflare.com>

address comments
2021-02-02 16:53:39 +00:00
..
ci Add more Fuzzer tasks 2021-01-17 21:14:57 +03:00
config ExecutableDictionarySource added implicit_key option 2021-02-01 12:35:47 +03:00
external_models/catboost
fuzz
instructions
integration CLICKHOUSE-606: query deduplication based on parts' UUID 2021-02-02 16:53:39 +00:00
perf_drafts
performance Fix double whitespace 2021-02-02 17:22:03 +03:00
queries Merge pull request #19960 from ClickHouse/json-extract-inaccurate-conversion-from-double-to-float 2021-02-02 15:15:37 +03:00
testflows Fixing reference to LDAP role mapping specification 2021-01-30 15:24:20 -05:00
users.d
.gitignore
clickhouse-client.xml
clickhouse-test Fix Python 2021-01-29 06:26:53 +03:00
clickhouse-test-server
client-test.xml
CMakeLists.txt
decimals_dictionary.xml
ints_dictionary.xml
msan_suppressions.txt Remove some suppressions 2021-01-22 21:06:40 +03:00
server-test.xml
stress
strings_dictionary.xml
tsan_suppressions.txt Merge branch 'master' into update_cassandra 2021-01-20 14:15:53 +03:00
ubsan_suppressions.txt Remove some suppressions 2021-01-22 21:06:40 +03:00
users.xml