Commit Graph

23 Commits

Author SHA1 Message Date
Evgeniia Sudarikova
00d963d40f minor changes 2021-05-13 20:26:03 +03:00
Evgeniia Sudarikova
0b66aa0e85 edited EN and RU 2021-05-13 20:05:58 +03:00
Azat Khuzhin
808d1a0215 Reimplement preallocate for hashed/sparse_hashed dictionaries
It was initially implemented in #15454, but was reverted in #21948 (due
to higher memory usage).

This implementation differs from the initial one: there is now a
separate attribute to enable preallocation. Before, it was done
automatically, but that has problems with duplicates in the source.

Plus, this implementation does not use dynamic_cast; instead it extends
the IDictionarySource interface.
2021-05-10 07:41:48 +03:00
Maksim Kita
6ae94fa25b
Update external-dicts-dict-layout.md 2021-04-23 11:40:05 +03:00
Olga Revyakina
fa7917e385 Updates 2021-04-21 19:36:59 +03:00
Olga Revyakina
cc63fe0af5 Start 2021-04-21 19:25:04 +03:00
Maksim Kita
bbe1960eea
Merge pull request #21573 from kitaisreal/updated-cache-dictionary-storage
Added specialized CacheDictionaryStorage
2021-03-25 21:29:01 +03:00
Maksim Kita
18ed9c5c67 SSDCacheDictionary remove max_stored_keys option 2021-03-18 12:55:17 +03:00
Alexey Milovidov
8c9bd09e6c Remove broken "Original article" 2021-03-14 15:19:04 +03:00
Denny Crane
1e56f907a8
Update external-dicts-dict-layout.md
range_hashed description corrected
2021-02-09 14:47:11 -04:00
vdimir
fcfe7f2c5d Add option access_to_key_from_attributes to documentation for ip_trie 2020-12-24 19:13:49 +00:00
Denny Crane
dea3641e5c
Update external-dicts-dict-layout.md 2020-11-03 20:47:42 -04:00
Nikolai Kochetov
963ce315b2
Merge pull request #15454 from azat/dict-prealloc
Speedup hashed/sparse_hashed dictionary loading by preallocating the hash table
2020-10-15 15:03:53 +03:00
Alexey Milovidov
db4db42b65 Fix broken links in docs 2020-10-13 20:23:29 +03:00
Azat Khuzhin
064f901ea8 Add ability to preallocate hashtables for hashed/sparsehashed dictionaries
preallocation can be used only when we know number of rows, and for this
we need:
- source clickhouse
- no filtering (i.e. lack of <where>), since filtering can filter
  too much rows and eventually it may allocate memory that will
  never be used.

For sparse_hash the difference is quite significant: a preallocated
sparse_hash hashtable takes ~33% less time on insert (7.5 seconds vs
5 seconds, and the difference is more significant for a higher number of
elements):

    $ ninja bench-sparse_hash-run
    [1/1] cd /src/ch/hashtable-bench/.cmake && ...ch/hashtable-bench/.cmake/bench-sparse_hash
    sparse_hash/insert: 7.574 <--
    sparse_hash/find  : 2.14426
    sparse_hash/maxrss: 174MiB
    sparse_hash/time:   9710.51 msec (user+sys)

    $ time ninja bench-sparse_hash-preallocate-run
    [1/1] cd /src/ch/hashtable-bench/.cmake && ...-bench/.cmake/bench-sparse_hash-preallocate
    sparse_hash/insert: 5.0522 <--
    sparse_hash/find  : 2.14024
    sparse_hash/maxrss: 174MiB
    sparse_hash/time:   7192.06 msec (user+sys)

P.S. the difference for sparse_hashed dictionary with 4e9 elements
(uint64, uint16) is ~18% (4975.905 vs 4103.569 sec)

v2: do not reallocate the dictionary from the progress callback,
    since this would access the hashtable in parallel.
v3: drop PREALLOCATE() and do this only for source=clickhouse and empty
    <where>
2020-10-09 22:28:14 +03:00
Ivan Blinkov
258d2fd499
[docs] split various kinds of CREATE queries into separate articles (#12328)
* normalize

* split & adjust links

* re-normalize

* adjust ru links

* adjust ja/tr links

* partially apply e0d19d2aea

* reset contribs
2020-07-09 18:10:35 +03:00
Nikita Mikhaylov
e55577ba65 Merge branch 'nikvas0/ssd_dict' of git://github.com/nikvas0/ClickHouse into merging-ssh-cache 2020-06-24 16:45:18 +03:00
Ivan Blinkov
7170f3c534
[docs] split aggregate function and system table references (#11742)
* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
2020-06-18 11:24:31 +03:00
Nikita Vasilev
e7324ec4d7 docs 2020-05-23 13:41:27 +03:00
Артем Стрельцов
6b12cfb8e4 documentation added 2020-05-13 19:05:05 +03:00
Artem Streltsov
5220d9a4db
small docs edit (#10731)
Co-authored-by: Артем Стрельцов <artemstreltsov@Air-2.local>
2020-05-07 23:00:30 +03:00
Nikolay Degterinsky
a3e557a5d2 documentation is added & added 'direct' layout into integration test 2020-05-03 15:55:22 +03:00
Ivan Blinkov
d91c97d15d
[docs] replace underscores with hyphens (#10606)
* Replace underscores with hyphens

* remove temporary code

* fix style check

* fix collapse
2020-04-30 21:19:18 +03:00