Commit Graph

14 Commits

Author SHA1 Message Date
Denny Crane
1e56f907a8
Update external-dicts-dict-layout.md
range_hashed description corrected
2021-02-09 14:47:11 -04:00
vdimir
fcfe7f2c5d Add option access_to_key_from_attributes to documentation for ip_trie 2020-12-24 19:13:49 +00:00
Denny Crane
dea3641e5c
Update external-dicts-dict-layout.md 2020-11-03 20:47:42 -04:00
Nikolai Kochetov
963ce315b2
Merge pull request #15454 from azat/dict-prealloc
Speedup hashed/sparse_hashed dictionary loading by preallocating the hash table
2020-10-15 15:03:53 +03:00
Alexey Milovidov
db4db42b65 Fix broken links in docs 2020-10-13 20:23:29 +03:00
Azat Khuzhin
064f901ea8 Add ability to preallocate hashtables for hashed/sparsehashed dictionaries
preallocation can be used only when we know number of rows, and for this
we need:
- source clickhouse
- no filtering (i.e. lack of <where>), since filtering can filter
  too much rows and eventually it may allocate memory that will
  never be used.

For sparse_hash the difference is quite significant, preallocated
sparse_hash hashtable allocates ~33% faster (7.5 seconds vs 5 seconds
for insert, and the difference is more significant for higher number of
elements):

    $ ninja bench-sparse_hash-run
    [1/1] cd /src/ch/hashtable-bench/.cmake && ...ch/hashtable-bench/.cmake/bench-sparse_hash
    sparse_hash/insert: 7.574 <!--
    sparse_hash/find  : 2.14426
    sparse_hash/maxrss: 174MiB
    sparse_hash/time:   9710.51 msec (user+sys)

    $ time ninja bench-sparse_hash-preallocate-run
    [1/1] cd /src/ch/hashtable-bench/.cmake && ...-bench/.cmake/bench-sparse_hash-preallocate
    sparse_hash/insert: 5.0522 <!--
    sparse_hash/find  : 2.14024
    sparse_hash/maxrss: 174MiB
    sparse_hash/time:   7192.06 msec (user+sys)

P.S. the difference for sparse_hashed dictionary with 4e9 elements
(uint64, uint16) is ~18% (4975.905 vs 4103.569 sec)

v2: do not reallocate the dictionary from the progress callback
    Since this will access hashtable in parallel.
v3: drop PREALLOCATE() and do this only for source=clickhouse and empty
    <where>
2020-10-09 22:28:14 +03:00
Ivan Blinkov
258d2fd499
[docs] split various kinds of CREATE queries into separate articles (#12328)
* normalize

* split & adjust links

* re-normalize

* adjust ru links

* adjust ja/tr links

* partially apply e0d19d2aea

* reset contribs
2020-07-09 18:10:35 +03:00
Nikita Mikhaylov
e55577ba65 Merge branch 'nikvas0/ssd_dict' of git://github.com/nikvas0/ClickHouse into merging-ssh-cache 2020-06-24 16:45:18 +03:00
Ivan Blinkov
7170f3c534
[docs] split aggregate function and system table references (#11742)
* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
2020-06-18 11:24:31 +03:00
Nikita Vasilev
e7324ec4d7 docs 2020-05-23 13:41:27 +03:00
Артем Стрельцов
6b12cfb8e4 documentation added 2020-05-13 19:05:05 +03:00
Artem Streltsov
5220d9a4db
small docs edit (#10731)
Co-authored-by: Артем Стрельцов <artemstreltsov@Air-2.local>
2020-05-07 23:00:30 +03:00
Nikolay Degterinsky
a3e557a5d2 documentation is added & added 'direct' layout into integragion test 2020-05-03 15:55:22 +03:00
Ivan Blinkov
d91c97d15d
[docs] replace underscores with hyphens (#10606)
* Replace underscores with hyphens

* remove temporary code

* fix style check

* fix collapse
2020-04-30 21:19:18 +03:00