hcz
8cae381428
Performance improvement
2019-05-08 16:20:25 +03:00
hcz
b721ae11f5
Fix const column detection
2019-05-08 16:20:25 +03:00
hcz
e1a236f55c
Update API
2019-05-08 16:20:25 +03:00
hcz
789b484108
Fix cmakelists
2019-05-08 16:20:25 +03:00
hcz
560246c0c3
Add Simdjson API
2019-05-08 16:20:25 +03:00
吴健
c687a85304
Merge branch 'bug-null-in' of github.com:janplus/ClickHouse into bug-null-in
2019-05-07 13:59:46 +08:00
吴健
84ee8cec93
Add function ignoreExceptNull to meet nullable functionIn's requirement for optimization of analyse.
2019-05-07 13:20:23 +08:00
Guillaume Tassery
3f8121d34f
format code style
2019-05-06 17:23:06 +07:00
Danila Kutenin
c280907f09
Zero initialization instead of memset
2019-05-06 11:34:28 +03:00
Danila Kutenin
512c7fa3a4
Comments
2019-05-06 00:40:37 +03:00
Danila Kutenin
8b235f7a4a
Comments
2019-05-06 00:37:58 +03:00
Danila Kutenin
6a6b30a56f
Less code with unique ptrs and some docs added
2019-05-06 00:35:08 +03:00
Danila Kutenin
f613c06c47
ngram vector_vector distance added
2019-05-05 23:48:46 +03:00
Danila Kutenin
ca4f098362
fix hyperscan to treat regular expressions as utf-8 expressions
2019-05-05 09:51:59 +03:00
Alexey Milovidov
c462b0fabb
Miscellaneous
2019-05-05 02:12:55 +03:00
alexey-milovidov
9e3c80d2ed
Merge pull request #5170 from yandex/cidr-to-range-better-code
...
Cidr to range better code
2019-05-02 19:29:36 +03:00
Alexey Milovidov
4dc0170d2c
CIDR functions: better code #5095
2019-05-02 16:04:24 +03:00
alexey-milovidov
478b721b36
Merge pull request #5167 from yandex/remove-useless-code-in-columns-description
...
Removed bad and useless code in ColumnsDescription
2019-05-02 02:16:36 +03:00
Alexey Milovidov
0381a6a1ec
Better code for CIDR to range functions #5095
2019-05-02 01:09:24 +03:00
Alexey Milovidov
3fa2784e7c
Removed unneeded file #5095
2019-05-02 01:00:56 +03:00
alexey-milovidov
2dccdb18eb
Merge pull request #5095 from PerformanceVision/CIDR_MASK
...
Functions for calculate lower and high range of an IP address using a CIDR mask
2019-05-02 01:00:29 +03:00
Alexey Milovidov
1be21e3cb6
Removed bad and useless code in ColumnsDescription
2019-05-02 00:43:05 +03:00
alexey-milovidov
35471f4234
Update in.cpp
2019-04-30 20:26:56 +03:00
alexey-milovidov
b88c981902
Update basename.cpp
2019-04-30 19:15:08 +03:00
alexey-milovidov
39b696f9df
Update basename.cpp
2019-04-30 19:14:53 +03:00
Guillaume Tassery
ee742f38d9
Formatting
2019-04-30 13:53:09 +07:00
Guillaume Tassery
9a5a0b2776
Formatting
2019-04-30 13:50:38 +07:00
吴健
acbeedad38
Fix null problem in FunctionIn
2019-04-30 11:50:30 +08:00
alexey-milovidov
109726d533
Update basename.cpp
2019-04-30 00:40:54 +03:00
Guillaume Tassery
95141804fb
Add a basename function
2019-04-29 16:40:44 +07:00
Guillaume Tassery
f21004ed7e
IPv4CIDRtoIPv4Range and IPv6CIDRtoIPv6Range return now an IP
2019-04-26 13:30:07 +07:00
Guillaume Tassery
7c4541b127
Merge branch 'master' into top_level_domain
2019-04-26 12:30:23 +07:00
Guillaume Tassery
40db4551f2
Add USE_GPERF on configuration file
2019-04-26 12:01:02 +07:00
Guillaume Tassery
2640a0716c
Add some pragma for ignoring some compilation flags for file generated by gperf
2019-04-24 17:09:46 +07:00
proller
ba4687e110
Cmake: dont use LINK_MODE (but use BUILD_SHARED_LIBS) ( #5091 )
2019-04-24 12:06:28 +03:00
Guillaume Tassery
68b0144178
Remove unused header and some typo in comment for IPv6 range function
2019-04-24 13:26:28 +07:00
Guillaume Tassery
efd780f20d
Apply fix for IPv6 range function when we use a column with more than one element
2019-04-24 12:19:36 +07:00
Guillaume Tassery
0e67d424da
Code reformating for ip range function
2019-04-24 11:18:17 +07:00
Guillaume Tassery
56393de68e
Handle UB case for CIDR IP range function and add documentation
2019-04-23 19:28:25 +07:00
Guillaume Tassery
8dee4fe5d1
Add function for get an ipv4 range using an ipv4 and a cidr mask
2019-04-23 14:02:23 +07:00
Guillaume Tassery
153c9d6455
Add function for get an IP range from a CIDR
2019-04-22 18:21:59 +07:00
Alexander Kozhikhov
fd07c97097
Merge branch 'master' of https://github.com/yandex/ClickHouse
2019-04-21 18:43:29 +03:00
Alexander Kozhikhov
daf4690d37
review fixes
2019-04-21 17:32:42 +03:00
Guillaume Tassery
bd9a583869
Adapt coding style top level domain lookup
2019-04-18 10:51:02 +07:00
Vitaly Baranov
7ef7a2d767
Disable ubsan error "outside the range of representable values" in bitSwapLastTwo.cpp
2019-04-17 19:14:31 +03:00
Guillaume Tassery
a904a4af7a
Remove useless file
2019-04-17 19:45:25 +07:00
Guillaume Tassery
65fc607c18
Modify tldlookup.gperf file for include only second level domain
2019-04-17 19:40:11 +07:00
Guillaume Tassery
1eccfacca5
Take into account when we have content after the dns on (cutTo)firstSignificantSubdomain function
2019-04-17 19:27:15 +07:00
Guillaume Tassery
4cc0ee677a
Use gperf perfect hash for checking with a better accurency the tld for firstSignificantSubdomain and cutToFirstSignificantSubdomain
2019-04-17 13:21:26 +07:00
Danila Kutenin
0abde66822
isValidUTF8 movemask is a bad choice, let's use testz
2019-04-16 15:49:22 +03:00
Alexander Kozhikhov
19021e76bb
style
2019-04-15 03:16:13 +03:00
Alexander Kozhikhov
12132b8fdf
fix in evalMLMethod
2019-04-15 02:57:14 +03:00
Alexey Milovidov
0db5ec9227
More tests
2019-04-12 03:52:24 +03:00
Alexey Milovidov
ceef92fa9e
Fixed inconsistent whitespaces
2019-04-12 03:37:07 +03:00
Alexey Milovidov
b4878dbffa
Added support for non-constant and negative size and length arguments for function substringUTF8
2019-04-12 03:34:11 +03:00
Alexey Milovidov
36d016d331
Added [[noreturn]] attributes for functions that throw unconditionally
2019-04-10 23:27:14 +03:00
Alexey Milovidov
5be06d8556
Simplified expression (suggested by PVS-Studio)
2019-04-10 23:05:25 +03:00
Alexey Milovidov
37e086f3bf
Merge branch 'master' of github.com:yandex/ClickHouse
2019-04-10 22:57:42 +03:00
Alexey Milovidov
b4efebc6ac
Fixed error found by PVS-Studio
2019-04-10 22:57:30 +03:00
Danila Kutenin
e9bef32618
License to valid utf8 function
2019-04-10 22:09:27 +03:00
Alexander Kozhikhov
c93aae6741
linear regression
2019-04-09 01:40:37 +03:00
Alexander Kozhikhov
a9ae6d0681
Merge branch 'master' of https://github.com/yandex/ClickHouse
2019-04-09 00:01:28 +03:00
Danila Kutenin
9dc6d8f0f5
Style
2019-04-08 00:23:15 +03:00
Danila Kutenin
3519598ae9
Accurate tail handling
2019-04-07 23:24:08 +03:00
Danila Kutenin
b603e51c6e
isValidUTF8 function added
2019-04-07 21:58:13 +03:00
Alexander Kozhikhov
f550304fa4
Merge branch 'master' of https://github.com/yandex/ClickHouse
2019-04-04 03:17:37 +03:00
Vitaly Baranov
842ce6cff8
Fix function cutQueryStringAndFragment().
2019-04-03 15:58:38 +03:00
Nikolai Kochetov
30d3496c36
Merge pull request #4871 from yandex/fix-array-intersect-impl
...
Fix array intersect
2019-04-02 17:05:01 +03:00
proller
45591c9d83
Build and test fixes ( #4873 )
2019-04-02 12:43:53 +03:00
Nikolai Kochetov
e8a8da1a4f
Fix array intersect.
2019-04-01 16:27:07 +03:00
Nikolai Kochetov
736e3c0f46
Merge pull request #4847 from fancyqlx/master
...
Fixed segment fault of arrayIntersect
2019-04-01 16:06:56 +03:00
Danila Kutenin
fca67c8820
Fix clang build
2019-04-01 08:05:52 +03:00
Danila Kutenin
3ce9686bc4
Some cosmetics
2019-04-01 00:04:45 +03:00
Danila Kutenin
8edb5c5b2d
Some cosmetics
2019-03-31 23:59:18 +03:00
Danila Kutenin
d509c226ab
Pool was not doing what I expected, great improvement of compilation
2019-03-31 23:55:36 +03:00
Danila Kutenin
f80fb09b81
Hyperscan better usage of scratch space and error handling
2019-03-31 23:19:22 +03:00
qianlixiang
bd6bc86ceb
Fixed style check errors
2019-03-31 13:03:08 +08:00
qianlixiang
32692fb0dd
Fixed style check errors
2019-03-31 12:52:24 +08:00
qianlixiang
dad023ea48
Fixed style check errors
2019-03-31 12:00:11 +08:00
qianlixiang
a1d2732d4c
Fixed segment fault of arrayIntersect and add a test
2019-03-31 11:44:05 +08:00
alexey-milovidov
cf492a4401
Merge pull request #4862 from danlark1/master
...
Optimize Volnitsky by inlining compare function
2019-03-31 02:24:33 +03:00
alexey-milovidov
254369324b
Miscellanous changes in ColumnArray ( #4857 )
...
* Miscellanous
* Miscellanous
* Miscellanous
2019-03-30 13:46:17 +03:00
Danila Kutenin
18d3813ff9
Optimize Volnitsky by inlining compare function
2019-03-30 12:39:45 +03:00
qianlixiang
bdb420cdfd
Fixed segment fault of arrayIntersect
2019-03-29 18:53:50 +08:00
alexey-milovidov
7aefd6663f
Merge pull request #4841 from danlark1/master
...
Added hyperscan fuzzy search
2019-03-29 12:39:25 +03:00
Danila Kutenin
cd850f6f98
Fix clang build
2019-03-29 10:04:44 +03:00
Danila Kutenin
b822923f5a
Better renamings of fuzzy string search
2019-03-29 04:27:17 +03:00
Danila Kutenin
ef0d622a5f
Added hyperscan fuzzy search
2019-03-29 04:02:05 +03:00
alexey-milovidov
e3c5da0a69
Merge pull request #4783 from yandex/cow-ptr-compositions
...
WIP. Attempt to make compositions with COWPtr more convenient
2019-03-29 02:09:39 +03:00
Danila Kutenin
d22d66a552
Fix exception message
2019-03-28 20:06:04 +03:00
Danila Kutenin
d4e67fa64c
Another bug fixed to the comparison
2019-03-28 19:58:16 +03:00
Danila Kutenin
70a7f4805d
Test to the restriction added, fix bug
2019-03-28 19:41:11 +03:00
Danila Kutenin
cae4a426c5
Hyperscan restriction
2019-03-28 17:59:25 +03:00
Alexey Milovidov
4417705f32
Merge branch 'master' into cow-ptr-compositions
2019-03-28 17:47:59 +03:00
Alexey Milovidov
faf9462c37
Added support for constant-constant case in multi-search functions
2019-03-27 22:25:58 +03:00
Alexey Milovidov
42dd521e45
Added a setting to disallow Hyperscan
2019-03-27 00:56:46 +03:00
proller
1584fd4367
Fix hyperscan library find and use ( #4798 )
2019-03-26 00:31:00 +03:00
Alexey Milovidov
fb19cf1eec
Attempt to make compositions with COWPtr more convenient
2019-03-25 04:43:54 +03:00
alexey-milovidov
06e79a7643
Merge pull request #4780 from danlark1/master
...
Hyperscan string search regular expression matching was added
2019-03-25 04:26:18 +03:00
Danila Kutenin
4b7824095e
Possible fixes to docker containers
2019-03-24 21:24:56 +03:00
Danila Kutenin
1c90d98d16
Clang fix
2019-03-24 02:56:52 +03:00
Danila Kutenin
725139f0f5
Renamings, fixes to search algorithms, more tests
2019-03-24 01:49:38 +03:00
Danila Kutenin
f8001b41ba
Hyperscan multi regular expressions search
2019-03-23 22:40:16 +03:00
Alexey Milovidov
847abfdbb1
Fixed bad code with arithmetic ops on aggregate function states
2019-03-23 05:35:50 +03:00
Alexey Milovidov
5a7237cc13
Miscellaneous
2019-03-23 04:27:08 +03:00
proller
d6d83d4a5c
Allow build without ssl ( #4525 ) ( #4750 )
2019-03-22 14:18:24 +03:00
alexey-milovidov
f7bdc657d3
Update hasAllAny.h
2019-03-15 02:12:30 +03:00
Alexey Milovidov
a459b2c298
Fixed insufficient type checking in hasAll, hasAny functions
2019-03-15 02:10:51 +03:00
proller
cf8001b6a9
Fix includes ( #4673 )
2019-03-13 16:58:04 +03:00
alexey-milovidov
617a0a8938
Merge pull request #4592 from yandex/fix-concurrent-alters2
...
Fix a race condition during concurrent ALTER COLUMN queries
2019-03-10 19:11:49 +03:00
alexey-milovidov
357e17959e
Merge pull request #4602 from danlark1/master
...
Rename trigramDistance to ngramDistance, add more functions with CaseInsensitive and UTF, update docs, more job done in perf, added some perf tests for string search that I would like to see
2019-03-09 21:19:59 +03:00
Alexey Zatelepin
3344955154
separate Read and Write lock holders, better method names
2019-03-07 21:10:21 +03:00
Alexey Zatelepin
ab025805a3
alter intention lock
2019-03-07 20:21:18 +03:00
Alexey Milovidov
3389cd1735
Merge branch 'master' into fix-function-replicate
2019-03-07 14:56:43 +03:00
proller
86d9876221
Merge remote-tracking branch 'upstream/master' into fix23
2019-03-06 22:58:47 +03:00
proller
ee15cb237c
Better fix
2019-03-06 22:19:21 +03:00
proller
9159d602f5
Fix bug in arrayEnumerateRanked
2019-03-06 20:19:26 +03:00
Alexey Milovidov
fb8e37f60f
Fix segfault in function "replicate"
2019-03-06 02:37:41 +03:00
Danila Kutenin
dcfd3fe37f
Comment in FunctionsStringSimilarity
2019-03-06 02:08:08 +03:00
Alexey Milovidov
5a644f0052
Miscellaneous
2019-03-06 02:05:47 +03:00
Alexey Milovidov
b5b3f52f0f
Added documentation
2019-03-06 01:43:18 +03:00
Danila Kutenin
952b5ea24a
Rename trigramDistance to ngramDistance, add more functions with CaseInsensitive and UTF, update docs, more job done in perf, added some perf tests for string search that I would like to see
2019-03-06 01:42:28 +03:00
Alexey Milovidov
224839a58a
Attempt to read the code #4475
2019-03-06 00:53:16 +03:00
proller
89014b5480
Build fixes ( #4591 )
2019-03-05 13:15:47 +03:00
Vitaly Baranov
61e21d50e1
Merge pull request #4207 from andyyzh/bitmap_feature
...
Added bitmap function feature with roaring bitmap
2019-03-04 22:42:47 +03:00
Alexey Milovidov
3033d329eb
Fixed error #4564
2019-03-04 18:50:23 +03:00
Alexey Milovidov
b29c24d3d9
Fixed error #4564
2019-03-04 18:13:33 +03:00
Alexey Milovidov
315b6f3878
Better string comparison (development)
2019-03-03 23:08:39 +03:00
Alexey Milovidov
865d54d9e3
Addition to prev. revision
2019-03-03 16:53:06 +03:00
Alexey Milovidov
d120fc788b
NaN comparisons are implementation specific - remove them from tests
2019-03-03 04:18:35 +03:00
Alexey Milovidov
bcf8cd6297
NaN comparisons are implementation specific - remove them from tests
2019-03-03 04:17:33 +03:00
Alexey Milovidov
0dc576e44f
Avoid performance degradation in few degenerate cases: added fast path for tautological compares #4405
2019-03-03 02:51:48 +03:00
Alexey Milovidov
8df4b8e270
Added comment #4409
2019-03-01 23:17:30 +03:00
Alexey Milovidov
3c2dd5257b
Fixed typo #4409
2019-03-01 23:16:56 +03:00
Alexey Milovidov
bc25bedbd4
Implemented function "flatten" #4409
2019-03-01 23:07:58 +03:00
Alexey Milovidov
9d0adf40d2
Merge branch 'array-flatten-function' of https://github.com/kzon/ClickHouse into kzon-array-flatten-function
2019-03-01 19:42:06 +03:00
alexey-milovidov
97cea95f74
Merge pull request #4536 from amosbird/fixedhashtable
...
A Proper lookup table that uses HashTable's API
2019-03-01 17:47:33 +03:00
Amos Bird
26ab5dd7a7
A Proper lookup table that uses HashTable's API
...
This is the first step of allowing heterogeneous cells in hash tables.
performance test results are
```
1. HashMap<UInt16, UInt8, TrivialHash, HashTableFixedGrower<16>>;
2. NewLookupMap<UInt16, UInt8>
ResolutionWidth 30000 1 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................223550276.46
ResolutionWidth 30000 2 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................248772721.24
Best: 2 - 24877272124
ResolutionWidth 100000 1 ..........................................................................................................................................................................................................................................................238498413.99
ResolutionWidth 100000 2 ..........................................................................................................................................................................................................................................................261808889.98
Best: 2 - 26180888998
ResolutionWidth 300000 1 ...................................................................................239307348.81
ResolutionWidth 300000 2 ...................................................................................257592761.30
Best: 2 - 25759276130
ResolutionWidth 1000000 1 .........................240144759.26
ResolutionWidth 1000000 2 .........................257093531.91
Best: 2 - 25709353191
ResolutionWidth 5000000 1 .....241573260.35
ResolutionWidth 5000000 2 .....259314162.79
Best: 2 - 25931416279
ResolutionDepth 30000 1 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................217108119.84
ResolutionDepth 30000 2 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................249459504.41
Best: 2 - 24945950441
ResolutionDepth 100000 1 ..........................................................................................................................................................................................................................................................229065162.17
ResolutionDepth 100000 2 ..........................................................................................................................................................................................................................................................253769105.64
Best: 2 - 25376910564
ResolutionDepth 300000 1 ...................................................................................233079225.18
ResolutionDepth 300000 2 ...................................................................................256316273.78
Best: 2 - 25631627378
ResolutionDepth 1000000 1 .........................234184633.51
ResolutionDepth 1000000 2 .........................261100491.57
Best: 2 - 26110049157
ResolutionDepth 5000000 1 .....233118795.66
ResolutionDepth 5000000 2 .....252436160.41
Best: 2 - 25243616041
```
2019-03-01 16:47:13 +08:00
Alexey Zatelepin
eee73dac42
Merge branch 'master' into fix-concurrent-alters
2019-02-28 19:19:21 +03:00
proller
aad2856eba
CLICKHOUSE-4179 : arrayEnumerateDenseRanked arrayEnumerateUniqRanked ( #4475 )
...
* Build fixes (clang 7.0.0+)
* better
* wip
* wip
* fix
* fix
* wip
* fix
* clean
* clean
* CLICKHOUSE-4179 WIP
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* cf
* wip
* wip
* wip
* wip
* cf
* wip
* wip# Please enter the commit message for your changes. Lines starting
* wip# Please enter the commit message for your changes. Lines starting
* cf
* wip# Please enter the commit message for your changes. Lines starting
* wip
* cf
* wip
* wip
* clean
* clean
* clean
* clean
* wip
* wip
* wip
* cf
* wip
* wip
* wip
* cf
* clean
* wip
* clean
* cf
* clean
* clean
* test
* clean
* wip
* cf
* wip
* clean
* clean
* wip
* wip
* clean
* cf
* clean
* Requested changes
* cf
* no convertToFullColumnIfConst
* clean
* clean
* clean
* fix
* Remove comment
2019-02-28 01:18:58 +03:00
Alexey Zatelepin
0d7ab9e3ea
allow acquiring read lock to a table out of order if it was already locked by the same query [#CLICKHOUSE-3789]
...
this is important in preventing ALTER deadlocks
2019-02-27 21:34:38 +03:00
proller
23d70b2e04
Add Y_IGNORE ( #4521 )
2019-02-27 19:52:08 +03:00
alexey-milovidov
bffe514efa
Merge pull request #4466 from danlark1/master
...
Function trigramDistance() added for string similarity search
2019-02-25 21:20:29 +03:00
Георгий Кондратьев
b94de32423
Support xxHash with external lz4
...
xxhash.h does not exist in external lz4 because it is an implementation detail
and its symbols are namespaced with XXH_NAMESPACE macro. When lz4 is external,
xxHash has to be external too, and the dependents have to link to it.
This find_xxhash.cmake is based on find_lz4.cmake.
2019-02-25 07:14:37 +00:00
Danila Kutenin
6c50a55777
Faster hash function and one comment removed
2019-02-23 21:26:09 +03:00
Danila Kutenin
6736dcc96b
A bit better performance, return old name to function
2019-02-22 22:43:47 +03:00
Danila Kutenin
9ea1e948b3
better style in distance()
2019-02-22 06:50:06 +03:00
Danila Kutenin
3149eac681
some style
2019-02-22 06:07:35 +03:00
Danila Kutenin
ada849263b
Optimize distance function, some allocation, hashmap tricks
2019-02-22 06:02:10 +03:00
Danila Kutenin
3f636b3c34
Remove redundant include
2019-02-21 08:16:55 +03:00