Commit Graph

1738 Commits

Author SHA1 Message Date
hcz
8cae381428 Performance improvement 2019-05-08 16:20:25 +03:00
hcz
b721ae11f5 Fix const column detection 2019-05-08 16:20:25 +03:00
hcz
e1a236f55c Update API 2019-05-08 16:20:25 +03:00
hcz
789b484108 Fix cmakelists 2019-05-08 16:20:25 +03:00
hcz
560246c0c3 Add Simdjson API 2019-05-08 16:20:25 +03:00
吴健
c687a85304 Merge branch 'bug-null-in' of github.com:janplus/ClickHouse into bug-null-in 2019-05-07 13:59:46 +08:00
吴健
84ee8cec93 Add function ignoreExceptNull to meet nullable functionIn's requirement for optimization of analyse. 2019-05-07 13:20:23 +08:00
Guillaume Tassery
3f8121d34f format code style 2019-05-06 17:23:06 +07:00
Danila Kutenin
c280907f09 Zero initialization instead of memset 2019-05-06 11:34:28 +03:00
Danila Kutenin
512c7fa3a4 Comments 2019-05-06 00:40:37 +03:00
Danila Kutenin
8b235f7a4a Comments 2019-05-06 00:37:58 +03:00
Danila Kutenin
6a6b30a56f Less code with unique ptrs and some docs added 2019-05-06 00:35:08 +03:00
Danila Kutenin
f613c06c47 ngram vector_vector distance added 2019-05-05 23:48:46 +03:00
Danila Kutenin
ca4f098362 fix hyperscan to treat regular expressions as utf-8 expressions 2019-05-05 09:51:59 +03:00
Alexey Milovidov
c462b0fabb Miscellaneous 2019-05-05 02:12:55 +03:00
alexey-milovidov
9e3c80d2ed
Merge pull request #5170 from yandex/cidr-to-range-better-code
Cidr to range better code
2019-05-02 19:29:36 +03:00
Alexey Milovidov
4dc0170d2c CIDR functions: better code #5095 2019-05-02 16:04:24 +03:00
alexey-milovidov
478b721b36
Merge pull request #5167 from yandex/remove-useless-code-in-columns-description
Removed bad and useless code in ColumnsDescription
2019-05-02 02:16:36 +03:00
Alexey Milovidov
0381a6a1ec Better code for CIDR to range functions #5095 2019-05-02 01:09:24 +03:00
Alexey Milovidov
3fa2784e7c Removed unneeded file #5095 2019-05-02 01:00:56 +03:00
alexey-milovidov
2dccdb18eb
Merge pull request #5095 from PerformanceVision/CIDR_MASK
Functions for calculate lower and high range of an IP address using a CIDR mask
2019-05-02 01:00:29 +03:00
Alexey Milovidov
1be21e3cb6 Removed bad and useless code in ColumnsDescription 2019-05-02 00:43:05 +03:00
alexey-milovidov
35471f4234
Update in.cpp 2019-04-30 20:26:56 +03:00
alexey-milovidov
b88c981902
Update basename.cpp 2019-04-30 19:15:08 +03:00
alexey-milovidov
39b696f9df
Update basename.cpp 2019-04-30 19:14:53 +03:00
Guillaume Tassery
ee742f38d9 Formatting 2019-04-30 13:53:09 +07:00
Guillaume Tassery
9a5a0b2776 Formatting 2019-04-30 13:50:38 +07:00
吴健
acbeedad38 Fix null problem in FunctionIn 2019-04-30 11:50:30 +08:00
alexey-milovidov
109726d533
Update basename.cpp 2019-04-30 00:40:54 +03:00
Guillaume Tassery
95141804fb Add a basename function 2019-04-29 16:40:44 +07:00
Guillaume Tassery
f21004ed7e IPv4CIDRtoIPv4Range and IPv6CIDRtoIPv6Range return now an IP 2019-04-26 13:30:07 +07:00
Guillaume Tassery
7c4541b127
Merge branch 'master' into top_level_domain 2019-04-26 12:30:23 +07:00
Guillaume Tassery
40db4551f2 Add USE_GPERF on configuration file 2019-04-26 12:01:02 +07:00
Guillaume Tassery
2640a0716c Add some pragma for ignoring some compilation flags for file generated by gperf 2019-04-24 17:09:46 +07:00
proller
ba4687e110
Cmake: dont use LINK_MODE (but use BUILD_SHARED_LIBS) (#5091) 2019-04-24 12:06:28 +03:00
Guillaume Tassery
68b0144178 Remove unused header and some typo in comment for IPv6 range function 2019-04-24 13:26:28 +07:00
Guillaume Tassery
efd780f20d Apply fix for IPv6 range function when we use a column with more than one element 2019-04-24 12:19:36 +07:00
Guillaume Tassery
0e67d424da Code reformating for ip range function 2019-04-24 11:18:17 +07:00
Guillaume Tassery
56393de68e Handle UB case for CIDR IP range function and add documentation 2019-04-23 19:28:25 +07:00
Guillaume Tassery
8dee4fe5d1 Add function for get an ipv4 range using an ipv4 and a cidr mask 2019-04-23 14:02:23 +07:00
Guillaume Tassery
153c9d6455 Add function for get an IP range from a CIDR 2019-04-22 18:21:59 +07:00
Alexander Kozhikhov
fd07c97097 Merge branch 'master' of https://github.com/yandex/ClickHouse 2019-04-21 18:43:29 +03:00
Alexander Kozhikhov
daf4690d37 review fixes 2019-04-21 17:32:42 +03:00
Guillaume Tassery
bd9a583869 Adapt coding style top level domain lookup 2019-04-18 10:51:02 +07:00
Vitaly Baranov
7ef7a2d767 Disable ubsan error "outside the range of representable values" in bitSwapLastTwo.cpp 2019-04-17 19:14:31 +03:00
Guillaume Tassery
a904a4af7a Remove useless file 2019-04-17 19:45:25 +07:00
Guillaume Tassery
65fc607c18 Modify tldlookup.gperf file for include only second level domain 2019-04-17 19:40:11 +07:00
Guillaume Tassery
1eccfacca5 Take into account when we have content after the dns on (cutTo)firstSignificantSubdomain function 2019-04-17 19:27:15 +07:00
Guillaume Tassery
4cc0ee677a Use gperf perfect hash for checking with a better accurency the tld for firstSignificantSubdomain and cutToFirstSignificantSubdomain 2019-04-17 13:21:26 +07:00
Danila Kutenin
0abde66822 isValidUTF8 movemask is a bad choice, let's use testz 2019-04-16 15:49:22 +03:00
Alexander Kozhikhov
19021e76bb style 2019-04-15 03:16:13 +03:00
Alexander Kozhikhov
12132b8fdf fix in evalMLMethod 2019-04-15 02:57:14 +03:00
Alexey Milovidov
0db5ec9227 More tests 2019-04-12 03:52:24 +03:00
Alexey Milovidov
ceef92fa9e Fixed inconsistent whitespaces 2019-04-12 03:37:07 +03:00
Alexey Milovidov
b4878dbffa Added support for non-constant and negative size and length arguments for function substringUTF8 2019-04-12 03:34:11 +03:00
Alexey Milovidov
36d016d331 Added [[noreturn]] attributes for functions that throw unconditionally 2019-04-10 23:27:14 +03:00
Alexey Milovidov
5be06d8556 Simplified expression (suggested by PVS-Studio) 2019-04-10 23:05:25 +03:00
Alexey Milovidov
37e086f3bf Merge branch 'master' of github.com:yandex/ClickHouse 2019-04-10 22:57:42 +03:00
Alexey Milovidov
b4efebc6ac Fixed error found by PVS-Studio 2019-04-10 22:57:30 +03:00
Danila Kutenin
e9bef32618 License to valid utf8 function 2019-04-10 22:09:27 +03:00
Alexander Kozhikhov
c93aae6741 linear regression 2019-04-09 01:40:37 +03:00
Alexander Kozhikhov
a9ae6d0681 Merge branch 'master' of https://github.com/yandex/ClickHouse 2019-04-09 00:01:28 +03:00
Danila Kutenin
9dc6d8f0f5 Style 2019-04-08 00:23:15 +03:00
Danila Kutenin
3519598ae9 Accurate tail handling 2019-04-07 23:24:08 +03:00
Danila Kutenin
b603e51c6e isValidUTF8 function added 2019-04-07 21:58:13 +03:00
Alexander Kozhikhov
f550304fa4 Merge branch 'master' of https://github.com/yandex/ClickHouse 2019-04-04 03:17:37 +03:00
Vitaly Baranov
842ce6cff8 Fix function cutQueryStringAndFragment(). 2019-04-03 15:58:38 +03:00
Nikolai Kochetov
30d3496c36
Merge pull request #4871 from yandex/fix-array-intersect-impl
Fix array intersect
2019-04-02 17:05:01 +03:00
proller
45591c9d83
Build and test fixes (#4873) 2019-04-02 12:43:53 +03:00
Nikolai Kochetov
e8a8da1a4f Fix array intersect. 2019-04-01 16:27:07 +03:00
Nikolai Kochetov
736e3c0f46
Merge pull request #4847 from fancyqlx/master
Fixed segment fault of arrayIntersect
2019-04-01 16:06:56 +03:00
Danila Kutenin
fca67c8820 Fix clang build 2019-04-01 08:05:52 +03:00
Danila Kutenin
3ce9686bc4 Some cosmetics 2019-04-01 00:04:45 +03:00
Danila Kutenin
8edb5c5b2d Some cosmetics 2019-03-31 23:59:18 +03:00
Danila Kutenin
d509c226ab Pool was not doing what I expected, great improvement of compilation 2019-03-31 23:55:36 +03:00
Danila Kutenin
f80fb09b81 Hyperscan better usage of scratch space and error handling 2019-03-31 23:19:22 +03:00
qianlixiang
bd6bc86ceb Fixed style check errors 2019-03-31 13:03:08 +08:00
qianlixiang
32692fb0dd Fixed style check errors 2019-03-31 12:52:24 +08:00
qianlixiang
dad023ea48 Fixed style check errors 2019-03-31 12:00:11 +08:00
qianlixiang
a1d2732d4c Fixed segment fault of arrayIntersect and add a test 2019-03-31 11:44:05 +08:00
alexey-milovidov
cf492a4401
Merge pull request #4862 from danlark1/master
Optimize Volnitsky by inlining compare function
2019-03-31 02:24:33 +03:00
alexey-milovidov
254369324b
Miscellanous changes in ColumnArray (#4857)
* Miscellanous

* Miscellanous

* Miscellanous
2019-03-30 13:46:17 +03:00
Danila Kutenin
18d3813ff9 Optimize Volnitsky by inlining compare function 2019-03-30 12:39:45 +03:00
qianlixiang
bdb420cdfd Fixed segment fault of arrayIntersect 2019-03-29 18:53:50 +08:00
alexey-milovidov
7aefd6663f
Merge pull request #4841 from danlark1/master
Added hyperscan fuzzy search
2019-03-29 12:39:25 +03:00
Danila Kutenin
cd850f6f98 Fix clang build 2019-03-29 10:04:44 +03:00
Danila Kutenin
b822923f5a Better renamings of fuzzy string search 2019-03-29 04:27:17 +03:00
Danila Kutenin
ef0d622a5f Added hyperscan fuzzy search 2019-03-29 04:02:05 +03:00
alexey-milovidov
e3c5da0a69
Merge pull request #4783 from yandex/cow-ptr-compositions
WIP. Attempt to make compositions with COWPtr more convenient
2019-03-29 02:09:39 +03:00
Danila Kutenin
d22d66a552 Fix exception message 2019-03-28 20:06:04 +03:00
Danila Kutenin
d4e67fa64c Another bug fixed to the comparison 2019-03-28 19:58:16 +03:00
Danila Kutenin
70a7f4805d Test to the restriction added, fix bug 2019-03-28 19:41:11 +03:00
Danila Kutenin
cae4a426c5 Hyperscan restriction 2019-03-28 17:59:25 +03:00
Alexey Milovidov
4417705f32 Merge branch 'master' into cow-ptr-compositions 2019-03-28 17:47:59 +03:00
Alexey Milovidov
faf9462c37 Added support for constant-constant case in multi-search functions 2019-03-27 22:25:58 +03:00
Alexey Milovidov
42dd521e45 Added a setting to disallow Hyperscan 2019-03-27 00:56:46 +03:00
proller
1584fd4367
Fix hyperscan library find and use (#4798) 2019-03-26 00:31:00 +03:00
Alexey Milovidov
fb19cf1eec Attempt to make compositions with COWPtr more convenient 2019-03-25 04:43:54 +03:00
alexey-milovidov
06e79a7643
Merge pull request #4780 from danlark1/master
Hyperscan string search regular expression matching was added
2019-03-25 04:26:18 +03:00
Danila Kutenin
4b7824095e Possible fixes to docker containers 2019-03-24 21:24:56 +03:00
Danila Kutenin
1c90d98d16 Clang fix 2019-03-24 02:56:52 +03:00
Danila Kutenin
725139f0f5 Renamings, fixes to search algorithms, more tests 2019-03-24 01:49:38 +03:00
Danila Kutenin
f8001b41ba Hyperscan multi regular expressions search 2019-03-23 22:40:16 +03:00
Alexey Milovidov
847abfdbb1 Fixed bad code with arithmetic ops on aggregate function states 2019-03-23 05:35:50 +03:00
Alexey Milovidov
5a7237cc13 Miscellaneous 2019-03-23 04:27:08 +03:00
proller
d6d83d4a5c
Allow build without ssl (#4525) (#4750) 2019-03-22 14:18:24 +03:00
alexey-milovidov
f7bdc657d3
Update hasAllAny.h 2019-03-15 02:12:30 +03:00
Alexey Milovidov
a459b2c298 Fixed insufficient type checking in hasAll, hasAny functions 2019-03-15 02:10:51 +03:00
proller
cf8001b6a9
Fix includes (#4673) 2019-03-13 16:58:04 +03:00
alexey-milovidov
617a0a8938
Merge pull request #4592 from yandex/fix-concurrent-alters2
Fix a race condition during concurrent ALTER COLUMN queries
2019-03-10 19:11:49 +03:00
alexey-milovidov
357e17959e
Merge pull request #4602 from danlark1/master
Rename trigramDistance to ngramDistance, add more functions with CaseInsensitive and UTF, update docs, more job done in perf, added some perf tests for string search that I would like to see
2019-03-09 21:19:59 +03:00
Alexey Zatelepin
3344955154 separate Read and Write lock holders, better method names 2019-03-07 21:10:21 +03:00
Alexey Zatelepin
ab025805a3 alter intention lock 2019-03-07 20:21:18 +03:00
Alexey Milovidov
3389cd1735 Merge branch 'master' into fix-function-replicate 2019-03-07 14:56:43 +03:00
proller
86d9876221 Merge remote-tracking branch 'upstream/master' into fix23 2019-03-06 22:58:47 +03:00
proller
ee15cb237c Better fix 2019-03-06 22:19:21 +03:00
proller
9159d602f5 Fix bug in arrayEnumerateRanked 2019-03-06 20:19:26 +03:00
Alexey Milovidov
fb8e37f60f Fix segfault in function "replicate" 2019-03-06 02:37:41 +03:00
Danila Kutenin
dcfd3fe37f Comment in FunctionsStringSimilarity 2019-03-06 02:08:08 +03:00
Alexey Milovidov
5a644f0052 Miscellaneous 2019-03-06 02:05:47 +03:00
Alexey Milovidov
b5b3f52f0f Added documentation 2019-03-06 01:43:18 +03:00
Danila Kutenin
952b5ea24a Rename trigramDistance to ngramDistance, add more functions with CaseInsensitive and UTF, update docs, more job done in perf, added some perf tests for string search that I would like to see 2019-03-06 01:42:28 +03:00
Alexey Milovidov
224839a58a Attempt to read the code #4475 2019-03-06 00:53:16 +03:00
proller
89014b5480
Build fixes (#4591) 2019-03-05 13:15:47 +03:00
Vitaly Baranov
61e21d50e1
Merge pull request #4207 from andyyzh/bitmap_feature
Added bitmap function feature with roaring bitmap
2019-03-04 22:42:47 +03:00
Alexey Milovidov
3033d329eb Fixed error #4564 2019-03-04 18:50:23 +03:00
Alexey Milovidov
b29c24d3d9 Fixed error #4564 2019-03-04 18:13:33 +03:00
Alexey Milovidov
315b6f3878 Better string comparison (development) 2019-03-03 23:08:39 +03:00
Alexey Milovidov
865d54d9e3 Addition to prev. revision 2019-03-03 16:53:06 +03:00
Alexey Milovidov
d120fc788b NaN comparisons are implementation specific - remove them from tests 2019-03-03 04:18:35 +03:00
Alexey Milovidov
bcf8cd6297 NaN comparisons are implementation specific - remove them from tests 2019-03-03 04:17:33 +03:00
Alexey Milovidov
0dc576e44f Avoid performance degradation in few degenerate cases: added fast path for tautological compares #4405 2019-03-03 02:51:48 +03:00
Alexey Milovidov
8df4b8e270 Added comment #4409 2019-03-01 23:17:30 +03:00
Alexey Milovidov
3c2dd5257b Fixed typo #4409 2019-03-01 23:16:56 +03:00
Alexey Milovidov
bc25bedbd4 Implemented function "flatten" #4409 2019-03-01 23:07:58 +03:00
Alexey Milovidov
9d0adf40d2 Merge branch 'array-flatten-function' of https://github.com/kzon/ClickHouse into kzon-array-flatten-function 2019-03-01 19:42:06 +03:00
alexey-milovidov
97cea95f74
Merge pull request #4536 from amosbird/fixedhashtable
A Proper lookup table that uses HashTable's API
2019-03-01 17:47:33 +03:00
Amos Bird
26ab5dd7a7 A Proper lookup table that uses HashTable's API
This is the first step of allowing heterogeneous cells in hash tables.

performance test results are

```

1. HashMap<UInt16, UInt8, TrivialHash, HashTableFixedGrower<16>>;
2. NewLookupMap<UInt16, UInt8>

ResolutionWidth 30000 1 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................223550276.46
ResolutionWidth 30000 2 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................248772721.24
Best: 2 - 24877272124

ResolutionWidth 100000 1 ..........................................................................................................................................................................................................................................................238498413.99
ResolutionWidth 100000 2 ..........................................................................................................................................................................................................................................................261808889.98
Best: 2 - 26180888998

ResolutionWidth 300000 1 ...................................................................................239307348.81
ResolutionWidth 300000 2 ...................................................................................257592761.30
Best: 2 - 25759276130

ResolutionWidth 1000000 1 .........................240144759.26
ResolutionWidth 1000000 2 .........................257093531.91
Best: 2 - 25709353191

ResolutionWidth 5000000 1 .....241573260.35
ResolutionWidth 5000000 2 .....259314162.79
Best: 2 - 25931416279

ResolutionDepth 30000 1 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................217108119.84
ResolutionDepth 30000 2 .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................249459504.41
Best: 2 - 24945950441

ResolutionDepth 100000 1 ..........................................................................................................................................................................................................................................................229065162.17
ResolutionDepth 100000 2 ..........................................................................................................................................................................................................................................................253769105.64
Best: 2 - 25376910564

ResolutionDepth 300000 1 ...................................................................................233079225.18
ResolutionDepth 300000 2 ...................................................................................256316273.78
Best: 2 - 25631627378

ResolutionDepth 1000000 1 .........................234184633.51
ResolutionDepth 1000000 2 .........................261100491.57
Best: 2 - 26110049157

ResolutionDepth 5000000 1 .....233118795.66
ResolutionDepth 5000000 2 .....252436160.41
Best: 2 - 25243616041

```
2019-03-01 16:47:13 +08:00
Alexey Zatelepin
eee73dac42 Merge branch 'master' into fix-concurrent-alters 2019-02-28 19:19:21 +03:00
proller
aad2856eba CLICKHOUSE-4179 : arrayEnumerateDenseRanked arrayEnumerateUniqRanked (#4475)
* Build fixes (clang 7.0.0+)

* better

* wip

* wip

* fix

* fix

* wip

* fix

* clean

* clean

* CLICKHOUSE-4179 WIP

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

*    wip

* wip

* wip

* wip

* wip

* cf

*    wip

* wip

* wip

* wip

* cf

* wip

* wip# Please enter the commit message for your changes. Lines starting

* wip# Please enter the commit message for your changes. Lines starting

* cf

* wip# Please enter the commit message for your changes. Lines starting

* wip

* cf

* wip

* wip

* clean

* clean

* clean

* clean

* wip

* wip

* wip

* cf

* wip

* wip

* wip

* cf

* clean

* wip

* clean

* cf

* clean

* clean

* test

* clean

* wip

* cf

* wip

* clean

* clean

* wip

* wip

* clean

* cf

* clean

* Requested changes

* cf

* no convertToFullColumnIfConst

* clean

* clean

* clean

* fix

* Remove comment
2019-02-28 01:18:58 +03:00
Alexey Zatelepin
0d7ab9e3ea allow acquiring read lock to a table out of order if it was already locked by the same query [#CLICKHOUSE-3789]
this is important in preventing ALTER deadlocks
2019-02-27 21:34:38 +03:00
proller
23d70b2e04
Add Y_IGNORE (#4521) 2019-02-27 19:52:08 +03:00
alexey-milovidov
bffe514efa
Merge pull request #4466 from danlark1/master
Function trigramDistance() added for string similarity search
2019-02-25 21:20:29 +03:00
Георгий Кондратьев
b94de32423 Support xxHash with external lz4
xxhash.h does not exist in external lz4 because it is an implementation detail
and its symbols are namespaced with XXH_NAMESPACE macro.  When lz4 is external,
xxHash has to be external too, and the dependents have to link to it.

This find_xxhash.cmake is based on find_lz4.cmake.
2019-02-25 07:14:37 +00:00
Danila Kutenin
6c50a55777 Faster hash function and one comment removed 2019-02-23 21:26:09 +03:00
Danila Kutenin
6736dcc96b A bit better performance, return old name to function 2019-02-22 22:43:47 +03:00
Danila Kutenin
9ea1e948b3 better style in distance() 2019-02-22 06:50:06 +03:00
Danila Kutenin
3149eac681 some style 2019-02-22 06:07:35 +03:00
Danila Kutenin
ada849263b Optimize distance function, some allocation, hashmap tricks 2019-02-22 06:02:10 +03:00
Danila Kutenin
3f636b3c34 Remove redundant include 2019-02-21 08:16:55 +03:00