Commit Graph

1235 Commits

Author SHA1 Message Date
Kruglov Pavel
3c4e5df3a9
Merge branch 'master' into refactor-json-extract 2024-07-10 14:20:25 +02:00
Kruglov Pavel
6a9e2d9e82
Merge pull request #65980 from Avogar/add-missing-settings-in-schema-cache
Add missing settings in schema inference cache
2024-07-10 10:15:22 +00:00
avogar
a32795d116 Fix review comments 2024-07-10 10:07:02 +00:00
Kruglov Pavel
6baa52d101
Fix null insertion into dynamic column 2024-07-09 14:01:28 +02:00
Kruglov Pavel
f3bfe8d881
Merge branch 'master' into data-types-binary-encoding 2024-07-09 12:27:06 +02:00
avogar
f556f2cd95 Try to fix special build 2024-07-04 18:28:22 +00:00
Kruglov Pavel
41b9216dd1
Fix build 2024-07-04 00:22:41 +02:00
Kruglov Pavel
a5adf31b9e
Fix special build 2024-07-03 19:30:36 +02:00
avogar
6530ae104d Fix tests 2024-07-03 15:23:01 +00:00
avogar
d03fcb5ff1 Fix 2024-07-03 14:23:38 +00:00
avogar
63303dd798 Fix style 2024-07-03 14:03:04 +00:00
avogar
ea3b0e735d Refactor JSONExtract functions and support more types and reuse its code in new JSON type 2024-07-03 13:40:41 +00:00
kevinyhzou
9341bbe197 ORC read use writer time zone 2024-07-03 16:36:08 +08:00
avogar
02de441246 Remove unrelated files 2024-07-02 15:55:26 +00:00
avogar
a2f7d8e13f Add missing settings in schema inference cache 2024-07-02 10:42:12 +00:00
avogar
6573b5436e Add docs about dynamic binary format 2024-07-01 14:21:11 +00:00
avogar
22eb2fe006 Merge branch 'master' of github.com:ClickHouse/ClickHouse into data-types-binary-encoding 2024-07-01 10:19:31 +00:00
Michael Kolupaev
3229cb5874
Merge pull request #61750 from KevinyhZou/improve_json_each_row_ignore_key_case
Improve JSONEachRow reading by ignoring the keys case
2024-06-27 23:38:50 +00:00
Kruglov Pavel
6436bd9cf3
Merge branch 'master' into data-types-binary-encoding 2024-06-27 17:19:36 +02:00
Alexey Milovidov
0990a82089
Merge branch 'master' into improve_json_each_row_ignore_key_case 2024-06-26 03:28:32 +02:00
lgbo-ustc
d7b56e112e support writting page index into parquet file 2024-06-24 09:22:38 +08:00
avogar
785f3ac5c9 Revert unrelated changes 2024-06-21 18:40:49 +00:00
avogar
7e7dd78844 Revert unrelated changes 2024-06-21 18:31:41 +00:00
avogar
b9fbbbb284 Update settings changes history 2024-06-21 18:23:52 +00:00
avogar
3dfab50408 Merge branch 'master' of github.com:ClickHouse/ClickHouse into data-types-binary-encoding 2024-06-21 18:21:19 +00:00
avogar
b1f45da3eb Implement binary encoding for ClickHouse data types 2024-06-21 18:20:35 +00:00
Shaun Struwig
d8c8aebc1f
Merge branch 'master' into footer_column__names 2024-06-15 08:39:02 +02:00
Nikita Mikhaylov
e48ebf5529 Fixed build and tests 2024-06-14 01:43:47 +02:00
Nikita Mikhaylov
a98ec2e0e9 Update src/Formats/EscapingRuleUtils.cpp
Co-authored-by: Yakov Olkhovskiy <99031427+yakov-olkhovskiy@users.noreply.github.com>
2024-06-14 01:43:47 +02:00
Nikita Mikhaylov
04897eb0fa Done 2024-06-14 01:43:47 +02:00
Kruglov Pavel
085c406f1f
Merge pull request #64641 from azat/fix-float-inference
Fix type inference for float (in case of small buffer)
2024-06-12 11:38:54 +00:00
Blargian
09e59765c5 Add setting output_format_pretty_display_footer_column_names_min_rows 2024-06-12 09:45:36 +02:00
Blargian
37e03ef320 Modify pretty formats to display column names in the footer when row count is large 2024-06-12 07:52:50 +02:00
Blargian
5aa9389f85 Add failing test, setting and docuumentation 2024-06-11 15:13:36 +02:00
LiuNeng
0ca96559c2
Merge branch 'master' into adapting-parquet-block-size 2024-06-06 11:14:01 +08:00
Robert Schulze
ec3b82ba63
Merge pull request #64606 from rschu1ze/map-stuff
Double-checking #59318 and docs for `Map`
2024-06-05 07:56:29 +00:00
Azat Khuzhin
918d3849e1 Simplify logic for input_format_try_infer_integers
Now, when we can be sure that it is a float, parse it as a float, and
fallback to int/uint after.

But note, that this would break something if tryReadFloat() !=
tryReadIntText() + parsing of '.'/'e', but for now, it is true.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-03 12:09:47 +02:00
Amos Bird
b2d6610d5f
Support empty tuple. 2024-06-03 16:05:42 +08:00
Azat Khuzhin
5246c56a2a Fix type inference for float (in case of small buffer)
In case of small buffer (i.e. --max_read_buffer_size 1) the pos() will
be always point to this one byte, so, comparing pos() will be always
evaluated to true.

And we cannot use count() as well, since in case of big buffer it will
be the same, plus, in case of reading extra byte for checking for '.'
the count() will be different, but it does not mean that the byte had
been interpreted (and allowing 1 byte of difference will not work almost
always, since it will read max_read_buffer_size bytes).

So instead, expose the has_fractional flag from the read helpers for
float, via two new methods:
- tryReadFloatTextExt
- tryReadFloatTextExtNoExponent

Where "ext" stands for "extended", which means expose extra information.

v2: consider number as float if it has '.' or 'e' (previously only if it
has some signs after those two it had been considered as float)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-01 16:34:55 +02:00
Robert Schulze
8fc358f427
Merge remote-tracking branch 'rschu1ze/master' into map-stuff 2024-05-31 11:19:57 +00:00
Robert Schulze
b0c955e9c9
Various stuff 2024-05-29 20:51:48 +00:00
Robert Schulze
18d432f44c
Reapply "Remove some unnecessary UNREACHABLEs"
This reverts commit 5a868304c0.
2024-05-29 13:37:47 +00:00
liuneng
3bd3717d34 revert setting rename 2024-05-29 10:24:42 +08:00
liuneng
b4c2fa7e27 add test case 2024-05-28 15:17:08 +08:00
Alexander Tokmakov
5a868304c0
Revert "Remove some unnecessary UNREACHABLEs" 2024-05-27 11:38:22 +02:00
liuneng
b30d11f046 adapting parquet reader output block rows 2024-05-27 16:21:31 +08:00
Robert Schulze
7a552f5b06
Merge pull request #64035 from rschu1ze/unreachable-unreachable
Remove some unnecessary `UNREACHABLE`s
2024-05-26 20:37:17 +00:00
Michael Kolupaev
ee3e7f2fd0
Merge pull request #60361 from copperybean/gcmaster-parquet
A native parquet reader for primitive types
2024-05-24 04:50:12 +00:00
Kruglov Pavel
30dce7821c
Merge pull request #63058 from Avogar/dynamic-data-type
Implement Dynamic data type
2024-05-23 14:19:46 +00:00
Robert Schulze
f792a602da
Merge remote-tracking branch 'rschu1ze/master' into unreachable-unreachable 2024-05-22 21:08:27 +00:00