Commit Graph

1219 Commits

Author SHA1 Message Date
avogar
f556f2cd95 Try to fix special build 2024-07-04 18:28:22 +00:00
Kruglov Pavel
41b9216dd1
Fix build 2024-07-04 00:22:41 +02:00
Kruglov Pavel
a5adf31b9e
Fix special build 2024-07-03 19:30:36 +02:00
avogar
6530ae104d Fix tests 2024-07-03 15:23:01 +00:00
avogar
d03fcb5ff1 Fix 2024-07-03 14:23:38 +00:00
avogar
63303dd798 Fix style 2024-07-03 14:03:04 +00:00
avogar
ea3b0e735d Refactor JSONExtract functions and support more types and reuse its code in new JSON type 2024-07-03 13:40:41 +00:00
Michael Kolupaev
3229cb5874
Merge pull request #61750 from KevinyhZou/improve_json_each_row_ignore_key_case
Improve JSONEachRow reading by ignoring the keys case
2024-06-27 23:38:50 +00:00
Alexey Milovidov
0990a82089
Merge branch 'master' into improve_json_each_row_ignore_key_case 2024-06-26 03:28:32 +02:00
lgbo-ustc
d7b56e112e support writting page index into parquet file 2024-06-24 09:22:38 +08:00
Shaun Struwig
d8c8aebc1f
Merge branch 'master' into footer_column__names 2024-06-15 08:39:02 +02:00
Nikita Mikhaylov
e48ebf5529 Fixed build and tests 2024-06-14 01:43:47 +02:00
Nikita Mikhaylov
a98ec2e0e9 Update src/Formats/EscapingRuleUtils.cpp
Co-authored-by: Yakov Olkhovskiy <99031427+yakov-olkhovskiy@users.noreply.github.com>
2024-06-14 01:43:47 +02:00
Nikita Mikhaylov
04897eb0fa Done 2024-06-14 01:43:47 +02:00
Kruglov Pavel
085c406f1f
Merge pull request #64641 from azat/fix-float-inference
Fix type inference for float (in case of small buffer)
2024-06-12 11:38:54 +00:00
Blargian
09e59765c5 Add setting output_format_pretty_display_footer_column_names_min_rows 2024-06-12 09:45:36 +02:00
Blargian
37e03ef320 Modify pretty formats to display column names in the footer when row count is large 2024-06-12 07:52:50 +02:00
Blargian
5aa9389f85 Add failing test, setting and docuumentation 2024-06-11 15:13:36 +02:00
LiuNeng
0ca96559c2
Merge branch 'master' into adapting-parquet-block-size 2024-06-06 11:14:01 +08:00
Robert Schulze
ec3b82ba63
Merge pull request #64606 from rschu1ze/map-stuff
Double-checking #59318 and docs for `Map`
2024-06-05 07:56:29 +00:00
Azat Khuzhin
918d3849e1 Simplify logic for input_format_try_infer_integers
Now, when we can be sure that it is a float, parse it as a float, and
fallback to int/uint after.

But note, that this would break something if tryReadFloat() !=
tryReadIntText() + parsing of '.'/'e', but for now, it is true.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-03 12:09:47 +02:00
Amos Bird
b2d6610d5f
Support empty tuple. 2024-06-03 16:05:42 +08:00
Azat Khuzhin
5246c56a2a Fix type inference for float (in case of small buffer)
In case of small buffer (i.e. --max_read_buffer_size 1) the pos() will
be always point to this one byte, so, comparing pos() will be always
evaluated to true.

And we cannot use count() as well, since in case of big buffer it will
be the same, plus, in case of reading extra byte for checking for '.'
the count() will be different, but it does not mean that the byte had
been interpreted (and allowing 1 byte of difference will not work almost
always, since it will read max_read_buffer_size bytes).

So instead, expose the has_fractional flag from the read helpers for
float, via two new methods:
- tryReadFloatTextExt
- tryReadFloatTextExtNoExponent

Where "ext" stands for "extended", which means expose extra information.

v2: consider number as float if it has '.' or 'e' (previously only if it
has some signs after those two it had been considered as float)

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-06-01 16:34:55 +02:00
Robert Schulze
8fc358f427
Merge remote-tracking branch 'rschu1ze/master' into map-stuff 2024-05-31 11:19:57 +00:00
Robert Schulze
b0c955e9c9
Various stuff 2024-05-29 20:51:48 +00:00
Robert Schulze
18d432f44c
Reapply "Remove some unnecessary UNREACHABLEs"
This reverts commit 5a868304c0.
2024-05-29 13:37:47 +00:00
liuneng
3bd3717d34 revert setting rename 2024-05-29 10:24:42 +08:00
liuneng
b4c2fa7e27 add test case 2024-05-28 15:17:08 +08:00
Alexander Tokmakov
5a868304c0
Revert "Remove some unnecessary UNREACHABLEs" 2024-05-27 11:38:22 +02:00
liuneng
b30d11f046 adapting parquet reader output block rows 2024-05-27 16:21:31 +08:00
Robert Schulze
7a552f5b06
Merge pull request #64035 from rschu1ze/unreachable-unreachable
Remove some unnecessary `UNREACHABLE`s
2024-05-26 20:37:17 +00:00
Michael Kolupaev
ee3e7f2fd0
Merge pull request #60361 from copperybean/gcmaster-parquet
A native parquet reader for primitive types
2024-05-24 04:50:12 +00:00
Kruglov Pavel
30dce7821c
Merge pull request #63058 from Avogar/dynamic-data-type
Implement Dynamic data type
2024-05-23 14:19:46 +00:00
Robert Schulze
f792a602da
Merge remote-tracking branch 'rschu1ze/master' into unreachable-unreachable 2024-05-22 21:08:27 +00:00
ZhiHong Zhang
3d7befef4f
Merge branch 'master' into gcmaster-parquet 2024-05-22 23:31:00 +08:00
Kruglov Pavel
fddedee9a9
Merge pull request #59747 from Blargian/56257_parse_crlf_with_TSV_files
Parse CRLF with TSV files
2024-05-22 13:45:07 +00:00
Kruglov Pavel
4989109e13
Merge pull request #63662 from v01dXYZ/63496-compression-from-file-descriptor
Compress STDOUT if redirected to file with a compression extension
2024-05-22 13:37:05 +00:00
Robert Schulze
0d3aeddc93
Merge remote-tracking branch 'rschu1ze/master' into unreachable-unreachable 2024-05-22 07:25:00 +00:00
ZhiHong Zhang
4b1c9adb3a
Merge branch 'ClickHouse:master' into gcmaster-parquet 2024-05-22 09:33:01 +08:00
avogar
6bba847b7d Merge branch 'master' of github.com:ClickHouse/ClickHouse into dynamic-data-type 2024-05-21 09:08:24 +00:00
Robert Schulze
2909e6451b
Move StringUtils.h/cpp back to Common/ 2024-05-19 09:39:36 +00:00
Robert Schulze
9969f9cf30
Merge remote-tracking branch 'rschu1ze/master' into unreachable-unreachable 2024-05-19 08:26:39 +00:00
Kruglov Pavel
c861ac4858
Merge branch 'master' into dynamic-data-type 2024-05-17 22:17:41 +02:00
Robert Schulze
53e992af4f
Remove some unnecessary UNREACHABLEs 2024-05-17 11:46:07 +00:00
Raúl Marín
7e429482fc Revert "Merge pull request #63479 from yariks5s/add_setting_from_multiline_strings"
This reverts commit 962d5e5bda, reversing
changes made to 8c4a5d3663.
2024-05-16 12:55:26 +02:00
Kruglov Pavel
4cfe2665de
Update src/Formats/FormatSettings.h 2024-05-15 20:28:17 +02:00
Kruglov Pavel
413be14c43
Merge branch 'master' into dynamic-data-type 2024-05-15 13:43:04 +02:00
Shaun Struwig
47ab2e2dc5
Merge branch 'ClickHouse:master' into 56257_parse_crlf_with_TSV_files 2024-05-15 05:28:18 +02:00
avogar
a7e87e22ad Merge branch 'master' of github.com:ClickHouse/ClickHouse into 56257_parse_crlf_with_TSV_files 2024-05-14 11:56:43 +00:00
豪肥肥
72fa329808
Merge branch 'ClickHouse:master' into output_format_npy 2024-05-14 11:07:46 +08:00