Commit Graph

411 Commits

Author SHA1 Message Date
avogar
dc87c483dd Add a setting to allow saving bad escape sequences in JSON input formats 2024-03-25 21:58:53 +00:00
Alexey Milovidov
3e5ddddb35 Merge branch 'master' into dont-cut-single-value 2024-03-24 00:51:10 +01:00
Alexey Milovidov
4cbecd0bbd Add a setting 2024-03-23 04:20:52 +01:00
Alexey Milovidov
a2e89c8be7 Fix wrong cases of numbers pretty printing
Add a test

Revert changes from another branch

Add a test

Better test

Revert wrong changes
2024-03-23 03:33:03 +01:00
Alexey Milovidov
570692fe83
Merge branch 'master' into json-ambg-tuple-inference 2024-03-05 04:50:39 +03:00
Kruglov Pavel
4bdafed801
Merge pull request #60420 from HowePa/format_case_insensitive
Make all format names case insensitive.
2024-03-04 19:09:10 +01:00
avogar
70abdf7a41 Small improvements in JSON schema inference 2024-03-04 17:32:22 +00:00
豪肥肥
6f9cb058a6
Update FormatFactory.cpp 2024-02-27 07:59:09 +08:00
豪肥肥
24155c80c9
Update src/Formats/FormatFactory.cpp
Co-authored-by: Kruglov Pavel <48961922+Avogar@users.noreply.github.com>
2024-02-27 07:50:04 +08:00
HowePa
dbd8d35f01 use lower case in dict 2024-02-27 00:48:34 +08:00
HowePa
0b72f7b182 Make all format names case insensitive. 2024-02-26 22:46:51 +08:00
rogeryk
7a92f542b4 Add setting output_format_pretty_single_large_number_tip_threshold 2024-02-26 20:19:53 +08:00
Kruglov Pavel
5fd2582e83
Merge pull request #59500 from Avogar/exponent-floats-inference
Don't infer floats in exponential notation by default
2024-02-19 13:51:07 +01:00
Alexey Milovidov
df48106cd5
Merge pull request #60015 from azat/values-quote-escape
Fix INSERT into SQLite with single quote (by escaping single quotes with a quote instead of backslash)
2024-02-19 10:37:45 +01:00
Kruglov Pavel
4d6f167e0c
Merge pull request #59092 from Avogar/auto-format-detection
Try to detect file format automatically during schema inference if it's unknown
2024-02-16 14:32:18 +01:00
Azat Khuzhin
bbe38a3fe4 Add ability to escape quotes in Values format with single quote
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2024-02-15 12:47:29 +01:00
Alexey Milovidov
bbd7acd7f9
Merge branch 'master' into exponent-floats-inference 2024-02-15 01:46:51 +01:00
avogar
ff21aa9a19 Don't infer floats in exponential notation by default 2024-02-01 19:47:05 +00:00
Blargian
4a8a7208f2 rename of settings, add setting for resultset, extend test, fix documentation and add to SettingsChanges log 2024-01-29 21:25:58 +01:00
Kruglov Pavel
46a6b84a5a
Merge branch 'master' into auto-format-detection 2024-01-25 22:11:07 +01:00
Shaun Struwig
e6844a5412
Merge branch 'ClickHouse:master' into #31363_format_template_configure_in_settings 2024-01-25 20:06:45 +01:00
Maksim Kita
2a327107b6 Updated implementation 2024-01-25 14:31:49 +03:00
avogar
93fbe1d9c8 Fixes 2024-01-23 18:59:40 +00:00
avogar
617cc514b7 Try to detect file format automatically during schema inference if it's unknown 2024-01-23 18:59:39 +00:00
Blargian
7b235fe643 #31363 - remove schema delimiter setting and add test 00937_format_schema_rows_template.sh and reference 2024-01-22 22:59:59 +02:00
Blargian
eae39ff545 #31363 - modified TemplateBlockOutputFormat to work with added format_schema_rows_template setting 2024-01-21 21:51:06 +02:00
Blargian
f1749217ee added format_schema_rows_template setting 2024-01-18 21:53:56 +02:00
Kruglov Pavel
6d064512e1
Merge pull request #58614 from Blargian/58363_disable_ansi_pretty_automatically
58363 Automatically disable ANSI escape sequences in Pretty formats if the output is not a terminal
2024-01-17 13:45:41 +01:00
Blargian
5f500522a4 #58363 - added setting is_writing_to_terminal to FormatSettings.h, modified PrettyBlockOutputFormat to use this, which is set in FormatFactory.cpp getOutputFormat and getOutputFormatParallelIfPossible 2024-01-15 16:32:51 +02:00
Alexey Milovidov
afb50f03d9
Merge pull request #58519 from Avogar/control-arrow-dict-indexes-type
Add settings for better control of indexes type in Arrow dictionary
2024-01-13 20:00:40 +01:00
Blargian
0fdba3b83d #58363 - fix failing style check 2024-01-13 12:14:54 +02:00
Alexey Milovidov
d112492c56 Remove some code 2024-01-13 03:48:04 +01:00
Blargian
72b5cf5993 #58363 - removed switch from PrettyBlockOutputFormat and modified BlockOutputFormats to use color variable. Updated english and russian documentation. Updated test 00405 reference file. 2024-01-12 19:46:03 +02:00
avogar
fbfdde60a7 Add settings for better control of indexes type in Arrow dictionary. Use signed integer type for indexes by default 2024-01-12 13:06:51 +00:00
Blargian
aa8876a611 #58363 - Changes based on review of draft PR - changed output_format_pretty_color to use UInt64Auto. Added isWritingToTerminal function to IO/WriteHelpers.h and updated test 2024-01-12 12:31:57 +02:00
Blargian
b65adbecc1 minor fixes. Doesnt seem to be using ANSI escapes anymore 2024-01-08 23:52:25 +02:00
Blargian
a15b573315 #58363 - fix formatting issues and change ON, OFF, AUTO to 0, 1, auto 2024-01-08 15:25:14 +02:00
avogar
7e5ba62017 Allow to read Bool values into String in JSON input formats 2024-01-05 20:33:30 +00:00
Blargian
459946035c #58363 🚧 modified Pretty in FormatSettings.h to have PrettyColor which can be 0,1 or auto. modified output_format_pretty_color in FormatFactory.cpp to make use of this, added the default to Settings.h. Implemented the logic for enabling/disabling based on output_format_pretty_color in PrettyBlockOutputFormat.h 2024-01-04 16:10:36 +02:00
Kruglov Pavel
6567fb2c08
Merge pull request #56859 from Avogar/csv-infer-numbers-from-strings
Allow to infer numbers from strings in CSV format
2023-12-12 17:09:02 +01:00
Kruglov Pavel
20510cde34
Merge pull request #57006 from Avogar/save-errors-better
Fix early stop while parsing file with skipping lots of errors
2023-12-11 19:03:14 +01:00
avogar
c3a76fcc08 Allow to infer numbers from strings in CSV format 2023-12-11 18:02:05 +00:00
Kruglov Pavel
c6fecfb1af
Merge pull request #56901 from KevinyhZou/Fix_allow_cr_end_of_csv_line
Fix allow cr end of line for csv
2023-11-29 20:57:58 +01:00
János Benjamin Antal
ab935e3dd7 Use the google proto files when importing protobuf schemas 2023-11-22 12:39:41 +00:00
kevinyhzou
3adc8fdf78 Fix ci 2023-11-21 11:22:12 +08:00
avogar
ffa90628f0 Make input format errors logger a bit better 2023-11-20 17:22:49 +00:00
kevinyhzou
2a50daf5dd Allow cr at end of csv line 2023-11-06 12:21:42 +08:00
kevinyhzou
ef30e6723d bug fix csv read while end of line is not crlf 2023-11-06 12:21:42 +08:00
zvonand
0766c73aab Rename date_time_overflow_mode -> date_time_overflow_behavior, moved it to format settings 2023-10-25 23:11:13 +02:00
李扬
465962df7f
Support orc filter push down (file + stripe + rowgroup level) (#55330)
* support orc filter push down

* update orc lib version

* replace setqueryinfo with setkeycondition

* fix issue https://github.com/ClickHouse/ClickHouse/issues/53536

* refactor source with key condition

* fix building error

* remove std::cout

* update orc

* update orc version

* fix bugs

* improve code

* upgrade orc lib

* fix code style

* change as requested

* add performance tests for orc filter push down

* add performance tests for orc filter push down

* fix all bugs

* fix default as null issue

* add uts for null as default issues

* upgrade orc lib

* fix failed orc lib uts and fix typo

* fix failed uts

* fix failed uts

* fix ast fuzzer tests

* fix bug of uint64 overflow in https://s3.amazonaws.com/clickhouse-test-reports/55330/de22fdcaea2e12c96f300e95f59beba84401712d/fuzzer_astfuzzerubsan/report.html

* fix asan fatal caused by reused column vector batch in native orc input format. refer to https://s3.amazonaws.com/clickhouse-test-reports/55330/be39d23af2d7e27f5ec7f168947cf75aeaabf674/stateless_tests__asan__[4_4].htm

* fix wrong performance tests

* disable 02892_orc_filter_pushdown on aarch64. https://s3.amazonaws.com/clickhouse-test-reports/55330/be39d23af2d7e27f5ec7f168947cf75aeaabf674/stateless_tests__aarch64_.html

* add some comments

* add some comments

* inline range::equals and range::less

* fix data race of key condition

* trigger ci
2023-10-24 12:08:17 -07:00