kevinyhzou
2a50daf5dd
Allow cr at end of csv line
2023-11-06 12:21:42 +08:00
kevinyhzou
ef30e6723d
fix csv read when end of line is not crlf
2023-11-06 12:21:42 +08:00
zvonand
0766c73aab
Rename date_time_overflow_mode -> date_time_overflow_behavior, moved it to format settings
2023-10-25 23:11:13 +02:00
李扬
465962df7f
Support orc filter push down (file + stripe + rowgroup level) (#55330)
...
* support orc filter push down
* update orc lib version
* replace setqueryinfo with setkeycondition
* fix issue https://github.com/ClickHouse/ClickHouse/issues/53536
* refactor source with key condition
* fix building error
* remove std::cout
* update orc
* update orc version
* fix bugs
* improve code
* upgrade orc lib
* fix code style
* change as requested
* add performance tests for orc filter push down
* add performance tests for orc filter push down
* fix all bugs
* fix default as null issue
* add uts for null as default issues
* upgrade orc lib
* fix failed orc lib uts and fix typo
* fix failed uts
* fix failed uts
* fix ast fuzzer tests
* fix bug of uint64 overflow in https://s3.amazonaws.com/clickhouse-test-reports/55330/de22fdcaea2e12c96f300e95f59beba84401712d/fuzzer_astfuzzerubsan/report.html
* fix asan fatal caused by reused column vector batch in native orc input format. refer to https://s3.amazonaws.com/clickhouse-test-reports/55330/be39d23af2d7e27f5ec7f168947cf75aeaabf674/stateless_tests__asan__[4_4].htm
* fix wrong performance tests
* disable 02892_orc_filter_pushdown on aarch64. https://s3.amazonaws.com/clickhouse-test-reports/55330/be39d23af2d7e27f5ec7f168947cf75aeaabf674/stateless_tests__aarch64_.html
* add some comments
* add some comments
* inline range::equals and range::less
* fix data race of key condition
* trigger ci
2023-10-24 12:08:17 -07:00
taiyang-li
9c186d18a8
retrigger ci
2023-10-24 16:13:53 +08:00
taiyang-li
a02c49e16f
allow skipping null values when serializing tuple to json objects
2023-10-24 11:47:46 +08:00
Azat Khuzhin
2cbb069b68
Add ability to ignore data after semicolon in Values format
...
This is required for client, to handle comments in multiquery mode.
v0: separate context for input format
v2: cannot use separate context since params and stuff are changed in global context
v3: do not send this setting to the server (breaks queries for readonly profiles)
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-10-12 14:55:26 +02:00
Kruglov Pavel
bea80ab5b7
Merge branch 'master' into json-object-as-tuple-inference
2023-09-26 15:23:08 +02:00
avogar
42ca897f2d
Better schema inference for JSON formats
2023-09-25 15:42:59 +00:00
Kruglov Pavel
2c407ab3c0
Merge branch 'master' into json-object-as-tuple-inference
2023-09-15 16:29:48 +02:00
Kruglov Pavel
dbd24b240c
Merge branch 'master' into http-valid-json-on-exception
2023-09-15 14:55:31 +02:00
Kruglov Pavel
6419f91cfc
Merge pull request #54585 from ClickHouse/Avogar-patch-1
...
Remove output_format_markdown_escape_special_characters from settings changes history
2023-09-14 15:45:05 +02:00
avogar
1480c8ad30
Place setting into separate struct
2023-09-13 13:19:05 +00:00
slvrtrn
dddea9219a
Address the review comments
2023-09-12 18:39:03 +02:00
slvrtrn
611a75a87f
Merge remote-tracking branch 'origin' into simplified-prepared-statements-for-mysql
2023-09-12 10:38:44 +02:00
avogar
ba307c7466
Allow to infer named Tuples from JSON objects under a setting in JSON formats
2023-09-07 19:41:19 +00:00
slvrtrn
bb0eff9669
Revert format changes
2023-09-04 21:15:26 +02:00
irenjj
51aa89eed8
Add a setting to automatically escape special characters in Markdown.
2023-08-28 00:10:33 +08:00
slvrtrn
055d2e3c3d
Merge remote-tracking branch 'origin' into simplified-prepared-statements-for-mysql
2023-08-25 21:27:47 +02:00
slvrtrn
734ffd916c
WIP prepared statements
2023-08-25 20:31:21 +02:00
Michael Kolupaev
2f4d433e69
Parquet filter pushdown
2023-08-21 14:15:52 -07:00
Michael Kolupaev
6009e1b293
Merge pull request #53324 from bigo-sg/ch_gluten_2583
...
Implement native orc input format without arrow to improve performance
2023-08-21 13:44:57 -07:00
Kruglov Pavel
c68456a20a
Merge pull request #52692 from Avogar/variable-number-of-volumns-more-formats
...
Allow variable number of columns in more formats, make it work with schema inference
2023-08-21 13:28:35 +02:00
taiyang-li
f723e8d43a
change as requested
2023-08-21 12:09:02 +08:00
avogar
bca91548ad
Add setting input_format_parquet_local_file_min_bytes_for_seek
2023-08-17 12:28:01 +00:00
avogar
7e863a2726
Address comments
2023-08-11 13:17:49 +00:00
avogar
3ad7e57059
Optimize reading small row groups by batching them together in Parquet
2023-08-11 13:17:45 +00:00
Kruglov Pavel
6600f87f86
Merge branch 'master' into http-valid-json-on-exception
2023-08-10 13:53:32 +02:00
Kruglov Pavel
33a39900ad
Merge branch 'master' into variable-number-of-volumns-more-formats
2023-08-09 19:51:17 +02:00
Anton Popov
ff137773e7
Merge branch 'master' into formats-with-subcolumns
2023-08-02 15:24:56 +02:00
avogar
a71cd56a90
Output valid JSON/XML on exception during HTTP query execution
2023-08-01 10:06:56 +00:00
Kruglov Pavel
3e1c409e60
Merge branch 'master' into structure-to-schema
2023-07-28 11:32:16 +02:00
avogar
6d77d52dfe
Allow variable number of columns in TSV/CustomSeparated/JSONCompactEachRow, make schema inference work with variable number of columns
2023-07-27 18:02:29 +00:00
Kruglov Pavel
0d34e97dbe
Merge branch 'master' into formats-with-subcolumns
2023-07-26 13:30:35 +02:00
Michael Kolupaev
8184a289e5
Partially reimplement Parquet encoder to make it faster and parallelizable
2023-07-25 10:16:28 +00:00
Kruglov Pavel
f0026af189
Revert "Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed""
2023-07-19 14:51:11 +02:00
Kruglov Pavel
7b3564f96a
Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed"
2023-07-19 14:44:59 +02:00
robot-ch-test-poll4
63d0616a22
Merge pull request #51716 from KevinyhZou/bug_fix_csv_field_type_not_match
...
Improve CSVInputFormat to check and set default value to column if deserialize failed
2023-07-19 14:41:05 +02:00
kevinyhzou
95424177d5
review fix
2023-07-19 18:26:54 +08:00
avogar
67f340b501
Merge branch 'master' of github.com:ClickHouse/ClickHouse into structure-to-schema
2023-07-18 13:52:15 +00:00
Kruglov Pavel
1dd05319b5
Merge branch 'master' into formats-with-subcolumns
2023-07-17 19:13:42 +02:00
kevinyhzou
355faa4251
ci fix
2023-07-17 20:08:32 +08:00
robot-clickhouse-ci-2
ac3cc1c2ff
Merge pull request #45671 from ClibMouse/feature/interval-kql-style-formatting
...
Implement KQL-style formatting for Interval
2023-07-16 04:06:54 +02:00
kevinyhzou
b2665031dc
review fix
2023-07-13 20:27:14 +08:00
kevinyhzou
ba57c84db3
fix csv input field type mismatch
2023-07-13 20:24:10 +08:00
ltrk2
2d2debe3ce
Introduce a separate setting for interval output formatting
2023-07-10 13:51:49 -04:00
ltrk2
522b9ebf8c
Implement KQL-style formatting for Interval
2023-07-10 13:51:49 -04:00
Dmitry Kardymon
32f5a78302
Fix setting name
2023-07-06 07:32:46 +00:00
Dmitry Kardymon
24b5c9c204
Use one setting input_format_csv_allow_variable_number_of_colums and code in RowInput
2023-07-06 06:05:43 +00:00
avogar
98aa6b317f
Support reading subcolumns from file/s3/hdfs/url/azureBlobStorage table functions
2023-07-04 21:17:26 +00:00