Commit Graph

440 Commits

Author SHA1 Message Date
Geoff Genz
5a40a871c8
Fix Arrow format settings anchor 2024-03-27 05:42:22 -06:00
avogar
dc87c483dd Add a setting to allow saving bad escape sequences in JSON input formats 2024-03-25 21:58:53 +00:00
Shaun Struwig
f4c33b85fc
Update formats.md
Change `urlencoded` to `URL-encoded`
2024-03-18 18:23:47 +01:00
Blargian
8a4d4641dd Update documentation with an example 2024-03-15 21:22:02 +01:00
Shaun Struwig
f251a6d262
Merge branch 'ClickHouse:master' into 59557_form_input_format 2024-03-11 18:52:28 +01:00
Shaun Struwig
beb0d08bdb
Merge branch 'ClickHouse:master' into 56257_parse_crlf_with_TSV_files 2024-03-05 14:09:01 +01:00
HowePa
0b72f7b182 Make all format names case insensitive. 2024-02-26 22:46:51 +08:00
Blargian
d2fb1e4ff8 Add FormSchemaReader 2024-02-19 07:58:22 +01:00
Shaun Struwig
1d440f0399
Merge branch 'ClickHouse:master' into 56257_parse_crlf_with_TSV_files 2024-02-16 19:48:33 +01:00
Igor Nikonov
062602d433 Merge remote-tracking branch 'origin/master' into docs_fix_typos 2024-02-15 18:22:00 +00:00
Shaun Struwig
525a5188e4
Merge branch 'master' into 56257_parse_crlf_with_TSV_files 2024-02-14 17:40:08 +01:00
Nikolai Fedorovskikh
a98af159b5 [Docs] fix some typos and missing commas 2024-02-13 02:10:41 +01:00
Blargian
debc804b77 documentation changes 2024-02-08 08:10:53 +01:00
Blargian
4a8a7208f2 rename of settings, add setting for resultset, extend test, fix documentation and add to SettingsChanges log 2024-01-29 21:25:58 +01:00
Shaun Struwig
e988f8a471
fix typo in formats.md 2024-01-24 17:30:04 +01:00
Blargian
3832a8261a #31363 - update documentation for En and Ru 2024-01-22 23:20:02 +02:00
avogar
bfd7db8cf4 Better docs 2024-01-12 13:06:51 +00:00
avogar
7e5ba62017 Allow to read Bool values into String in JSON input formats 2024-01-05 20:33:30 +00:00
avogar
1db000dae5 Add docs 2023-12-11 18:02:05 +00:00
Kruglov Pavel
1831ecc38f
Merge pull request #56424 from yariks5s/npy_float_16
Float16 support for npy format
2023-11-16 17:00:24 +01:00
James Seymour
43b921abe5 fix(docs): correct default value for output_format_parquet_compression_method to 'ltz4' 2023-11-11 14:56:16 +10:00
yariks5s
4da512d6a3 ehnanced docs 2023-11-07 17:57:39 +00:00
Kruglov Pavel
bf77ce691c
Merge pull request #55982 from yariks5s/npy_input_format
New input format Npy
2023-11-01 14:26:22 +01:00
yariks5s
9a2d89e3e4 removed getSize() and enhanced docs 2023-10-30 12:42:19 +00:00
yariks5s
894724bfb3 suggested changes 2023-10-28 01:17:25 +00:00
yariks5s
23635352f1 fixed due to review 2023-10-27 15:43:03 +00:00
yariks5s
2ab1ae42c1 added docs and tests, style check 2023-10-25 10:37:05 +00:00
Michael Kolupaev
ce7eca0615
DWARF input format (#55450)
* Add ReadBufferFromFileBase::isRegularLocalFile()

* DWARF input format

* Review comments

* Changed things around ENABLE_EMBEDDED_COMPILER build setting

* Added 'ranges' column

* no-msan no-ubsan
2023-10-16 17:00:07 -07:00
Kruglov Pavel
5ded0005a3
Merge pull request #55064 from AVMusorin/system-drop-format-cache
Allow drop cache for Protobuf format
2023-10-10 14:23:11 +02:00
Aleksandr Musorin
8d0c961af0 Allow drop cache for protobuf format
Before it was impossible to update Protobuf schema without server
restart. With this commit, it is enough to send query `SYSTEM DROP
SCHEMA FORMAT CACHE [FOR Protobuf]`.
2023-10-09 10:41:15 +02:00
avogar
e2d7c543eb Don't try to infer numbers from strings in JSON formats by default to avoid parsing errors, add docs for setting input_format_json_try_infer_numbers_from_strings 2023-09-28 22:15:26 +00:00
Kruglov Pavel
b6863a9f52
Fix comments 2023-09-26 14:13:34 +02:00
avogar
42ca897f2d Better schema inference for JSON formats 2023-09-25 15:42:59 +00:00
Kruglov Pavel
c68456a20a
Merge pull request #52692 from Avogar/variable-number-of-volumns-more-formats
Allow variable number of columns in more formats, make it work with schema inference
2023-08-21 13:28:35 +02:00
Michael Kolupaev
a1522e22ea
Merge pull request #53281 from Avogar/batch-small-parquet-row-groups
Optimize reading small row groups by batching them together in Parquet
2023-08-18 17:15:42 -07:00
avogar
bca91548ad Add setting input_format_parquet_local_file_min_bytes_for_seek 2023-08-17 12:28:01 +00:00
avogar
7e863a2726 Address comments 2023-08-11 13:17:49 +00:00
avogar
3ad7e57059 Optimize reading small row groups by batching them together in Parquet 2023-08-11 13:17:45 +00:00
Kruglov Pavel
33a39900ad
Merge branch 'master' into variable-number-of-volumns-more-formats 2023-08-09 19:51:17 +02:00
avogar
98435657cb Clean up 2023-08-09 11:28:09 +00:00
avogar
01a7c7560f Add input format One 2023-08-09 11:25:32 +00:00
Anton Popov
ff137773e7
Merge branch 'master' into formats-with-subcolumns 2023-08-02 15:24:56 +02:00
Kruglov Pavel
3e1c409e60
Merge branch 'master' into structure-to-schema 2023-07-28 11:32:16 +02:00
avogar
b9c9933cc9 Fix typo 2023-07-27 18:56:23 +00:00
avogar
67b0993bdf Add documentation 2023-07-27 18:54:41 +00:00
avogar
6d77d52dfe Allow variable number of columns in TSV/CuatomSeprarated/JSONCompactEachRow, make schema inference work with variable number of columns 2023-07-27 18:02:29 +00:00
Kruglov Pavel
0d34e97dbe
Merge branch 'master' into formats-with-subcolumns 2023-07-26 13:30:35 +02:00
Kruglov Pavel
342400d0b3
Merge branch 'master' into revert-52322-revert-51716-bug_fix_csv_field_type_not_match 2023-07-20 12:39:38 +02:00
Kruglov Pavel
f0026af189
Revert "Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed"" 2023-07-19 14:51:11 +02:00
Kruglov Pavel
7b3564f96a
Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed" 2023-07-19 14:44:59 +02:00
kevinyhzou
94796f28ad ci fix 2023-07-19 19:24:16 +08:00
kevinyhzou
95424177d5 review fix 2023-07-19 18:26:54 +08:00
Kruglov Pavel
1e616e17ab
Merge branch 'master' into row-binary-with-defaults 2023-07-17 19:13:57 +02:00
Kruglov Pavel
1dd05319b5
Merge branch 'master' into formats-with-subcolumns 2023-07-17 19:13:42 +02:00
kevinyhzou
355faa4251 ci fix 2023-07-17 20:08:32 +08:00
kevinyhzou
b2665031dc review fix 2023-07-13 20:27:14 +08:00
kevinyhzou
ba57c84db3 bug fix csv input field type mismatch 2023-07-13 20:24:10 +08:00
Dmitry Kardymon
32f5a78302 Fix setting name 2023-07-06 07:32:46 +00:00
Dmitry Kardymon
24b5c9c204 Use one setting input_format_csv_allow_variable_number_of_colums and code in RowInput 2023-07-06 06:05:43 +00:00
Kruglov Pavel
a2805f8f44
Merge branch 'master' into formats-with-subcolumns 2023-07-04 23:27:03 +02:00
avogar
3dc4ff1760 Remove obsolete settings 2023-07-04 21:21:22 +00:00
Dmitry Kardymon
ab4142eb8f Merge remote-tracking branch 'clickhouse/master' into ADQM-870 2023-07-04 08:23:31 +03:00
avogar
34bf0284ad Add RowBinaryWithDefaults format 2023-06-30 16:18:30 +00:00
Nikifor Seriakov
5a39960e03
Update docs/en/interfaces/formats.md
Fixed RawBLOB comparison lists formatting.
2023-06-27 21:32:39 +04:00
Dmitry Kardymon
dbced8a30c Merge remote-tracking branch 'origin/master' into ADQM-870 2023-06-22 19:49:06 +00:00
Dan Roscigno
c856c4a7df
Merge branch 'master' into Docs/ip_addresses 2023-06-21 17:26:27 -04:00
Dmitry Kardymon
dd43a186ad Minor edit docs / add int256 test 2023-06-19 09:51:29 +00:00
Dmitry Kardymon
30bea857fd Merge remote-tracking branch 'origin/master' into ADQM-870 2023-06-19 07:19:07 +00:00
Kruglov Pavel
38ed92c8f4
Update Avro format docs 2023-06-16 15:53:29 +02:00
Dmitry Kardymon
806176d88e Add input_format_csv_missing_as_default setting and tests 2023-06-15 11:23:08 +00:00
KevinyhZou
953f40aa3b
Merge branch 'master' into bug_fix_csv_parse_by_tab_delimiter 2023-06-15 10:25:19 +08:00
Denny Crane
fd01cb7bec
Merge branch 'master' into Docs/ip_addresses 2023-06-14 17:38:48 -03:00
Dmitry Kardymon
a91fc3ddb3 Add docs/ add more cases in test 2023-06-14 16:44:31 +00:00
kevinyhzou
f3b99156ac review fix 2023-06-14 10:48:21 +08:00
kevinyhzou
911f8ad8dc use whitespace or tab as field delimiter 2023-06-12 11:57:52 +08:00
Kruglov Pavel
1baa6404e6
Merge branch 'master' into skip-trailing-empty-lines 2023-06-06 19:39:34 +02:00
avogar
df50833b70 Allow to skip trailing empty lines in CSV/TSV/CustomeSeparated formats 2023-06-06 17:33:05 +00:00
Dan Roscigno
f691fe787b
Merge branch 'master' into Docs/ip_addresses 2023-06-06 09:12:31 -04:00
Denny Crane
2cc457141e clean documentation of ip4 ip6 from domains 2023-06-04 15:32:54 -03:00
Alexey Gerasimchuck
75791d7a63 Added input_format_csv_trim_whitespaces parameter 2023-05-25 07:51:32 +00:00
Alexey Milovidov
2370f46e70
Update formats.md 2023-04-22 21:36:58 +03:00
robot-ch-test-poll1
f466c89621
Merge pull request #48911 from Avogar/parquet-metadata-format
Add ParquetMetadata input format to read Parquet file metadata
2023-04-21 03:46:26 +02:00
Robert Schulze
c406663442
Docs: Replace annoying three spaces in enumerations by a single space 2023-04-19 15:56:55 +00:00
avogar
7a67951f64 Add more fields, fix style 2023-04-18 17:59:01 +00:00
avogar
2518ee4988 Remove deleted line 2023-04-18 16:58:43 +00:00
avogar
f7f609dfb9 Better 2023-04-18 16:57:55 +00:00
avogar
b277a5c943 Add ParquetMetadata input format to read Parquet file metadata 2023-04-18 16:46:26 +00:00
Kruglov Pavel
e9784b9444
Bring back accidentally deleted line 2023-04-18 16:06:21 +02:00
Kruglov Pavel
8aade2d39e
Fix typo 2023-04-18 15:56:41 +02:00
avogar
e47bb5cc72 Add docs 2023-04-18 13:35:28 +00:00
Sergei Trifonov
cc5a4c3853
Merge pull request #48568 from ClickHouse/serxa-patch-3
Update formats.md
2023-04-10 19:07:08 +02:00
Thom O'Connor
8b0adf58e5
Update formats.md
Including an Avro example with DESCRIBE url in S3
2023-04-10 10:17:43 -06:00
Sergei Trifonov
3bc5c6423b
Update formats.md 2023-04-09 16:54:21 +02:00
Kruglov Pavel
fb33d6bedb
Merge pull request #48126 from Avogar/orc-arrow-parquet-more-types
Support more ClickHouse types in ORC/Arrow/Parquet formats
2023-04-06 13:32:46 +02:00
Kruglov Pavel
9331c6c260
Merge pull request #48122 from Avogar/bson-more-types
Support Enum output/input in BSONEachRow, allow all map key types and avoid extra calculations
2023-04-05 18:26:19 +02:00
Kruglov Pavel
8bd997d680
Update docs/en/interfaces/formats.md
Co-authored-by: Yakov Olkhovskiy <99031427+yakov-olkhovskiy@users.noreply.github.com>
2023-04-05 11:57:41 +02:00
Kruglov Pavel
8d6b709f1a
Merge pull request #48257 from Avogar/capnp-more-types
Support more types in CapnProto format, allow integer conversions
2023-04-05 11:56:46 +02:00
Yakov Olkhovskiy
e1e508f8ec
Merge branch 'master' into bson-more-types 2023-04-03 17:21:50 -04:00
Kruglov Pavel
e2c3c3accc
Merge branch 'master' into orc-arrow-parquet-more-types 2023-04-03 18:33:20 +02:00
Kruglov Pavel
cef4a5b29c
Fix docs 2023-03-30 22:07:57 +02:00