Commit Graph

46 Commits

Author SHA1 Message Date
Dmitry Kardymon
24b5c9c204 Use one setting input_format_csv_allow_variable_number_of_colums and code in RowInput 2023-07-06 06:05:43 +00:00
Dmitry Kardymon
dd43a186ad Minor edit docs / add int256 test 2023-06-19 09:51:29 +00:00
Dmitry Kardymon
30bea857fd Merge remote-tracking branch 'origin/master' into ADQM-870 2023-06-19 07:19:07 +00:00
Dmitry Kardymon
806176d88e Add input_format_csv_missing_as_default setting and tests 2023-06-15 11:23:08 +00:00
KevinyhZou
953f40aa3b
Merge branch 'master' into bug_fix_csv_parse_by_tab_delimiter 2023-06-15 10:25:19 +08:00
Dmitry Kardymon
a91fc3ddb3 Add docs/ add more cases in test 2023-06-14 16:44:31 +00:00
kevinyhzou
f3b99156ac review fix 2023-06-14 10:48:21 +08:00
Kruglov Pavel
607f337d67
Merge pull request #50592 from Avogar/max-bytes-to-read-in-schema-inference
Add setting to limit the number of bytes to read in schema inference
2023-06-13 16:47:57 +02:00
kevinyhzou
911f8ad8dc use whitespace or tab as field delimiter 2023-06-12 11:57:52 +08:00
Kruglov Pavel
1baa6404e6
Merge branch 'master' into skip-trailing-empty-lines 2023-06-06 19:39:34 +02:00
avogar
df50833b70 Allow to skip trailing empty lines in CSV/TSV/CustomeSeparated formats 2023-06-06 17:33:05 +00:00
Kruglov Pavel
af880a6f3b
Merge branch 'master' into max-bytes-to-read-in-schema-inference 2023-06-06 14:47:58 +02:00
avogar
33e51d4f3b Add setting to limit the number of bytes to read in schema inference 2023-06-05 15:22:04 +00:00
Alexey Gerasimchuk
9958731c27
Merge branch 'master' into ADQM-830 2023-06-05 07:46:47 +10:00
Robert Schulze
54872f9e7e
Typos: Follow-up to #50476 2023-06-02 13:28:09 +00:00
Alexey Gerasimchuck
4eb944fef1 minor changes in documentation 2023-05-25 08:05:11 +00:00
Alexey Gerasimchuck
75791d7a63 Added input_format_csv_trim_whitespaces parameter 2023-05-25 07:51:32 +00:00
Mike Kot
f1d0ed367d separate name for format setting 2023-04-28 13:39:38 +00:00
Mike Kot
7bffa89e96 add all 3 settings 2023-04-28 13:22:36 +00:00
Mike Kot
e428af0b63 replace format settings with server setting 2023-04-27 21:27:18 +00:00
Mike Kot
357cde95e9 Merge branch 'master' into feature/show-secrets-option 2023-04-27 20:19:40 +00:00
Robert Schulze
c406663442
Docs: Replace annoying three spaces in enumerations by a single space 2023-04-19 15:56:55 +00:00
Rich Raposa
3a3438ebf9
Update settings-formats.md
Default compression method for Parquet changed from snappy to lz4 in 23.3
2023-03-30 10:26:33 -06:00
DanRoscigno
6d8a2bbd48 standardize admonitions 2023-03-27 14:54:05 -04:00
rfraposa
747eaf3f72 Fix broken links 2023-03-23 14:40:45 -06:00
Kruglov Pavel
fe973f3d6f
Merge branch 'master' into native-types-conversions 2023-03-09 13:03:25 +01:00
Kruglov Pavel
69a1309ade
Merge branch 'master' into native-types-conversions 2023-03-07 20:06:17 +01:00
Mike Kot
1fca58b694
Merge branch 'master' into feature/show-secrets-option 2023-03-06 15:46:54 +03:00
Mike Kot
00dfb27eb9 rm secrets test from fasttest, correct 02117 2023-03-06 15:45:48 +03:00
avogar
a6cf2cdab8 Fix style, add docs 2023-03-02 10:36:07 +00:00
Kruglov Pavel
545e5c0f5b
Merge pull request #46617 from Avogar/use-parquet-2
Change default parquet format version to v2_latest, add a setting to controle the version
2023-02-28 14:30:59 +01:00
avogar
ab899bf2f3 Allow types conversion in Native input format 2023-02-27 19:28:19 +00:00
Kruglov Pavel
36e65f5f84
Use versions vith dots 2023-02-27 19:00:40 +01:00
robot-clickhouse-ci-1
fc1e0608c6
Merge pull request #46962 from melvynator/patch-11
Add a warning to the schema_inference_hints
2023-02-27 16:51:24 +01:00
Mike Kot
c799bd094b
Merge branch 'master' into feature/show-secrets-option 2023-02-27 18:43:14 +03:00
Kruglov Pavel
443dedddca
Merge branch 'master' into use-parquet-2 2023-02-27 14:31:43 +01:00
Kruglov Pavel
e9078b402b
Merge pull request #46742 from Avogar/update-settings
Enable input_format_json_ignore_unknown_keys_in_named_tuple by default
2023-02-27 14:30:28 +01:00
Mike Kot
62acc7a6cf setting and some docs 2023-02-27 13:18:21 +00:00
Peignon Melvyn
8fe08bc458
Add a warning to the schema_inference_hints 2023-02-27 14:02:10 +01:00
avogar
54622566df Add setting to change parquet version 2023-02-23 16:14:10 +00:00
avogar
50caa3d66c Update docs 2023-02-22 16:41:49 +00:00
Kruglov Pavel
9866ecfe8b
Merge branch 'master' into null-as-default-all-formats 2023-02-20 20:49:30 +01:00
avogar
04cf144edc Fix TSKV, update docs 2023-02-10 17:20:51 +00:00
Kruglov Pavel
cf2587cd1b
Add docs 2023-02-08 12:32:26 +01:00
Azat Khuzhin
1a8437f2c9 Add ability to ignore unknown keys in JSON object for named tuples
This can be useful in case your input JSON is complex, while you need
only few fields in it.

This behaviour is controlled by the
input_format_json_ignore_unknown_keys_in_named_tuple setting name, that
is turned OFF by default.

This will, almost, allow to parse gharchive dataset without jq. "almost"
because of two things:
- Tuple cannot be Nullable, so such keys with Tuple type in ClickHouse
  cannot be `null` in JSON
- You cannot use dot.dot notation to extract columns for file() engine,
  only tupleElement()

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-27 10:01:08 +01:00
DanRoscigno
f95bc55923 split Format settings out 2023-01-24 17:19:50 -05:00