Blargian
72b5cf5993
#58363 - removed switch from PrettyBlockOutputFormat and modified BlockOutputFormats to use color variable. Updated english and russian documentation. Updated test 00405 reference file.
2024-01-12 19:46:03 +02:00
avogar
bfd7db8cf4
Better docs
2024-01-12 13:06:51 +00:00
avogar
fbfdde60a7
Add settings for better control of indexes type in Arrow dictionary. Use signed integer type for indexes by default
2024-01-12 13:06:51 +00:00
avogar
7e5ba62017
Allow to read Bool values into String in JSON input formats
2024-01-05 20:33:30 +00:00
avogar
1db000dae5
Add docs
2023-12-11 18:02:05 +00:00
kevinyhzou
2a50daf5dd
Allow cr at end of csv line
2023-11-06 12:21:42 +08:00
kevinyhzou
ef30e6723d
bug fix csv read while end of line is not crlf
2023-11-06 12:21:42 +08:00
avogar
e2d7c543eb
Don't try to infer numbers from strings in JSON formats by default to avoid parsing errors, add docs for setting input_format_json_try_infer_numbers_from_strings
2023-09-28 22:15:26 +00:00
avogar
42ca897f2d
Better schema inference for JSON formats
2023-09-25 15:42:59 +00:00
avogar
ba307c7466
Allow to infer named Tuples from JSON objects under a setting in JSON formats
2023-09-07 19:41:19 +00:00
irenjj
b57e248aab
Fix doc spell error.
2023-09-04 20:58:19 +08:00
irenjj
277be44250
add test
2023-08-30 22:57:46 +08:00
irenjj
473ba91c21
Refactor writeAnyMarkdownEscapedString and Add more info into doc.
2023-08-29 23:54:40 +08:00
irenjj
13322f1a90
Remove redundant functions.
2023-08-28 23:54:07 +08:00
irenjj
51aa89eed8
Add a setting to automatically escape special characters in Markdown.
2023-08-28 00:10:33 +08:00
Kruglov Pavel
c68456a20a
Merge pull request #52692 from Avogar/variable-number-of-volumns-more-formats
Allow variable number of columns in more formats, make it work with schema inference
2023-08-21 13:28:35 +02:00
avogar
bca91548ad
Add setting input_format_parquet_local_file_min_bytes_for_seek
2023-08-17 12:28:01 +00:00
avogar
7e863a2726
Address comments
2023-08-11 13:17:49 +00:00
Kruglov Pavel
e67c6c2f0a
Fix typo
2023-08-11 13:17:49 +00:00
avogar
3ad7e57059
Optimize reading small row groups by batching them together in Parquet
2023-08-11 13:17:45 +00:00
Kruglov Pavel
33a39900ad
Merge branch 'master' into variable-number-of-volumns-more-formats
2023-08-09 19:51:17 +02:00
Anton Popov
ff137773e7
Merge branch 'master' into formats-with-subcolumns
2023-08-02 15:24:56 +02:00
Kruglov Pavel
8f6526a930
Merge branch 'master' into structure-to-schema
2023-08-01 16:22:14 +02:00
Kruglov Pavel
3e1c409e60
Merge branch 'master' into structure-to-schema
2023-07-28 11:32:16 +02:00
Jai Jhala
dba8b445bd
Update default output_format_arrow_compression.md
Updates the default parameter of output_format_arrow_compression_method from "none" to "lz4_frame".
2023-07-27 12:32:53 -07:00
avogar
67b0993bdf
Add documentation
2023-07-27 18:54:41 +00:00
avogar
6d77d52dfe
Allow variable number of columns in TSV/CustomSeparated/JSONCompactEachRow, make schema inference work with variable number of columns
2023-07-27 18:02:29 +00:00
Kruglov Pavel
0d34e97dbe
Merge branch 'master' into formats-with-subcolumns
2023-07-26 13:30:35 +02:00
Kruglov Pavel
342400d0b3
Merge branch 'master' into revert-52322-revert-51716-bug_fix_csv_field_type_not_match
2023-07-20 12:39:38 +02:00
Kruglov Pavel
f0026af189
Revert "Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed""
2023-07-19 14:51:11 +02:00
Kruglov Pavel
7b3564f96a
Revert "Improve CSVInputFormat to check and set default value to column if deserialize failed"
2023-07-19 14:44:59 +02:00
robot-ch-test-poll4
63d0616a22
Merge pull request #51716 from KevinyhZou/bug_fix_csv_field_type_not_match
Improve CSVInputFormat to check and set default value to column if deserialize failed
2023-07-19 14:41:05 +02:00
kevinyhzou
dcf7ba2534
remove unuseful code
2023-07-19 19:36:19 +08:00
kevinyhzou
94796f28ad
ci fix
2023-07-19 19:24:16 +08:00
kevinyhzou
95424177d5
review fix
2023-07-19 18:26:54 +08:00
Ilya Golshtein
c550a532e6
test_for_basic_auth_registry - black formatter happy + some doc
2023-07-19 08:32:45 +00:00
Kruglov Pavel
1dd05319b5
Merge branch 'master' into formats-with-subcolumns
2023-07-17 19:13:42 +02:00
kevinyhzou
355faa4251
ci fix
2023-07-17 20:08:32 +08:00
robot-clickhouse-ci-2
ac3cc1c2ff
Merge pull request #45671 from ClibMouse/feature/interval-kql-style-formatting
Implement KQL-style formatting for Interval
2023-07-16 04:06:54 +02:00
kevinyhzou
b2665031dc
review fix
2023-07-13 20:27:14 +08:00
kevinyhzou
ba57c84db3
bug fix csv input field type mismatch
2023-07-13 20:24:10 +08:00
ltrk2
2d2debe3ce
Introduce a separate setting for interval output formatting
2023-07-10 13:51:49 -04:00
ltrk2
b673aa8e6b
Use the dialect configuration
2023-07-10 13:51:49 -04:00
ltrk2
522b9ebf8c
Implement KQL-style formatting for Interval
2023-07-10 13:51:49 -04:00
Dmitry Kardymon
32f5a78302
Fix setting name
2023-07-06 07:32:46 +00:00
Dmitry Kardymon
24b5c9c204
Use one setting input_format_csv_allow_variable_number_of_colums and code in RowInput
2023-07-06 06:05:43 +00:00
avogar
3dc4ff1760
Remove obsolete settings
2023-07-04 21:21:22 +00:00
Dmitry Kardymon
dd43a186ad
Minor edit docs / add int256 test
2023-06-19 09:51:29 +00:00
Dmitry Kardymon
30bea857fd
Merge remote-tracking branch 'origin/master' into ADQM-870
2023-06-19 07:19:07 +00:00
Dmitry Kardymon
806176d88e
Add input_format_csv_missing_as_default setting and tests
2023-06-15 11:23:08 +00:00
KevinyhZou
953f40aa3b
Merge branch 'master' into bug_fix_csv_parse_by_tab_delimiter
2023-06-15 10:25:19 +08:00
Dmitry Kardymon
a91fc3ddb3
Add docs/ add more cases in test
2023-06-14 16:44:31 +00:00
kevinyhzou
f3b99156ac
review fix
2023-06-14 10:48:21 +08:00
Kruglov Pavel
607f337d67
Merge pull request #50592 from Avogar/max-bytes-to-read-in-schema-inference
Add setting to limit the number of bytes to read in schema inference
2023-06-13 16:47:57 +02:00
kevinyhzou
911f8ad8dc
use whitespace or tab as field delimiter
2023-06-12 11:57:52 +08:00
Kruglov Pavel
1baa6404e6
Merge branch 'master' into skip-trailing-empty-lines
2023-06-06 19:39:34 +02:00
avogar
df50833b70
Allow to skip trailing empty lines in CSV/TSV/CustomSeparated formats
2023-06-06 17:33:05 +00:00
Kruglov Pavel
af880a6f3b
Merge branch 'master' into max-bytes-to-read-in-schema-inference
2023-06-06 14:47:58 +02:00
avogar
33e51d4f3b
Add setting to limit the number of bytes to read in schema inference
2023-06-05 15:22:04 +00:00
Alexey Gerasimchuk
9958731c27
Merge branch 'master' into ADQM-830
2023-06-05 07:46:47 +10:00
Robert Schulze
54872f9e7e
Typos: Follow-up to #50476
2023-06-02 13:28:09 +00:00
Alexey Gerasimchuck
4eb944fef1
minor changes in documentation
2023-05-25 08:05:11 +00:00
Alexey Gerasimchuck
75791d7a63
Added input_format_csv_trim_whitespaces parameter
2023-05-25 07:51:32 +00:00
Mike Kot
f1d0ed367d
separate name for format setting
2023-04-28 13:39:38 +00:00
Mike Kot
7bffa89e96
add all 3 settings
2023-04-28 13:22:36 +00:00
Mike Kot
e428af0b63
replace format settings with server setting
2023-04-27 21:27:18 +00:00
Mike Kot
357cde95e9
Merge branch 'master' into feature/show-secrets-option
2023-04-27 20:19:40 +00:00
Robert Schulze
c406663442
Docs: Replace annoying three spaces in enumerations by a single space
2023-04-19 15:56:55 +00:00
Rich Raposa
3a3438ebf9
Update settings-formats.md
Default compression method for Parquet changed from snappy to lz4 in 23.3
2023-03-30 10:26:33 -06:00
DanRoscigno
6d8a2bbd48
standardize admonitions
2023-03-27 14:54:05 -04:00
rfraposa
747eaf3f72
Fix broken links
2023-03-23 14:40:45 -06:00
Kruglov Pavel
fe973f3d6f
Merge branch 'master' into native-types-conversions
2023-03-09 13:03:25 +01:00
Kruglov Pavel
69a1309ade
Merge branch 'master' into native-types-conversions
2023-03-07 20:06:17 +01:00
Mike Kot
1fca58b694
Merge branch 'master' into feature/show-secrets-option
2023-03-06 15:46:54 +03:00
Mike Kot
00dfb27eb9
rm secrets test from fasttest, correct 02117
2023-03-06 15:45:48 +03:00
avogar
a6cf2cdab8
Fix style, add docs
2023-03-02 10:36:07 +00:00
Kruglov Pavel
545e5c0f5b
Merge pull request #46617 from Avogar/use-parquet-2
Change default parquet format version to v2_latest, add a setting to control the version
2023-02-28 14:30:59 +01:00
avogar
ab899bf2f3
Allow types conversion in Native input format
2023-02-27 19:28:19 +00:00
Kruglov Pavel
36e65f5f84
Use versions with dots
2023-02-27 19:00:40 +01:00
robot-clickhouse-ci-1
fc1e0608c6
Merge pull request #46962 from melvynator/patch-11
Add a warning to the schema_inference_hints
2023-02-27 16:51:24 +01:00
Mike Kot
c799bd094b
Merge branch 'master' into feature/show-secrets-option
2023-02-27 18:43:14 +03:00
Kruglov Pavel
443dedddca
Merge branch 'master' into use-parquet-2
2023-02-27 14:31:43 +01:00
Kruglov Pavel
e9078b402b
Merge pull request #46742 from Avogar/update-settings
Enable input_format_json_ignore_unknown_keys_in_named_tuple by default
2023-02-27 14:30:28 +01:00
Mike Kot
62acc7a6cf
setting and some docs
2023-02-27 13:18:21 +00:00
Peignon Melvyn
8fe08bc458
Add a warning to the schema_inference_hints
2023-02-27 14:02:10 +01:00
avogar
54622566df
Add setting to change parquet version
2023-02-23 16:14:10 +00:00
avogar
50caa3d66c
Update docs
2023-02-22 16:41:49 +00:00
Kruglov Pavel
9866ecfe8b
Merge branch 'master' into null-as-default-all-formats
2023-02-20 20:49:30 +01:00
avogar
04cf144edc
Fix TSKV, update docs
2023-02-10 17:20:51 +00:00
Kruglov Pavel
cf2587cd1b
Add docs
2023-02-08 12:32:26 +01:00
Azat Khuzhin
1a8437f2c9
Add ability to ignore unknown keys in JSON object for named tuples
This can be useful in case your input JSON is complex, while you need
only few fields in it.
This behaviour is controlled by the
input_format_json_ignore_unknown_keys_in_named_tuple setting name, that
is turned OFF by default.
This will, almost, allow to parse gharchive dataset without jq. "almost"
because of two things:
- Tuple cannot be Nullable, so such keys with Tuple type in ClickHouse
cannot be `null` in JSON
- You cannot use dot.dot notation to extract columns for file() engine,
only tupleElement()
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-27 10:01:08 +01:00
DanRoscigno
f95bc55923
split Format settings out
2023-01-24 17:19:50 -05:00