Commit Graph

408 Commits

Author SHA1 Message Date
Robert Schulze
c406663442
Docs: Replace annoying three spaces in enumerations by a single space 2023-04-19 15:56:55 +00:00
avogar
7a67951f64 Add more fields, fix style 2023-04-18 17:59:01 +00:00
avogar
2518ee4988 Remove deleted line 2023-04-18 16:58:43 +00:00
avogar
f7f609dfb9 Better 2023-04-18 16:57:55 +00:00
avogar
b277a5c943 Add ParquetMetadata input format to read Parquet file metadata 2023-04-18 16:46:26 +00:00
Kruglov Pavel
e9784b9444
Bring back accidentally deleted line 2023-04-18 16:06:21 +02:00
Kruglov Pavel
8aade2d39e
Fix typo 2023-04-18 15:56:41 +02:00
avogar
e47bb5cc72 Add docs 2023-04-18 13:35:28 +00:00
Sergei Trifonov
cc5a4c3853
Merge pull request #48568 from ClickHouse/serxa-patch-3
Update formats.md
2023-04-10 19:07:08 +02:00
Thom O'Connor
8b0adf58e5
Update formats.md
Including an Avro example with DESCRIBE url in S3
2023-04-10 10:17:43 -06:00
Sergei Trifonov
3bc5c6423b
Update formats.md 2023-04-09 16:54:21 +02:00
Kruglov Pavel
fb33d6bedb
Merge pull request #48126 from Avogar/orc-arrow-parquet-more-types
Support more ClickHouse types in ORC/Arrow/Parquet formats
2023-04-06 13:32:46 +02:00
Kruglov Pavel
9331c6c260
Merge pull request #48122 from Avogar/bson-more-types
Support Enum output/input in BSONEachRow, allow all map key types and avoid extra calculations
2023-04-05 18:26:19 +02:00
Kruglov Pavel
8bd997d680
Update docs/en/interfaces/formats.md
Co-authored-by: Yakov Olkhovskiy <99031427+yakov-olkhovskiy@users.noreply.github.com>
2023-04-05 11:57:41 +02:00
Kruglov Pavel
8d6b709f1a
Merge pull request #48257 from Avogar/capnp-more-types
Support more types in CapnProto format, allow integer conversions
2023-04-05 11:56:46 +02:00
Yakov Olkhovskiy
e1e508f8ec
Merge branch 'master' into bson-more-types 2023-04-03 17:21:50 -04:00
Kruglov Pavel
e2c3c3accc
Merge branch 'master' into orc-arrow-parquet-more-types 2023-04-03 18:33:20 +02:00
Kruglov Pavel
cef4a5b29c
Fix docs 2023-03-30 22:07:57 +02:00
Kruglov Pavel
a3908b54c1
Merge pull request #48124 from Avogar/msgpack-more-types
Support more ClickHouse types in MsgPack format
2023-03-30 22:03:47 +02:00
Kruglov Pavel
d096da0cfd
Merge branch 'master' into bson-more-types 2023-03-30 21:18:10 +02:00
avogar
35937adcaa Support more types in CapnProto format 2023-03-30 19:15:28 +00:00
robot-ch-test-poll
04d30a2afb
Merge pull request #48119 from Avogar/avro-more-types
Support more ClickHouse types in Avro format
2023-03-29 13:07:35 +02:00
avogar
bc56c02858 Support more ClickHouse types in ORC/Arrow/Parquet formats 2023-03-28 20:02:42 +00:00
avogar
a0fcf81abf Support more ClickHouse types in MsgPack format 2023-03-28 18:25:52 +00:00
avogar
e7ff6e85c2 Support Enum output/input in BSONEachRow, allow all map key types and avoid extra calculations 2023-03-28 17:57:23 +00:00
avogar
06a6f5cf76 Support more ClickHouse types in Avro format 2023-03-28 17:36:20 +00:00
DanRoscigno
6d8a2bbd48 standardize admonitions 2023-03-27 14:54:05 -04:00
rfraposa
747eaf3f72 Fix broken links 2023-03-23 14:40:45 -06:00
Alexey Milovidov
26c17b61fb
Merge pull request #47434 from Avogar/avro-improvements
Support Decimals and Date32 in Avro format
2023-03-18 22:16:34 +03:00
rfraposa
ac5ed141d8 New nav - reverting the revert 2023-03-17 21:45:43 -05:00
Alexander Tokmakov
ec44c8293a
Revert "New navigation" 2023-03-17 21:21:11 +03:00
rfraposa
7cb53ebc21 Justin feedback 2023-03-16 12:06:17 -05:00
Kruglov Pavel
6544e7c3ee
Fix typos 2023-03-10 17:11:27 +01:00
avogar
b6c91d9ace Support Decimals and Date32 in Avro format 2023-03-10 15:48:18 +00:00
Rich Raposa
66dd5b3834
Merge branch 'master' into new-nav 2023-03-09 08:12:23 -07:00
Kruglov Pavel
71b6d6c6ae
Merge pull request #47114 from Avogar/parquet-compression
Improve working with compression methods in Parquet/ORC/Arrow formats
2023-03-09 13:02:18 +01:00
rfraposa
5be7e09382 Update formats.md 2023-03-06 22:00:45 -07:00
Kruglov Pavel
3de905bb7c
Merge pull request #46616 from Avogar/fix-ipv4-ipv6-formats
Fix IPv4/IPv6 serialization/deserialization in binary formats
2023-03-06 19:40:29 +01:00
avogar
a6cf2cdab8 Fix style, add docs 2023-03-02 10:36:07 +00:00
Kruglov Pavel
36e65f5f84
Use versions vith dots 2023-02-27 19:00:40 +01:00
Kruglov Pavel
443dedddca
Merge branch 'master' into use-parquet-2 2023-02-27 14:31:43 +01:00
Kruglov Pavel
47f9ca2166
Merge branch 'master' into fix-ipv4-ipv6-formats 2023-02-23 20:32:43 +01:00
avogar
54622566df Add setting to change parquet version 2023-02-23 16:14:10 +00:00
Dan Roscigno
b6612d2c18
fix anchor link 2023-02-21 11:24:39 -05:00
avogar
e37f6b5457 Update docs 2023-02-20 19:50:25 +00:00
Kruglov Pavel
2a3cb8b4ee
Merge pull request #45340 from Avogar/parquet-fixed-binary
Support FixedSizeBinary type in Parquet/Arrow
2023-02-10 18:31:20 +01:00
Dan Roscigno
b33486d715
Update formats.md
closes
2023-02-09 12:52:01 -05:00
Kruglov Pavel
4e2918cee3
Merge branch 'master' into parquet-fixed-binary 2023-02-08 12:31:13 +01:00
Azat Khuzhin
1a8437f2c9 Add ability to ignore unknown keys in JSON object for named tuples
This can be useful in case your input JSON is complex, while you need
only few fields in it.

This behaviour is controlled by the
input_format_json_ignore_unknown_keys_in_named_tuple setting name, that
is turned OFF by default.

This will, almost, allow to parse gharchive dataset without jq. "almost"
because of two things:
- Tuple cannot be Nullable, so such keys with Tuple type in ClickHouse
  cannot be `null` in JSON
- You cannot use dot.dot notation to extract columns for file() engine,
  only tupleElement()

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-27 10:01:08 +01:00
Dan Roscigno
777ddf80ab
Update docs/en/interfaces/formats.md 2023-01-26 11:18:18 -05:00
Denys Golotiuk
0eadb7112d
Added markdown format docs 2023-01-26 13:33:14 +02:00
Dan Roscigno
1cc3708092
Merge branch 'master' into update-settings-url 2023-01-24 19:51:39 -05:00
DanRoscigno
77ae27f26c update for split of format settings 2023-01-24 19:37:55 -05:00
Kruglov Pavel
23c12ac8ee
Merge branch 'master' into parquet-fixed-binary 2023-01-24 16:51:05 +01:00
Kruglov Pavel
4bd3f0e5ef
Merge pull request #44953 from Avogar/tsv-csv-detect-header
Detect header in CSV/TSV/CustomSeparated files automatically
2023-01-24 15:13:52 +01:00
Rich Raposa
429e93965c
Update formats.md
Google has a new website for Protocol Buffers. The old link expires on Jan 31, 2023
2023-01-23 15:42:35 -07:00
avogar
5bf4704e7a Support FixedSizeBinary type in Parquet/Arrow 2023-01-16 21:01:31 +00:00
Kruglov Pavel
e9d6590926
Merge branch 'master' into tsv-csv-detect-header 2023-01-16 17:50:24 +01:00
avogar
1c0941d72a Add docs and examples 2023-01-16 16:46:41 +00:00
avogar
87b934c472 Insert default values in case of missing tuple elements in JSONEachRow 2023-01-12 16:36:44 +00:00
DanRoscigno
7168c217b0 switch text to response for query blocks 2023-01-11 10:08:11 -05:00
serxa
8d099a4417 make more SQL queries copyable from docs in one click 2023-01-11 13:43:51 +00:00
Ivan Blinkov
61c2f23713 Remove leftover empty lines at the end of markdown files 2023-01-09 15:15:18 +01:00
Ivan Blinkov
b7e082d033 Remove "Original article links" 2023-01-09 15:13:36 +01:00
DanRoscigno
925ce4b96c edits 2022-12-30 09:21:12 -05:00
DanRoscigno
0902db3fe0 edits 2022-12-29 22:34:25 -05:00
avogar
ae715b9d00 Finish docs 2022-12-29 20:42:03 +00:00
avogar
46b7ec4209 Add detailed documentation about schema inference 2022-12-29 13:42:56 +00:00
Yakov Olkhovskiy
9ce4e6b7e2
fix style 2022-12-16 17:30:40 -05:00
Yakov Olkhovskiy
bb5d7ff28b
Append requirement for FORMAT RowBinary with strict delimiter 2022-12-16 15:43:52 -05:00
avogar
d0f9bb2ec2 Allow to parse JSON objects into Strings 2022-12-08 18:58:18 +00:00
Kruglov Pavel
c35b2a6495
Add a limit for string size in RowBinary format (#43842) 2022-12-02 13:57:11 +01:00
Kruglov Pavel
98d6b96c82
Merge pull request #42033 from mark-polokhov/BSONEachRow
Add BSONEachRow input/output format
2022-11-22 14:45:21 +01:00
avogar
2af60f34eb Restrict document size in parallel parsing, allow to read ObjectId/JS code into String column 2022-11-15 13:35:17 +00:00
avogar
4d993e653a Fix build and style 2022-11-15 13:06:24 +00:00
avogar
842d25c358 Minor improvements, better docs 2022-11-14 20:05:01 +00:00
avogar
564d83bbc7 Better handle uint64 2022-11-11 13:24:12 +00:00
avogar
94c6dc42eb Use better types 2022-11-11 13:17:48 +00:00
avogar
9e89af28c6 Refactor BSONEachRow format, fix bugs, support more data types, support parallel parsing and schema inference 2022-11-10 20:15:14 +00:00
DanRoscigno
34f90ff6ef update pages that refer to dictionaries 2022-11-07 09:26:50 -05:00
Kruglov Pavel
9c1e654584
Fix typo 2022-09-28 16:38:04 +02:00
avogar
03ee7efcb9 Better example in docs 2022-09-28 12:48:31 +00:00
Kruglov Pavel
bfddb91c9a
Update docs/en/interfaces/formats.md
Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>
2022-09-28 14:15:53 +02:00
avogar
4f32ef9bb7 Add docs 2022-09-22 17:04:42 +00:00
Kruglov Pavel
22e11aef2d
Merge pull request #40910 from Avogar/new-json-formats
Add new JSON formats, add improvements and refactoring
2022-09-21 14:19:08 +02:00
avogar
868ce8bc16 Fix comments, make better naming, add docs, add setting output_format_json_quote_64bit_floats 2022-09-20 13:49:17 +00:00
DanRoscigno
149970447a add link to formats 2022-09-16 17:21:03 -04:00
avogar
ef7f7b30aa Fix tests 2022-09-15 18:20:17 +00:00
avogar
9abc6a9c3f Fix typos 2022-09-09 15:10:00 +00:00
avogar
ae7f9abd32 Add documentation 2022-09-09 12:04:49 +00:00
DanRoscigno
5b5fcc56aa add slugs 2022-08-28 10:53:34 -04:00
avogar
f925046dc4 Add more Pretty formats 2022-07-27 11:37:02 +00:00
Denny Crane
bca33a5a0a
Update formats.md 2022-07-24 20:57:37 -03:00
Denny Crane
1d5b2b770e
Update formats.md 2022-07-19 18:23:02 -03:00
Denny Crane
be1d40916a
Update formats.md 2022-07-19 18:22:21 -03:00
avogar
34a92383f1 Update docs 2022-06-30 16:17:14 +00:00
avogar
9bb68bc6de Add SQLInsert output format 2022-06-27 18:31:57 +00:00
Kruglov Pavel
d49dcc2b58
Fix using INSERT INTO ... FORMAT ... SETTINGS ... in docs 2022-06-09 16:16:43 +02:00
vdimir
dc772335ee
Doc typos, dict 2022-06-08 14:31:01 +02:00
vdimir
94bddec0bc
More typos checked and fixed 2022-06-08 14:28:41 +02:00
vdimir
9bd9c3d1d1
Fix some typos in docs 2022-06-08 14:26:02 +02:00
Kruglov Pavel
91c8951a1a
Merge pull request #37698 from Avogar/update_format_settings_docs
Refactor docs related to format settings
2022-06-07 13:28:36 +02:00
avogar
f65ad93ff0 Add some fixes 2022-06-03 12:11:46 +00:00
rfraposa
869967de41 Remove H1 anchor tags from docs 2022-06-02 04:55:18 -06:00
avogar
636cd6e1d3 Add fixes 2022-05-31 15:20:38 +00:00
avogar
858570d335 Refactor docs related to format settings 2022-05-31 15:18:49 +00:00
avogar
139a7e19a9 Fix comments 2022-05-30 11:43:29 +00:00
avogar
ede6e2f433 Add docs for settings 2022-05-25 15:10:20 +00:00
avogar
d2304f5d15 Make better 2022-05-20 12:07:29 +00:00
avogar
12010a81b7 Make better 2022-05-18 09:25:26 +00:00
mergify[bot]
4a661b6e78
Merge branch 'master' into json-columns-formats 2022-05-13 11:32:03 +00:00
rfraposa
3234534dd8 Fix table on formats.md page 2022-05-10 22:55:17 -06:00
avogar
04fdd75c56 Make JSONColumns frormats mono block by default 2022-05-09 11:13:44 +00:00
avogar
9c5ad1e773 Add docs 2022-05-06 16:51:50 +00:00
Kruglov Pavel
9618fddea2
Fix formats docs 2022-05-06 17:49:02 +02:00
Kruglov Pavel
4d08587559
Merge branch 'master' into mysqldump-format 2022-04-28 15:58:18 +02:00
vdimir
560d668c38
Add doc for output format Prometheus 2022-04-26 14:57:36 +00:00
avogar
33d845dade Add MySQLDump input format 2022-04-26 10:42:56 +00:00
rfraposa
8f01fe9c49 Revised /en folder 2022-04-09 07:34:21 -06:00
rfraposa
5250d9ad11 Removed /ja folder, cleaned up /ru markdown 2022-04-09 07:29:05 -06:00
Alexey Milovidov
9854b55835
Revert "Format changes for new docs" 2022-04-04 02:05:35 +03:00
rfraposa
421812a877 Updates /interfaces docs 2022-03-29 17:57:11 -06:00
rfraposa
33e28bcb18 Updated /development folder 2022-03-29 08:26:35 -06:00
Sergio Tulentsev
ef5062ec43
docs: Replace some usages of "rows" which are meant to be "strings"
The wrong term was confusing in the description of the CSV format.
2022-03-21 17:52:30 +00:00
taiyang-li
7a0e09db28 update data type mapping of clickhouse and arrow/orc/parquet 2022-03-19 11:25:59 +08:00
Robert Schulze
0d2ece6d91
Merge branch 'ClickHouse:master' into protobuf-batch-write 2022-03-16 09:43:33 +01:00
Sergei Trifonov
5fad4db4eb
LineAsString is supported output format now
support added in #30919
2022-03-15 14:56:43 +01:00
Robert Schulze
317166eac3
Document ProtobufList format 2022-03-15 14:11:32 +01:00
rfraposa
5a4466cec7 Update references in docs 2022-03-12 00:24:31 -06:00
Niek
a9fe2a0767
Update formats.md 2021-12-16 17:37:53 +01:00
Dmitriy
2dd24a887c Fix by comments
Поправил согласно комментариям разработчика.
2021-11-29 22:16:53 +03:00
Dmitriy
c7cd80dd6c Translate to Russian
Выполнил перевод на русский язык.
2021-11-26 20:25:45 +03:00
Dmitriy
cb22ad4ad1 Document some formats and settings
Задокументировал форматы CustomSeparatedWithNames, CustomSeparatedWithNamesAndTypes и семь настроек формата CustomSeparated.
2021-11-23 23:11:44 +03:00
Dmitriy
0f6d69fb0f Merge remote-tracking branch 'remotes/clickhouse/master' into sevirov-DOCSUP-16819-add_capnproto_output_format 2021-11-22 20:31:27 +03:00
Dmitriy
ba0adf0f97 Fix by comments
Поправил согласно комментариям разработчика.
2021-11-22 20:24:47 +03:00
sevirov
e40c7017b4
Update docs/en/interfaces/formats.md
Co-authored-by: gyuton <40863448+gyuton@users.noreply.github.com>
2021-11-19 20:02:43 +03:00
karnevil13
9c5e788949
Merge branch 'master' into karnevil-DOCSUP-17584 2021-11-18 23:50:07 +03:00
Dmitriy
d32d80a22e Add links
Добавил ссылки.
2021-11-16 20:10:08 +03:00
Dmitriy
f4d01eecf9 Fix link
Поправил ссылку.
2021-11-14 21:11:20 +03:00
Dmitriy
caf2aec7b8 Add CapnProto format
Задокументировал формат CapnProto.
2021-11-14 15:27:35 +03:00
karnevil13
27d98b8617 Updated gui.md, replicated.md & formats.md 2021-11-10 02:20:59 +03:00
Kruglov Pavel
7fd06937eb
Update formats.md 2021-11-08 15:23:26 +03:00
avogar
a3090fecc3 Update documentation about parsing enums in TSV/CSV 2021-11-08 14:42:46 +03:00
Kruglov Pavel
1f8535c02b
Merge branch 'master' into null-deserialization 2021-11-02 12:15:21 +03:00
mergify[bot]
1102b1dcef
Merge branch 'master' into tsv-csv 2021-10-31 12:03:24 +00:00
alexey-milovidov
94039ace63
Merge pull request #30799 from Algunenano/update_doc_ommitted_fields
Update docs to stop referencing insert_sample_with_metadata
2021-10-31 14:32:21 +03:00
avogar
a37bcb1feb Fix docs 2021-10-28 14:32:49 +03:00
Raúl Marín
d0fe359a23 Update docs to stop referencing insert_sample_with_metadata 2021-10-28 12:34:38 +02:00
avogar
b4aae86fff Add comments, update docs 2021-10-27 22:16:48 +03:00
avogar
d1ef96a5ef Add test, avoid unnecessary allocations, use PeekableReadBuffer only in corner case 2021-10-27 17:29:15 +03:00