Commit Graph

1560 Commits

Author SHA1 Message Date
Kruglov Pavel
558eda4146
Merge pull request #49412 from azat/block-use-dense-hash-map
Switch Block::NameMap to google::dense_hash_map over HashMap
2023-05-15 12:22:55 +02:00
Alexey Milovidov
f6144ee32b
Revert "Make Pretty formats even prettier." 2023-05-13 02:45:07 +03:00
Azat Khuzhin
2c40dd6a4c Switch Block::NameMap to google::dense_hash_map over HashMap
Since HashMap creates 2^8 elements by default, while dense_hash_map
should be good here.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-12 05:52:57 +02:00
Robert Schulze
9db78792d0
Fix MsgPackRowInputFormat.cpp build 2023-05-11 10:00:32 +00:00
Azat Khuzhin
d8dd50a9c6 Fix misc-misplaced-const clang-tidy warning
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-09 21:27:21 +02:00
Azat Khuzhin
d03ae2abfa Fix modernize-loop-convert clang-tidy warning
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-05-09 21:19:37 +02:00
Azat Khuzhin
833652b7c9 Revert "Suppress clang-analyzer-cplusplus.NewDelete in MsgPackRowInputFormat"
Let's try to revert this quirk during upgrading to clang 16.

This reverts commit c1e70169d2.
2023-05-09 20:36:39 +02:00
Alexey Milovidov
90b0de5677 Make Pretty prettier 2023-05-05 06:36:53 +02:00
Alexey Milovidov
179eddee01 Remove garbage from Pretty format 2023-05-05 04:44:47 +02:00
Michael Kolupaev
eb3b774ad0 Better control over Parquet row group size 2023-05-04 14:59:55 -07:00
Raúl Marín
f0e045bb3d Merge remote-tracking branch 'blessed/master' into arenita 2023-04-24 10:42:56 +02:00
Alexey Milovidov
b08f6b9dcc
Update LineAsStringRowInputFormat.cpp 2023-04-23 08:32:58 +03:00
Alexey Milovidov
54d10f87f2 Consistency of the LineAsString format 2023-04-23 05:50:46 +02:00
Alexander Gololobov
e6d34a9f8b
Merge pull request #48987 from Avogar/avoid-logical-error
Don't throw logical error when column is not found in Parquet/Arrow schema
2023-04-21 09:46:16 +02:00
robot-ch-test-poll1
f466c89621
Merge pull request #48911 from Avogar/parquet-metadata-format
Add ParquetMetadata input format to read Parquet file metadata
2023-04-21 03:46:26 +02:00
avogar
8a3e813ecd Don't throw logical error when column is not found in Parquet/Arrow schema 2023-04-20 19:09:40 +00:00
Kruglov Pavel
9bc95bed85
Merge pull request #48898 from Avogar/pretty-json
Add PrettyJSONEachRow format to output pretty JSON
2023-04-19 12:27:24 +02:00
Kruglov Pavel
8053b18c05
Merge pull request #48361 from Avogar/fix-arrow-dict-2
Fix serializing LowCardinality as Arrow dictionary
2023-04-19 12:23:27 +02:00
Kruglov Pavel
21dddf8c4c
Merge pull request #48864 from Avogar/fix-parquet-date32
Fix reading Date32 Parquet/Arrow column into not Date32 column
2023-04-19 09:16:42 +02:00
avogar
0878ab8443 Fix build 2023-04-18 19:51:53 +00:00
Kruglov Pavel
a5c52d3bc3
Merge branch 'master' into parquet-metadata-format 2023-04-18 21:51:14 +02:00
avogar
7a67951f64 Add more fields, fix style 2023-04-18 17:59:01 +00:00
avogar
b0e5f7069e Update exception message 2023-04-18 17:15:16 +00:00
avogar
c5efa4dc01 Add comment 2023-04-18 17:10:37 +00:00
avogar
f7f609dfb9 Better 2023-04-18 16:57:55 +00:00
avogar
b277a5c943 Add ParquetMetadata input format to read Parquet file metadata 2023-04-18 16:46:26 +00:00
Kruglov Pavel
8710c15c85
Apply suggestion 2023-04-18 18:25:54 +02:00
avogar
e356f92b77 Add PrettyJSONEachRow format to output pretty JSON 2023-04-18 13:28:59 +00:00
Kruglov Pavel
3bbc347901
Fix build 2023-04-17 22:22:26 +02:00
Kruglov Pavel
be0b0e7921
Fix build 2023-04-17 20:58:19 +02:00
avogar
527572e7bd Fix reading Date32 Parquet/Arrow column into not Date32 column 2023-04-17 16:51:22 +00:00
Kruglov Pavel
5c9b404c6e
Update src/Processors/Formats/Impl/CHColumnToArrowColumn.cpp
Co-authored-by: Yakov Olkhovskiy <99031427+yakov-olkhovskiy@users.noreply.github.com>
2023-04-17 14:02:07 +02:00
Raúl Marín
39f8c43a60 Merge remote-tracking branch 'blessed/master' into arenita 2023-04-17 10:33:38 +02:00
Michael Kolupaev
87be78e6de Better 2023-04-17 04:58:32 +00:00
Michael Kolupaev
e133633359 Parallel decoding with one row group per thread 2023-04-17 04:58:32 +00:00
Michael Kolupaev
2d4fe85513 Something 2023-04-17 04:58:32 +00:00
Michael Kolupaev
dc6e34075e Read less unnecessary data from Parquet files 2023-04-17 04:58:32 +00:00
Dmitry Novik
5cc9b46f78 Merge remote-tracking branch 'origin/master' into optimize-compilation 2023-04-13 16:04:09 +02:00
Raúl Marín
da9a539cf7 Reduce the usage of Arena.h 2023-04-13 10:31:32 +02:00
Robert Schulze
3f7ce60e03
Merge branch 'master' into rs/gcc-removal 2023-04-12 22:17:04 +02:00
Robert Schulze
05606a8835
Clean up GCC warning pragmas 2023-04-11 18:21:08 +00:00
Kruglov Pavel
0b17ac318f
Merge branch 'master' into fix-arrow-dict-2 2023-04-11 14:32:27 +02:00
Azat Khuzhin
5b2b20a0b0 Rename ThreadGroupStatus to ThreadGroup
There are methods like getThreadGroup() and ThreadGroupSwitcher class,
so seems that this is logical.

Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-04-07 15:31:48 +02:00
Kruglov Pavel
fb33d6bedb
Merge pull request #48126 from Avogar/orc-arrow-parquet-more-types
Support more ClickHouse types in ORC/Arrow/Parquet formats
2023-04-06 13:32:46 +02:00
Kruglov Pavel
11d3d01e50
Fix special build 2023-04-05 18:29:59 +02:00
Kruglov Pavel
9331c6c260
Merge pull request #48122 from Avogar/bson-more-types
Support Enum output/input in BSONEachRow, allow all map key types and avoid extra calculations
2023-04-05 18:26:19 +02:00
avogar
9d3790a4c8 Fix tests 2023-04-05 11:27:38 +00:00
Kruglov Pavel
031ab94636
Merge branch 'master' into fix-arrow-dict-2 2023-04-05 11:57:57 +02:00
Kruglov Pavel
8d6b709f1a
Merge pull request #48257 from Avogar/capnp-more-types
Support more types in CapnProto format, allow integer conversions
2023-04-05 11:56:46 +02:00
Kruglov Pavel
fad053c4fe
Make better 2023-04-04 16:10:47 +02:00