avogar
58f2aca120
Fix tests
2022-03-18 19:04:16 +00:00
avogar
cffa2096de
Fix working with unneeded columns in Arrow/Parquet/ORC formats
2022-03-18 13:07:54 +00:00
Kruglov Pavel
aa3c05e9d4
Merge pull request #35152 from rschu1ze/protobuf-batch-write
...
ProtobufList
2022-03-18 13:24:34 +01:00
mergify[bot]
28734562bb
Merge branch 'master' into issue/#34890_fix_input_format_null_as_default_bug
2022-03-17 13:24:34 +00:00
Antonio Andelic
607f785e48
Revert "Merge pull request #35145 from bigo-sg/lower-column-name"
...
This reverts commit ebf72bf61d
, reversing
changes made to f1b812bdc1
.
2022-03-17 12:31:43 +00:00
Yakov Olkhovskiy
00700988ee
style fix
2022-03-17 02:31:01 -04:00
Yakov Olkhovskiy
7bb66e6702
added INTERPOLATE extension for ORDER BY WITH FILL
2022-03-17 01:51:35 -04:00
Anton Popov
2ced42ed41
add experimental settings for Object type
2022-03-16 16:51:23 +00:00
Anton Popov
0ba78c3c3a
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-16 15:28:09 +00:00
avogar
f7c5fe14e4
Fix schema inference for TSKV format while using small max_read_buffer_size
2022-03-16 13:53:50 +00:00
Robert Schulze
0d2ece6d91
Merge branch 'ClickHouse:master' into protobuf-batch-write
2022-03-16 09:43:33 +01:00
Robert Schulze
23122cb327
Fix review comments
...
ParquetBlockOutputFormat.cpp:
- undo unrelated formatting
ProtobufSerializer.cpp:
- undef debug tracing
- simplify logic in writeRow()
ProtobufSchemas.cpp:
- restore original search in cache by message type
2022-03-15 11:27:17 +01:00
Maksim Kita
2665724301
Fix clang-tidy warnings in Parsers, Processors, QueryPipeline folders
2022-03-14 18:17:35 +00:00
Anton Popov
36ec379aeb
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-14 16:28:35 +00:00
Antonio Andelic
ebf72bf61d
Merge pull request #35145 from bigo-sg/lower-column-name
...
add setting to lower column case when reading parquet/orc file
2022-03-14 11:25:03 +01:00
Maksim Kita
ad6b3693e1
Merge pull request #35123 from zhanghuajieHIT/fix_build_fail_with_gcc
...
fix build fail with gcc
2022-03-14 10:36:15 +01:00
Kseniia Sumarokova
58a2d2b458
Merge pull request #35118 from zzsmdfj/issue/#31469_MaterializedMysql_mysqlDate2CkDate32
...
to #31469_MaterializedMysql_mysqlDate2CkDate32
2022-03-14 10:32:33 +01:00
Robert Schulze
514d4d2187
Implement ProtobufList - fixes ClickHouse#16436
...
Introduce IO format "ProtobufList" with protobuf schema
// schemafile.proto
message Envelope {
message MessageType {
uint32 colA = 1;
string colB = 2;
}
repeated MessageType mt = 1;
}
where "Envelope" is a hard-coded/expected top-level message and
"MessageType" is a message with user-provided name containing the table
fields to export/import, e.g.
SELECT * FROM db1.tab1 FORMAT ProtobufList SETTINGS format_schema =
'schemafile:MessageType'
As a result, the new format wraps a list of messages (one per row) into
a single, containing message. Compare that to the schema of the existing
IO formats "Protobuf" and "ProtobufSingle":
message MessageType {
uint32 colA = 1;
string colB = 2;
}
The new format does not save space compared to the existing formats, but
it is conceptually a bit more beautiful and also more convenenient.
Implementation details:
- Created new files ProtobufList(Input|Output)Format which use the
existing ProtobufSerializer mechanism. The goal was to reuse as much
code as possible and avoid copypasta.
- I was torn between inheriting from I(Input|Output)Format vs.
IRow(Input|Output)Format for ProtobufList(Input|Output)Format. The
former is chunk-based which can be better for performance. Since the
ProtobufSerializer mechanism is row-based but data is generally passed
around in chunks, I decided for the latter to leverage the existing
chunk <--> row mapping code in IRow(InputOutput)Format.
- A new ProtobufSerializer called ProtobufSerializerEnvelope was
introduced (--> ProtobufSerializer.cpp). It represents the top-level
message which encloses the list of inner nested messages, i.e. the
rows.
- With the new format, parsing the schema file and matching the fields in
the schema file to table column works like for the old formats. The only
difference is that parsing starts one level below the "Envelope" (-->
ProtobufSchema.cpp). This is more natural than forcing customers to
have table columns start with "Envelope".
- Creation of the ProtobufSerializer tree also works like before. What
is different is that we finally add a ProtobufSerializerEnvelope as
new root of the tree. It's only purpose is to write/read the top-level
message for the first/last row to write/read.
Caveats:
- The low-level serialization code in ProtobufWriter uses an internal
buffer which is flushed to the output file only in endMessage().
In the existing "Protobuf" format, this happens once per row, in the
new format this happens only at the end of the serialization
since row-level messages now call start/endNestedMessage(). As a
future TODO to, the buffer should be flushed also in
start/endNestedMessage() to reduce memory consumption.
2022-03-14 08:04:58 +01:00
Maksim Kita
ce0c8e5597
Update JSONRowOutputFormat.cpp
2022-03-14 00:58:36 +01:00
Robert Schulze
f0ba39b071
Clean up some header includes and make formatting more consistent
2022-03-13 20:24:12 +01:00
zhanghuajie
53a8987b3b
fix build fail with gcc --fix warnings without disabling some parameters
2022-03-11 21:59:19 +08:00
shuchaome
7a3623d216
fix bug
2022-03-11 17:26:13 +08:00
Nikolai Kochetov
47f4bd30cd
Merge pull request #35186 from amosbird/fixwithtotalemptychunk
...
Fix empty chunk in with total transform
2022-03-11 10:24:19 +01:00
metahys
ff934cf0c2
Fix unexpected result when use -state type aggregate function in window frame ( #34999 )
...
* Fix unexpected result when use -state type aggregate function in window frame
* fix style
* fix style
* fix test
* fix flaky test
* fix flaky test
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
2022-03-11 11:54:17 +03:00
shuchaome
46cb4483a6
Optimise by lowering schema on the beginning. Add a functional test.
2022-03-11 14:34:46 +08:00
Amos Bird
a1b61dabfd
Fix empty chunk in with total transform.
2022-03-10 23:27:36 +08:00
mergify[bot]
c326ebd67f
Merge branch 'master' into issue/#34890_fix_input_format_null_as_default_bug
2022-03-09 15:59:04 +00:00
shuchaome
b7cd85df6b
remove unused column_names in ORCBlockInputFormat
2022-03-09 18:16:22 +08:00
shuchaome
bb50133424
Apply suggestions from code review
...
Co-authored-by: Antonio Andelic <antonio2368@users.noreply.github.com>
2022-03-09 17:32:27 +08:00
shuchaome
9647818adc
add unlikely for performance
2022-03-09 17:02:07 +08:00
shuchaome
8027bb1e32
modify code style
2022-03-09 16:32:18 +08:00
shuchaome
56795b831d
add setting to lower column case when reading parquet/orc file
2022-03-09 16:07:02 +08:00
zhanghuajie
11dde7c127
fix build fail with gcc
2022-03-08 22:34:51 +08:00
zzsmdfj
67b9f81104
to #31469_MaterializedMysql_mysqlDate2CkDate32
2022-03-08 18:17:22 +08:00
zzsmdfj
7252c18ff0
to #34890_fix_input_format_null_as_default_bug
2022-03-04 15:04:43 +08:00
Anton Popov
df3b07fe7c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-03 22:25:28 +00:00
Nikolai Kochetov
32120d5dec
Merge pull request #34993 from ClickHouse/try-fix-delayed-source
...
Avoid pushing to port with data inside DelayedSource
2022-03-03 13:44:45 +01:00
Maksim Kita
b1a956c5f1
clang-tidy check performance-move-const-arg fix
2022-03-02 18:15:27 +00:00
Maksim Kita
1f5837359e
clang-tidy check performance-noexcept-move-constructor fix
2022-03-02 18:15:27 +00:00
Nikolai Kochetov
ed8dfc14d4
Avoid pushing to port with data inside DelayedSource
2022-03-02 14:21:58 +01:00
Anton Popov
2758db5341
add more comments
2022-03-01 19:32:55 +03:00
Anton Popov
fcdebea925
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-25 13:41:30 +03:00
zvonand
90c857c5e3
merge
2022-02-17 18:23:37 +03:00
Kruglov Pavel
5e8b2228e0
Merge pull request #34561 from bigo-sg/arrow_type_timestamp
...
Implement transformation between CH DateTime64 and arrow timestamp column
2022-02-17 16:55:17 +03:00
zvonand
cf244689a2
fixed filling transform
2022-02-16 15:14:25 +03:00
Anton Popov
72e75fdaf5
Merge pull request #34601 from CurtizJ/filtering-by-sparse-columns
...
Support filtering by sparse columns without conversion to full
2022-02-15 23:26:13 +03:00
Anton Popov
7cddae1351
return back result_size_hint
2022-02-15 15:12:25 +03:00
Anton Popov
5c316ffabe
support filtering by sparse columns without convertion to full
2022-02-15 14:30:54 +03:00
Kruglov Pavel
cf454a6539
Merge pull request #34532 from CurtizJ/fix-aggregation-in-order-3
...
Fix aggregation in order with distributed_aggregation_memory_efficient=0
2022-02-15 14:26:15 +03:00
zvonand
888542e29b
add[interval] no longer oses decimal components
...
Not only support for better subsecond logic, but also fewer conversions
-> faster operation
2022-02-14 02:52:56 +03:00
zvonand
2454f1dd13
Decimal components attempt
2022-02-14 00:04:09 +03:00
taiyang-li
e53719a86b
remove comments
2022-02-13 17:13:23 +08:00
taiyang-li
aabf2aac69
finish all tests
2022-02-13 17:06:58 +08:00
taiyang-li
6559941972
support datetime64 when transform ch chunk to arrow table
2022-02-13 14:56:01 +08:00
alexey-milovidov
4a2c69c073
Merge pull request #34067 from Algunenano/mv_fixes
...
Fix `parallel_view_processing=0` and `view_duration_ms` in views log
2022-02-12 22:36:41 +03:00
Anton Popov
357bdd69c4
fix aggregation in order with distributed_aggregation_memory_efficient=0
2022-02-11 18:09:13 +03:00
Vladimir C
a2b1900333
Merge pull request #34496 from Avogar/jsonl
...
Support .jsonl extension for JSONEachRow format
2022-02-11 15:44:31 +01:00
W
7cb0433fae
Update buildPushingToViewsChain.h
...
typo
2022-02-11 14:34:20 +08:00
avogar
9e58ae7577
Support jsonl extension for JSONEachRow format
2022-02-10 16:00:37 +03:00
Kruglov Pavel
a4f5610764
Merge pull request #34476 from CurtizJ/avoid-settings-copy
...
Avoid unnecessary copying of `Settings`
2022-02-10 14:13:46 +03:00
Anton Popov
298838f891
avoid unnecessary copying of Settings
2022-02-10 12:13:51 +03:00
Anton Popov
18940b8637
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-09 23:38:38 +03:00
mergify[bot]
d78525bd10
Merge branch 'master' into fix-removing-order-in-CreatingSetsTransform
2022-02-09 13:55:52 +00:00
Azat Khuzhin
4fa2ae76bc
Fix memory leak in AggregatingInOrderTransform
...
Reproducer:
# NOTE: we need clickhouse from 33957 since right now LSan is broken due to getauxval().
$ url=https://s3.amazonaws.com/clickhouse-builds/33957/e04b862673644d313712607a0078f5d1c48b5377/package_asan/clickhouse
$ wget $url -o clickhouse-asan
$ chmod +x clickhouse-asan
$ ./clickhouse-asan server &
$ ./clickhouse-asan client
:) create table data (key Int, value String) engine=MergeTree() order by key
:) insert into data select number%5, toString(number) from numbers(10e6)
# usually it is enough one query, benchmark is just for stability of the results
# note, that if the exception was not happen from AggregatingInOrderTransform then add --continue_on_errors and wait
$ ./clickhouse-asan benchmark --query 'select key, uniqCombined64(value), groupArray(value) from data group by key' --optimize_aggregation_in_order=1 --memory_tracker_fault_probability=0.01, max_untracked_memory='2Mi'
LSan report:
==24595==ERROR: LeakSanitizer: detected memory leaks
Direct leak of 3932160 byte(s) in 6 object(s) allocated from:
0 0xcadba93 in realloc ()
1 0xcc108d9 in Allocator<false, false>::realloc() obj-x86_64-linux-gnu/../src/Common/Allocator.h:134:30
2 0xde19eae in void DB::PODArrayBase<>::realloc<DB::Arena*&>(unsigned long, DB::Arena*&) obj-x86_64-linux-gnu/../src/Common/PODArray.h:161:25
3 0xde5f039 in void DB::PODArrayBase<>::reserveForNextSize<DB::Arena*&>(DB::Arena*&) obj-x86_64-linux-gnu/../src/Common/PODArray.h
4 0xde5f039 in void DB::PODArray<>::push_back<>(DB::GroupArrayNodeString*&, DB::Arena*&) obj-x86_64-linux-gnu/../src/Common/PODArray.h:432:19
5 0xde5f039 in DB::GroupArrayGeneralImpl<>::add() const obj-x86_64-linux-gnu/../src/AggregateFunctions/AggregateFunctionGroupArray.h:465:31
6 0xde5f039 in DB::IAggregateFunctionHelper<>::addBatchSinglePlaceFromInterval() const obj-x86_64-linux-gnu/../src/AggregateFunctions/IAggregateFunction.h:481:53
7 0x299df134 in DB::Aggregator::executeOnIntervalWithoutKeyImpl() obj-x86_64-linux-gnu/../src/Interpreters/Aggregator.cpp:869:31
8 0x2ca75f7d in DB::AggregatingInOrderTransform::consume() obj-x86_64-linux-gnu/../src/Processors/Transforms/AggregatingInOrderTransform.cpp:124:13
...
SUMMARY: AddressSanitizer: 4523184 byte(s) leaked in 12 allocation(s).
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-02-09 09:23:56 +03:00
Maksim Kita
b8e350054c
clang-tidy move fix build
2022-02-08 21:21:32 +00:00
Nikolai Kochetov
82a7d70a31
Merge branch 'master' into fix-removing-order-in-CreatingSetsTransform
2022-02-08 19:29:03 +03:00
Nikolai Kochetov
d2d47b9595
Fixing build.
2022-02-08 16:27:33 +00:00
Maksim Kita
4bb69bcb15
Merge pull request #34398 from DevTeamBK/input_format
...
Method called on already moved
2022-02-08 15:20:07 +01:00
Nikolai Kochetov
7e54dafdc1
Fix wrong destruction order in CreatingSetsTransform.
2022-02-08 10:41:07 +00:00
Kruglov Pavel
b4fec2af7c
Merge pull request #34065 from Avogar/msgpack
...
Support UUID in MsgPack format
2022-02-08 11:42:17 +03:00
Rajkumar
6b3adbb0de
Method called on already moved
2022-02-07 19:50:34 -08:00
avogar
a4c7ecde87
Make better
2022-02-07 17:51:26 +03:00
avogar
c3d30fd502
Fix comments
2022-02-07 17:11:44 +03:00
Kruglov Pavel
34a17075d3
FIx error messages
2022-02-07 17:11:44 +03:00
avogar
77b42bb9ff
Support UUID in MsgPack format
2022-02-07 17:11:44 +03:00
HeenaBansal2009
eeec2478ba
Fix clang-tidy issue
2022-02-06 22:36:35 -08:00
Alexey Milovidov
f98010e374
Small improvements
2022-02-06 07:14:01 +03:00
Alexey Milovidov
4a83dbc514
Fix linkage
2022-02-04 00:26:44 +03:00
Alexey Milovidov
c426f11096
Maybe better
2022-02-04 00:20:16 +03:00
Alexey Milovidov
7c12f5f37a
Fix terribly low performance of LineAsString
format
2022-02-04 00:07:31 +03:00
Anton Popov
9b844c6b42
Merge pull request #32748 from CurtizJ/read-in-order-fixed-prefix
...
Support `optimize_read_in_order` if prefix of sorting key is already sorted
2022-02-03 18:17:08 +03:00
mergify[bot]
150d7ba8b5
Merge branch 'master' into mv_fixes
2022-02-03 00:41:52 +00:00
Azat Khuzhin
1d19851590
Disable data skipping indexes by default for queries with FINAL
...
This patch adds use_skip_indexes_if_final setting, that is OFF by
default. Since skipping data for queries with FINAL may produce
incorrect result.
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-02-02 13:31:50 +03:00
Anton Popov
836a348a9c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-01 15:23:07 +03:00
mergify[bot]
4f0e011958
Merge branch 'master' into mv_fixes
2022-01-31 23:53:35 +00:00
Alexey Milovidov
e4e7169277
Remove some strange code
2022-02-01 02:52:36 +03:00
Alexey Milovidov
83136f3515
Allow \r in the middle of the line in format Regexp
2022-02-01 02:49:26 +03:00
Alexey Milovidov
872d0a0fbe
Improve performance of format Regexp
2022-02-01 02:07:48 +03:00
alesapin
dd61d1c2de
Merge pull request #34172 from ClickHouse/fix_race_in_some_engines
...
Fix benign race condition for storage HDFS, S3, URL
2022-01-31 22:41:54 +03:00
alesapin
93c0700c4c
Fix typo
2022-01-31 16:46:58 +03:00
alesapin
056b9e335f
Fix comment
2022-01-31 16:39:42 +03:00
alesapin
31753afb7e
Fix cancel logic in parallel parsing
2022-01-31 16:38:15 +03:00
Maksim Kita
5ef83deaa6
Update sort to pdqsort
2022-01-30 19:49:48 +00:00
Anton Popov
78b9f15abb
Merge remote-tracking branch 'upstream/master' into HEAD
2022-01-30 03:24:37 +03:00
Anton Popov
b950a12cb3
Merge pull request #34068 from CurtizJ/fix-async-insert-native
...
Fix asynchronous inserts with `Native` format
2022-01-29 01:24:53 +03:00
Azat Khuzhin
1519985c98
Fix possible "Can't attach query to the thread, it is already attached"
...
After detachQueryIfNotDetached() had been removed it is not enough to
use attachTo() for ThreadPool (scheduleOrThrowOnError()) since the query
may be already attached, if the thread doing multiple jobs, so
CurrentThread::attachToIfDetached() should be used instead.
This should fix all the places from the failures on CI [1]:
$ fgrep DB::CurrentThread::attachTo -A1 ~/Downloads/47.txt | fgrep -v attachTo | cut -d' ' -f5,6 | sort | uniq -c
92 --
2 /fasttest-workspace/build/../../ClickHouse/contrib/libcxx/include/deque:1393: DB::ParallelParsingInputFormat::parserThreadFunction(std::__1::shared_ptr<DB::ThreadGroupStatus>,
4 /fasttest-workspace/build/../../ClickHouse/src/Storages/MergeTree/MergeTreeData.cpp:1595: void
87 /fasttest-workspace/build/../../ClickHouse/src/Storages/MergeTree/MergeTreeDataSelectExecutor.cpp:993: void
[1]: https://github.com/ClickHouse/ClickHouse/runs/4954466034?check_suite_focus=true
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-01-28 16:25:33 +03:00
Azat Khuzhin
b0c862c297
Fix memory accounting for queries that uses < max_untracker_memory
...
MemoryTracker starts accounting memory directly only after per-thread
allocation exceeded max_untracker_memory (or memory_profiler_step).
But even memory under this limit should be accounted too, and there is
code to do this in ThreadStatus dtor, however due to
PullingAsyncPipelineExecutor detached the query from thread group that
memory was not accounted.
So remove CurrentThread::detachQueryIfNotDetached() from threads that
uses ThreadFromGlobalPool since it has ThreadStatus, and the query will
be detached using CurrentThread::defaultThreadDeleter.
Note, that before this patch memory accounting works for HTTP queries
due to it had been accounted from ParallelFormattingOutputFormat, but
not for TCP.
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-01-28 16:25:33 +03:00
Anton Popov
6c0959b907
fix asynchronous inserts with Native format
2022-01-28 03:25:15 +03:00
Raúl Marín
5f7ac9fbb1
MV: Limit the chain threads based on parallel_view_processing
2022-01-27 18:13:30 +01:00
Raúl Marín
3df56f46b3
Fix MV view_duration_ms
2022-01-27 18:09:47 +01:00
tavplubix
9e91a9dfd1
Merge pull request #33429 from zzsmdfj/MaterializeMySQL_support_set_and_other_dataType
...
MaterializeMySQL support SET YEAR TIME GEOMETRY dataType
2022-01-26 11:29:46 +03:00
Kruglov Pavel
9f12f4af13
Merge pull request #33302 from Avogar/formats-with-suffixes
...
Allow to create new files on insert for File/S3/HDFS engines
2022-01-25 10:56:15 +03:00
avogar
1f49acc164
Better naming
2022-01-24 16:28:36 +03:00
Anton Popov
6861adadcf
Merge pull request #33637 from azat/fix-optimize_aggregation_in_order
...
RFC: Fix "Chunk should have AggregatedChunkInfo in GroupingAggregatedTransform"
2022-01-22 19:05:39 +03:00
Anton Popov
e8ce091e68
Merge remote-tracking branch 'upstream/master' into HEAD
2022-01-21 20:11:18 +03:00
zzsmdfj
411f43ec4d
to MaterializeMySQL_support_set_and_other_dataType
2022-01-21 12:24:12 +08:00
avogar
67e396f8f4
Fix schema inference for JSONEachRow and JSONCompactEachRow
2022-01-20 16:31:24 +03:00
mergify[bot]
cbb87e762c
Merge branch 'master' into formats-with-suffixes
2022-01-19 17:21:18 +00:00
Amos Bird
f65d77cc74
Hotfix of missing header
2022-01-19 14:11:42 +08:00
mergify[bot]
b318f9b5db
Merge branch 'master' into formats-with-suffixes
2022-01-18 12:17:07 +00:00
Anton Popov
a25f2518e3
Merge pull request #33141 from 1over/feature_default_keyword
...
Add support of DEFAULT keyword for INSERT
2022-01-18 02:04:37 +03:00
Kruglov Pavel
a7df9cd53a
Merge branch 'master' into formats-with-suffixes
2022-01-14 21:03:49 +03:00
avogar
253035a5df
Fix
2022-01-14 19:17:06 +03:00
Kruglov Pavel
d2e9f37bee
Merge branch 'master' into format-by-extention
2022-01-14 18:36:23 +03:00
avogar
89a181bd19
Make better
2022-01-14 18:16:18 +03:00
Azat Khuzhin
a4c2f23b07
Fix "Chunk should have AggregatedChunkInfo in GroupingAggregatedTransform"
...
In case of optimize_aggregation_in_order there will be
ChunkInfoWithAllocatedBytes.
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-01-14 17:44:09 +03:00
Azat Khuzhin
06402386eb
Do not generate empty chunks in AggregatingInOrderTransform
...
This is just a micro optimization and it should not affect anything,
real fixes are in separate patches (previous and next).
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-01-14 17:44:09 +03:00
Azat Khuzhin
d9a64d1f86
Fix optimize_aggregation_in_order in case of empty result set
...
Note, that this is not complete fix, see the next two patches.
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-01-14 17:43:52 +03:00
Kruglov Pavel
5a908e8edd
Merge branch 'master' into formats-with-suffixes
2022-01-14 16:45:20 +03:00
Kruglov Pavel
d54a430d9c
Merge pull request #33566 from Avogar/fix-avro
...
Fix segfault in Avro
2022-01-14 16:01:56 +03:00
Kseniia Sumarokova
5da673c3a5
Merge pull request #31104 from bigo-sg/hive_table
...
Implement hive table engine
2022-01-14 09:39:17 +03:00
Kruglov Pavel
305d58a762
Merge pull request #33524 from Avogar/stacktrace-in-client
...
Don't print exception twice in client in case of exception in parallel parsing
2022-01-13 15:50:42 +03:00
taiyang-li
82a3024712
Merge remote-tracking branch 'ck/master' into bigo_hive_table
2022-01-13 18:11:23 +08:00
Nikolai Kochetov
872ee5dc09
Update src/Processors/Formats/Impl/AvroRowOutputFormat.h
...
Co-authored-by: Bharat Nallan <bharatnc@gmail.com>
2022-01-13 12:55:14 +03:00
tavplubix
bacdaa7b4d
Merge pull request #33564 from ClickHouse/fix_minor_bug_in_ttl
...
Fix minor bug in TTL and flaky test
2022-01-13 11:51:57 +03:00
avogar
c5ea4b1bc0
Fix segfault in Avro
2022-01-12 18:34:28 +03:00
avogar
8390e9ad60
Detect format by file name in file/hdfs/s3/url table functions
2022-01-12 18:29:31 +03:00
Alexander Tokmakov
72b67f37fc
fix log message and flaky test
2022-01-12 18:20:22 +03:00
lgbo-ustc
5c71d3687a
fixed some bugs
...
1. interagtion test for test_hive_query failed
2. nullptr reference in arrowSchemaToCHHeader
2022-01-12 17:01:05 +08:00
taiyang-li
66813a3aa9
merge master
2022-01-12 16:56:29 +08:00
avogar
9915ce7ded
Fix segfault in arrowSchemaToCHHeader
2022-01-11 20:30:35 +03:00
avogar
0ae0aa712b
Don't print exception twice in client in case of exception in parallel parsing
2022-01-11 18:37:07 +03:00
msaf1980
780a1b2abe
graphite: split tagged/plain rollup rules (for merges perfomance)
2022-01-10 16:34:16 +05:00
Alexey Milovidov
a75f1ccb21
Fix build
2022-01-09 12:26:36 +03:00
Azat Khuzhin
c05dac89ee
ReadFromRemote: Remove redundant ParallelReplicasReadingCoordinator
2022-01-07 20:04:57 +03:00
李扬
2df2442ad0
Merge branch 'master' into hive_table
2022-01-04 01:26:16 -06:00
taiyang-li
8730dda895
fix hivte text
2022-01-01 09:16:30 +08:00
taiyang-li
1e102bc1b2
merge master
2022-01-01 09:01:06 +08:00
alexey-milovidov
34b934a1e0
Merge pull request #33331 from ClickHouse/serxa/line-as-string-output-format
...
Add LineAsString output format
2021-12-31 14:38:36 +03:00
Sergei Trifonov
f1d398ae4b
Add LineAsString output format
2021-12-30 20:38:54 +03:00
alesapin
16c36d72b1
Merge pull request #33296 from ClickHouse/fix_clang_tidy_3
...
Fix clang tidy 3
2021-12-29 22:43:42 +03:00
avogar
97788b9c21
Allow to create new files on insert for File/S3/HDFS engines
2021-12-29 21:19:13 +03:00
Kruglov Pavel
489a30859f
Merge pull request #32455 from Avogar/schema-inference
...
Automatic schema inference for input formats
2021-12-29 21:03:48 +03:00
alesapin
67fc9d7129
Fix clang-tidy
2021-12-29 18:35:54 +03:00
Nikolai Kochetov
c715204e18
Merge pull request #32271 from Algunenano/kill_scalar_github
...
Be able to KILL scalar queries
2021-12-29 16:00:59 +03:00
Kruglov Pavel
dd7f61b337
Merge branch 'master' into schema-inference
2021-12-29 12:59:05 +03:00
avogar
364b4f5d36
Fix special build
2021-12-29 12:21:01 +03:00
Kruglov Pavel
cb0ed7fcb7
Fix typo
2021-12-29 12:21:01 +03:00
avogar
26abf7aa62
Remove code duplication, use simdjson and rapidjson instead of Poco
2021-12-29 12:21:01 +03:00
avogar
74f09d6476
Fix tests
2021-12-29 12:18:56 +03:00
avogar
aaf9f85c67
Add more tests and fixes
2021-12-29 12:18:56 +03:00
avogar
dd994aa761
Add some tests and some fixes
2021-12-29 12:18:56 +03:00
avogar
8112a71233
Implement schema inference for most input formats
2021-12-29 12:18:56 +03:00
Maksim Kita
b5698ea134
Updated test_executable_table_function tests
2021-12-28 22:55:31 +03:00
Maksim Kita
d1db3c9f42
Updated ShellCommandSource
2021-12-28 22:55:31 +03:00
Maksim Kita
3386378050
Updated test_executable_table_function tests
2021-12-28 22:55:31 +03:00
Maksim Kita
61b36f22b8
Fixed style check
2021-12-28 22:55:31 +03:00
Maksim Kita
fbe65658a1
Fixed tests
2021-12-28 22:55:31 +03:00
Maksim Kita
c2977dbf58
Updated test_executable_table_function integration tests
2021-12-28 22:55:30 +03:00
Maksim Kita
de49a25185
Fixed build
2021-12-28 22:55:30 +03:00
Maksim Kita
cda8bd7f44
Added test_executable_dictionary source tests
2021-12-28 22:55:30 +03:00
Maksim Kita
5590cfa3aa
Updated executable function integration tests
2021-12-28 22:55:30 +03:00
Maksim Kita
d3e507b9ef
Added executable function integrations tests
2021-12-28 22:55:30 +03:00
Maksim Kita
2ffd83ee83
Fix shell command source start
2021-12-28 22:55:30 +03:00
Maksim Kita
eb4e400c54
Executable refactoring
2021-12-28 22:55:30 +03:00
alexey-milovidov
934c1b3e83
Merge pull request #33238 from kssenii/some-killing
...
Allow to cancel some sources which failed to cancel
2021-12-28 22:00:44 +03:00
taiyang-li
aa97634d9b
merge master
2021-12-28 18:38:50 +08:00
kssenii
1f6ca619b7
Allow some killing
2021-12-27 22:42:56 +03:00
kssenii
1bd4936961
Add retry for postgres query
2021-12-27 11:03:04 +03:00
taiyang-li
9036b18c2f
merge master
2021-12-27 15:12:48 +08:00
alexey-milovidov
0a55fa3dc2
Revert "Grouping sets dev"
2021-12-25 20:30:31 +03:00
alexey-milovidov
c583ea7e6b
Merge pull request #32484 from Algunenano/libcxx13_take2
...
libc++ 13 compatibility
2021-12-25 10:14:12 +03:00
Andrii Buriachevskyi
e8cc6df7bb
Add support of DEFAULT keyword for INSERT
2021-12-24 13:10:19 +01:00
Alexey Milovidov
29d28c531f
Move code around to avoid dlsym on Musl
2021-12-24 12:25:27 +03:00
Anton Popov
1f67177d4d
fix optimize_read_in_order with preliminary merge and add some tests
2021-12-23 18:32:31 +03:00
Raúl Marín
cb22091b33
Merge remote-tracking branch 'blessed/master' into kill_scalar_github
2021-12-23 13:59:33 +01:00
Raúl Marín
77db850c0b
Merge remote-tracking branch 'blessed/master' into libcxx13_take2
2021-12-23 12:42:39 +01:00
Nikolai Kochetov
b84591d17a
Merge pull request #32862 from ClickHouse/fix-mv-result-with-many-chunks
...
Fix MV query with multiple chunk result.
2021-12-23 14:37:17 +03:00
Raúl Marín
88b8fd8b60
Merge remote-tracking branch 'blessed/master' into libcxx13_take2
2021-12-23 09:16:19 +01:00
Alexey Milovidov
f37ff32c37
Whitespaces
2021-12-23 01:33:47 +03:00
mreddy017
3e50217501
Remove the additional white space as per the pipeline build error.
2021-12-23 01:30:56 +03:00
mreddy017
10eb2dbdb7
Addressing review comments
2021-12-23 01:30:56 +03:00
Harry-Lee
846c46ac4b
Fix issue #80 : union index out of boundary
2021-12-23 01:30:56 +03:00
Raúl Marín
dbb13036bc
Hook progress callbacks with subqueries too
2021-12-22 23:13:45 +01:00
Raúl Marín
1d9916eb25
Merge remote-tracking branch 'blessed/master' into kill_scalar_github
2021-12-22 16:42:10 +01:00
tavplubix
d8fb293d57
Merge pull request #32900 from zzsmdfj/issue/#15182_MaterializeMySQL_support_bit_type
...
Add support for BIT data type in MaterializedMySQL
2021-12-22 15:58:29 +03:00
zzsmdfj
039afc5cfe
#15182_MaterializeMySQL_support_bit_type-improve bit2Uint64 function
2021-12-22 16:17:18 +08:00
Raúl Marín
eddccb0065
Merge remote-tracking branch 'blessed/master' into kill_scalar_github
2021-12-21 10:00:17 +01:00
taiyang-li
2597925724
merge master
2021-12-21 15:55:39 +08:00
Maksim Kita
dd0d3de050
Merge pull request #32970 from kitaisreal/loops-remove-postfix-increment
...
Loops remove postfix increment
2021-12-20 19:51:07 +03:00
Raúl Marín
b553e51969
Merge remote-tracking branch 'blessed/master' into libcxx13_take2
2021-12-20 17:47:57 +01:00
Maksim Kita
51477adf1b
Updated additional cases
2021-12-20 15:55:07 +03:00
kreuzerkrieg
f06c37d206
Stop reading incomplete stripes and skip rows.
2021-12-19 18:41:32 +02:00
alexey-milovidov
6b97af4c63
Merge pull request #26869 from taylor12805/grouping-sets-dev
...
Grouping sets dev
2021-12-17 20:50:15 +03:00
Anton Popov
99ebabd822
Merge remote-tracking branch 'upstream/master' into HEAD
2021-12-17 19:02:29 +03:00
alesapin
6bd7e425c6
Merge pull request #22535 from CurtizJ/sparse-serialization
...
Sparse serialization and ColumnSparse
2021-12-17 15:26:17 +03:00
mergify[bot]
785d16d733
Merge branch 'master' into kill_scalar_github
2021-12-17 10:21:54 +00:00
taiyang-li
d033fc4c24
merge master and fix conflict
2021-12-17 15:11:21 +08:00
zzsmdfj
e9eed1f927
to #15182_MaterializeMySQL_support_bit_type
2021-12-17 15:03:24 +08:00
Nikolai Kochetov
96384fe4f3
Update WindowTransform.cpp
2021-12-16 21:20:39 +03:00
Nikolai Kochetov
82f94a35f9
Update WindowTransform.cpp
2021-12-16 21:20:01 +03:00
Nikolai Kochetov
e28bcbfcea
Fix MV query with multiple chunk result.
2021-12-16 21:17:05 +03:00
Dmitrii Mokhnatkin
2147658432
Proper handler for apache arrow column duplication
2021-12-15 18:30:32 +03:00
Raúl Marín
3de002c7c9
Merge remote-tracking branch 'blessed/master' into libcxx13_take2
2021-12-15 12:57:57 +01:00
mergify[bot]
bff0424139
Merge branch 'master' into kill_scalar_github
2021-12-15 09:22:58 +00:00
Dmitry Novik
56a3f4a000
Cleanup code
2021-12-14 22:15:14 +03:00
alesapin
d7663b2179
Merge branch 'master' into fix_special_build_check
2021-12-14 19:08:28 +03:00
alesapin
884801e1bd
Fixing
2021-12-14 19:08:08 +03:00
Anton Popov
16312e7e4a
Merge remote-tracking branch 'upstream/master' into HEAD
2021-12-14 18:58:17 +03:00
Raúl Marín
44f3b1c9d2
Merge remote-tracking branch 'blessed/master' into libcxx13_take2
2021-12-14 13:05:01 +01:00
Anton Popov
bda0cc2f76
Merge pull request #32530 from Avogar/fix-async-inserts
...
Fix async inserts for some input formats
2021-12-14 14:07:05 +03:00
taiyang-li
ca3f7425a4
fix code
2021-12-14 17:37:31 +08:00
taiyang-li
8234d1176f
merge master
2021-12-14 10:39:21 +08:00
alexey-milovidov
fd9d40925a
Merge pull request #32640 from ClickHouse/fix-function-file
...
Fix surprisingly bad code in function `file`
2021-12-13 19:34:36 +03:00
Raúl Marín
61d959df8f
Fix arrow build
2021-12-13 16:49:22 +01:00
Raúl Marín
d9e4544239
Adapt to arrow 6
2021-12-13 16:49:21 +01:00
Kruglov Pavel
52328f6abc
Merge pull request #32490 from ClickHouse/quote-more-identifiers
...
Fix issue with ambiguous query formatting in distributed queries.
2021-12-13 16:41:01 +03:00
mergify[bot]
fa8f9671f4
Merge branch 'master' into fix-async-inserts
2021-12-13 11:13:20 +00:00
Alexey Milovidov
71926a3a97
Fix surprisingly bad code in function "file"
2021-12-13 07:57:54 +03:00
李扬
8675086104
Merge branch 'master' into hive_table
2021-12-12 09:01:46 -06:00
alexey-milovidov
71df622b1f
Merge pull request #31888 from ClickHouse/fix-31114
...
Remove LowCardinality from window function args.
2021-12-12 06:37:55 +03:00
Alexey Milovidov
58ffe89643
Merge branch 'master' into quote-more-identifiers
2021-12-12 03:33:20 +03:00
taiyang-li
5ef68fc479
fix building
2021-12-11 15:50:59 +08:00
Kruglov Pavel
764e205d36
Fix resetParser in MsgPack format
2021-12-10 21:37:08 +03:00
avogar
1be84d80d4
Fix async inserts for some input formats
2021-12-10 20:54:08 +03:00
Alexey Milovidov
5c90ed2ed9
Unambiguous formatting of distributed queries
2021-12-10 00:55:14 +03:00
Raúl Marín
3a7e524f9e
Style
2021-12-09 18:24:28 +01:00
Raúl Marín
3fc4167c54
Rework how progress is reported in views
2021-12-09 17:08:29 +01:00
kssenii
f5a77fca39
Done
2021-12-09 14:44:25 +00:00
Nikolai Kochetov
f4f06ca368
Fix assert.
2021-12-09 16:59:21 +03:00
Nikita Mikhaylov
dbf5091016
Parallel reading from replicas ( #29279 )
2021-12-09 13:39:28 +03:00
Raúl Marín
0158133b62
SourceWithProgress: Don't use process_list_elem without checking it
2021-12-09 10:02:32 +01:00
Raúl Marín
edf4095139
Add QueryStatus to BuildQueryPipelineSettings and use it
2021-12-09 10:02:32 +01:00
Raúl Marín
9fe648ef8e
Force PipeLineExecutor creators to pass a QueryStatus
...
This is to avoid future issues
2021-12-09 10:02:32 +01:00
Anton Popov
61a5f8a61a
add comments
2021-12-08 18:56:30 +03:00
Anton Popov
d8367334a3
Merge remote-tracking branch 'upstream/master' into HEAD
2021-12-08 18:26:19 +03:00
Mikhail f. Shiryaev
03927f5fc6
Revert "graphite: split tagged/plain rollup rules (for merges perfomance)"
...
This reverts commit 303552f515
.
2021-12-08 15:49:21 +01:00
Nikolai Kochetov
dd906b2a3e
Remove comments.
2021-12-08 14:25:00 +03:00
Nikolai Kochetov
07b2d69ea2
Another try.
2021-12-08 14:22:43 +03:00
Kruglov Pavel
3498e13551
Merge pull request #32270 from ClickHouse/fix-strict-resize
...
Fix possible Pipeline stuck in case of StrictResize processor.
2021-12-08 11:03:53 +03:00
Nikolai Kochetov
2c6cb902c4
Revert "Better fix."
...
This reverts commit f07fc08a49
.
2021-12-07 20:54:49 +03:00
Dmitry Novik
4afaa8697c
remove ya.make
2021-12-07 16:47:42 +03:00
Mikhail f. Shiryaev
5fc20b3e6a
Merge pull request #25122 from msaf1980/rollup_rules_type
...
GraphiteMergeTree Rollup rules type
2021-12-07 10:18:08 +01:00
Nikolai Kochetov
f07fc08a49
Better fix.
2021-12-06 22:08:32 +03:00
Nikolai Kochetov
ce568bff90
Merge branch 'master' into fix-31114
2021-12-06 19:47:13 +03:00
msaf1980
303552f515
graphite: split tagged/plain rollup rules (for merges perfomance)
2021-12-06 16:49:08 +01:00
Nikolai Kochetov
1d75d8fd45
Fix assert.
2021-12-06 15:52:08 +03:00
Nikolai Kochetov
32b4875273
Fix possible Pipeline stuck in case of StrictResize processor.
2021-12-06 14:53:39 +03:00
Kruglov Pavel
cc71c537bc
Merge pull request #32204 from Avogar/skip-quoted-values
...
Improve skiping unknown fields with Quoted escaping rule in Template/CustomSeparated formats
2021-12-06 12:28:14 +03:00
Nikolai Kochetov
91c4c89c66
Merge pull request #31636 from Algunenano/pull_timeout
...
Check max_execution_time in the pipeline and pulling executors
2021-12-06 11:33:39 +03:00