Alexey Milovidov
04b7a34870
Maybe fix error
2022-06-19 05:21:59 +02:00
Alexey Milovidov
73709b0488
Revert "Revert "Add a setting to use more memory for zstd decompression""
2022-06-18 15:55:35 +03:00
alesapin
16e8b85fbf
Revert "Add a setting to use more memory for zstd decompression"
2022-06-18 14:08:14 +02:00
Alexey Milovidov
e20259e9ca
Merge pull request #37015 from wuxiaobai24/zstd_window_log_max
...
Add a setting to use more memory for zstd decompression
2022-06-18 04:19:27 +03:00
Roman Vasin
ed3fe84b63
Fix runKinit() is called only for USE_KRB5
2022-06-16 14:45:27 +03:00
Roman Vasin
6e28275569
Add warnings about using krb5 parameters
2022-06-16 14:21:04 +03:00
Roman Vasin
d93fd3bd2d
Add complilation support for case when krb5 is not used
2022-06-16 09:30:40 +00:00
Roman Vasin
344fbe8de4
Fix code style
2022-06-15 20:26:42 +03:00
Roman Vasin
dd5b0ee065
Add kerberosInit() function to call KeberosInit
2022-06-15 17:02:53 +03:00
Roman Vasin
9bf6b9d491
Add kinit presence handling in StorageKafka; Cleanup code in HDFSCommon
2022-06-15 11:37:02 +03:00
Roman Vasin
d1d6d87432
Cleanup code in KerberosInit
2022-06-09 11:51:15 +03:00
Roman Vasin
3cfea6e76f
Cleanup code in KerberosInit, HDFSCommon and StorageKafka; update English and Russian documentation.
2022-06-08 17:57:45 +03:00
Roman Vasin
2b76d0c6a9
Add new integration test for kerberized Kafka; remove old kinit code from HDFSCommon
2022-06-08 12:26:35 +03:00
taiyang-li
57b6cf6c09
fix build error
2022-06-08 09:58:09 +08:00
Roman Vasin
a156a77890
Add KerberosInit into StorageKafka
2022-06-07 14:59:46 +03:00
Roman Vasin
cb53aa15ec
Fix HDFSCommon and test_storage_kerberized_hdfs to make running integration tests
2022-06-07 12:06:22 +03:00
taiyang-li
73a484256e
Merge branch 'master' into async_hdfs_read_buffer
2022-06-07 12:16:46 +08:00
taiyang-li
c65c56fd48
fix typo
2022-06-07 09:58:29 +08:00
taiyang-li
b36d9f8143
refactor readinto
2022-06-06 12:58:22 +08:00
Roman Vasin
8b5bf02927
Add support of cache commands in KerberosInit
2022-06-03 18:07:18 +03:00
Nikita Mikhaylov
fe477355df
Added support for schema inference for hdfsCluster
[2] ( #35812 )
2022-06-03 15:54:29 +02:00
taiyang-li
f202c35311
Merge branch 'master' into async_hdfs_read_buffer
2022-06-03 17:52:09 +08:00
Roman Vasin
82ce2d76c3
Add KerberosInit class; add kerberos_init console example; modify HDFSCommon.cpp
2022-06-03 12:06:31 +03:00
Nikolai Kochetov
8991f39412
Merge branch 'master' into refactor-read-metrics-and-callbacks
2022-06-02 17:00:08 +00:00
Azat Khuzhin
545a56ce45
Fix sinks with onException() handler
...
It is possible to call onException() even after onFinish(), in case of
onFinish() throws, and in this case onException() should be no-op for
such sinks.
Also there can be caveats with PartitionedSync.
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-06-01 21:50:30 +03:00
Azat Khuzhin
02af58f41d
Fix possible "Cannot write to finalized buffer"
...
It is still possible to get this error since onException does not
finalize format correctly.
Here is an example of such error, that was found by CI [1]:
<details>
[ 2686 ] {fa01bf02-73f6-4f7f-b14f-e725de6d7f9b} <Fatal> : Logical error: 'Cannot write to finalized buffer'.
[ 34577 ] {} <Fatal> BaseDaemon: ########################################
[ 34577 ] {} <Fatal> BaseDaemon: (version 22.6.1.1, build id: AB8040A6769E01A0) (from thread 2686) (query_id: fa01bf02-73f6-4f7f-b14f-e725de6d7f9b) (query: insert into test_02302 select number from numbers(10) settings s3_truncate_on_insert=1;) Received signal Aborted (6)
[ 34577 ] {} <Fatal> BaseDaemon:
[ 34577 ] {} <Fatal> BaseDaemon: Stack trace: 0x7fcbaa5a703b 0x7fcbaa586859 0xfad9bab 0xfad9e05 0xfaf6a3b 0x24a48c7f 0x258fb9b9 0x258f2004 0x258b88f4 0x258b863b 0x2581773d 0x258177ce 0x24bb5e98 0xfad01d6 0xfad0105 0x2419b11d 0xfad01d6 0xfad0105 0x2215afbb 0x2215aa48 0xfad01d6 0xfad0105 0xfcc265d 0x225cc546 0x249a1c40 0x249bc1b6 0x2685902c 0x26859505 0x269d7767 0x269d504c 0x7fcbaa75e609 0x7fcbaa683163
[ 34577 ] {} <Fatal> BaseDaemon: 3. raise @ 0x7fcbaa5a703b in ?
[ 34577 ] {} <Fatal> BaseDaemon: 4. abort @ 0x7fcbaa586859 in ?
[ 34577 ] {} <Fatal> BaseDaemon: 5. ./build_docker/../src/Common/Exception.cpp:47: DB::abortOnFailedAssertion(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&) @ 0xfad9bab in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 6. ./build_docker/../src/Common/Exception.cpp:70: DB::Exception::Exception(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, int, bool) @ 0xfad9e05 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 7. ./build_docker/../src/IO/WriteBuffer.h:0: DB::WriteBuffer::write(char const*, unsigned long) @ 0xfaf6a3b in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 8. ./build_docker/../src/Processors/Formats/Impl/ArrowBufferedStreams.cpp:47: DB::ArrowBufferedOutputStream::Write(void const*, long) @ 0x24a48c7f in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 9. long parquet::ThriftSerializer::Serialize<parquet::format::FileMetaData>(parquet::format::FileMetaData const*, arrow::io::OutputStream*, std::__1::shared_ptr<parquet::Encryptor> const&) @ 0x258fb9b9 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 10. parquet::FileMetaData::FileMetaDataImpl::WriteTo(arrow::io::OutputStream*, std::__1::shared_ptr<parquet::Encryptor> const&) const @ 0x258f2004 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 11. parquet::WriteFileMetaData(parquet::FileMetaData const&, arrow::io::OutputStream*) @ 0x258b88f4 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 12. parquet::ParquetFileWriter::~ParquetFileWriter() @ 0x258b863b in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 13. parquet::arrow::FileWriterImpl::~FileWriterImpl() @ 0x2581773d in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 14. parquet::arrow::FileWriterImpl::~FileWriterImpl() @ 0x258177ce in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 15. ./build_docker/../src/Processors/Formats/Impl/ParquetBlockOutputFormat.h:27: DB::ParquetBlockOutputFormat::~ParquetBlockOutputFormat() @ 0x24bb5e98 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 16. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:173: std::__1::__shared_count::__release_shared() @ 0xfad01d6 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 17. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:216: std::__1::__shared_weak_count::__release_shared() @ 0xfad0105 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 18.1. inlined from ./build_docker/../contrib/libcxx/include/__memory/unique_ptr.h:312: std::__1::unique_ptr<DB::WriteBuffer, std::__1::default_delete<DB::WriteBuffer> >::reset(DB::WriteBuffer*)
[ 34577 ] {} <Fatal> BaseDaemon: 18.2. inlined from ../contrib/libcxx/include/__memory/unique_ptr.h:269: ~unique_ptr
[ 34577 ] {} <Fatal> BaseDaemon: 18. ../src/Storages/StorageS3.cpp:566: DB::StorageS3Sink::~StorageS3Sink() @ 0x2419b11d in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 19. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:173: std::__1::__shared_count::__release_shared() @ 0xfad01d6 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 20. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:216: std::__1::__shared_weak_count::__release_shared() @ 0xfad0105 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 21. ./build_docker/../contrib/abseil-cpp/absl/container/internal/raw_hash_set.h:1662: absl::lts_20211102::container_internal::raw_hash_set<absl::lts_20211102::container_internal::FlatHashMapPolicy<StringRef, std::__1::shared_ptr<DB::SinkToStorage> >, absl::lts_20211102::hash_internal::Hash<StringRef>, std::__1::equal_to<StringRef>, std::__1::allocator<std::__1::pair<StringRef const, std::__1::shared_ptr<DB::SinkToStorage> > > >::destroy_slots() @ 0x2215afbb in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 22.1. inlined from ./build_docker/../contrib/libcxx/include/string:1445: std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >::__is_long() const
[ 34577 ] {} <Fatal> BaseDaemon: 22.2. inlined from ../contrib/libcxx/include/string:2231: ~basic_string
[ 34577 ] {} <Fatal> BaseDaemon: 22. ../src/Storages/PartitionedSink.h:14: DB::PartitionedSink::~PartitionedSink() @ 0x2215aa48 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 23. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:173: std::__1::__shared_count::__release_shared() @ 0xfad01d6 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 24. ./build_docker/../contrib/libcxx/include/__memory/shared_ptr.h:216: std::__1::__shared_weak_count::__release_shared() @ 0xfad0105 in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 25. ./build_docker/../contrib/libcxx/include/vector:802: std::__1::vector<std::__1::shared_ptr<DB::IProcessor>, std::__1::allocator<std::__1::shared_ptr<DB::IProcessor> > >::__base_destruct_at_end(std::__1::shared_ptr<DB::IProcessor>*) @ 0xfcc265d in /usr/bin/clickhouse
[ 34577 ] {} <Fatal> BaseDaemon: 26.1. inlined from ./build_docker/../contrib/libcxx/include/vector:402: ~vector
[ 34577 ] {} <Fatal> BaseDaemon: 26.2. inlined from ../src/QueryPipeline/QueryPipeline.cpp:29: ~QueryPipeline
[ 34577 ] {} <Fatal> BaseDaemon: 26. ../src/QueryPipeline/QueryPipeline.cpp:535: DB::QueryPipeline::reset() @ 0x225cc546 in /usr/bin/clickhouse
[ 614 ] {} <Fatal> Application: Child process was terminated by signal 6.
</details>
[1]: https://s3.amazonaws.com/clickhouse-test-reports/37542/8a224239c1d922158b4dc9f5d6609dca836dfd06/stress_test__undefined__actions_.html
Follow-up for: #36979
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-06-01 21:50:30 +03:00
Nikolai Kochetov
86fbb74703
Merge branch 'master' into refactor-read-metrics-and-callbacks
2022-05-31 18:07:47 +00:00
Kruglov Pavel
0615866aea
Merge pull request #37450 from Avogar/check-format-on-storage-creation
...
Check format name on storage creation
2022-05-30 14:23:20 +02:00
Nikolai Kochetov
5b4658aa5e
Merge branch 'master' into refactor-read-metrics-and-callbacks
2022-05-30 09:47:35 +00:00
taiyang-li
dbb8a09825
merge master and solve conflict
2022-05-30 10:47:04 +08:00
taiyang-li
51a893c8be
add some metrics
2022-05-30 10:05:20 +08:00
taiyang-li
73d2c889c6
fix log level
2022-05-27 19:23:58 +08:00
taiyang-li
ea450b86cb
add some prefetch metric codes
2022-05-27 18:06:40 +08:00
taiyang-li
561c87222d
add prefetch for hive text
2022-05-26 11:04:35 +08:00
Nikolai Kochetov
1b85f2c1d6
Merge branch 'master' into refactor-read-metrics-and-callbacks
2022-05-25 16:27:40 +02:00
msaf1980
fda6ddeffa
cleanup StorageHDFS (unused variables)
2022-05-25 19:23:05 +05:00
taiyang-li
a7a816dcb6
fix build error
2022-05-25 19:55:11 +08:00
taiyang-li
1d9f65a7d4
Merge branch 'master' into async_hdfs_read_buffer
2022-05-25 17:10:22 +08:00
avogar
f782fa31c6
Merge branch 'master' of github.com:ClickHouse/ClickHouse into check-format-on-storage-creation
2022-05-25 08:42:54 +00:00
Nikolai Kochetov
3d84aae0ab
Better.
2022-05-24 20:06:08 +00:00
avogar
37b66c8a9e
Check format name on storage creation
2022-05-23 12:48:48 +00:00
taiyang-li
29e2157469
change as request
2022-05-23 18:42:54 +08:00
Kruglov Pavel
f539fb835d
Merge branch 'master' into formats-with-names
2022-05-23 12:14:20 +02:00
Nikolai Kochetov
56feef01e7
Move some resources
2022-05-20 19:49:31 +00:00
taiyang-li
14ab7eb5a3
merge master and solve conflict
2022-05-17 16:28:08 +08:00
avogar
2d4b4b9008
Fix inserting defaults for missing values in columnar formats
2022-05-16 14:19:44 +00:00
avogar
68bb07d166
Better naming
2022-05-13 18:39:19 +00:00
avogar
b17fec659a
Improve performance and memory usage for select of subset of columns for some formats
2022-05-13 13:51:28 +00:00
wuxiaobai24
f3e573bfde
Merge remote-tracking branch 'upstream/master' into zstd_window_log_max
2022-05-13 20:42:54 +08:00
wuxiaobai24
ab5636a46a
fix
2022-05-13 17:52:39 +08:00
wuxiaobai24
58755cb156
add zstd_window_log_max setting
2022-05-08 14:02:40 +08:00
avogar
02bd5f6542
Finalize write buffers in case of exception
2022-05-06 17:30:18 +00:00
Robert Schulze
e583099158
Fix build, pt. V
2022-05-04 15:50:52 +02:00
mergify[bot]
64084b5e32
Merge branch 'master' into shared_ptr_helper3
2022-05-03 20:46:16 +00:00
Dmitry Novik
5ba7a55c18
Merge pull request #36650 from bigo-sg/hive_text_parallel_parsing
...
Parallel parsing of hive text format
2022-05-03 15:56:28 +02:00
Robert Schulze
777b5bc15b
Don't let storages inherit from boost::noncopyable
...
... IStorage has deleted copy ctor / assignment already
2022-05-03 09:07:08 +02:00
Robert Schulze
330212e0f4
Remove inherited create() method + disallow copying
...
The original motivation for this commit was that shared_ptr_helper used
std::shared_ptr<>() which does two heap allocations instead of
make_shared<>() which does a single allocation. Turned out that
1. the affected code (--> Storages/) is not on a hot path (rendering the
performance argument moot ...)
2. yet copying Storage objects is potentially dangerous and was
previously allowed.
Hence, this change
- removes shared_ptr_helper and as a result all inherited create() methods,
- instead, Storage objects are now created using make_shared<>() by the
caller (for that to work, many constructors had to be made public), and
- all Storage classes were marked as noncopyable using boost::noncopyable.
In sum, we are (likely) not making things faster but the code becomes
cleaner and harder to misuse.
2022-05-02 08:46:52 +02:00
Amos Bird
4a5e4274f0
base should not depend on Common
2022-04-29 10:26:35 +08:00
kssenii
9d364cdce2
Refactor
2022-04-26 15:33:53 +02:00
taiyang-li
b7cc344d62
remove useless codes
2022-04-26 14:42:43 +08:00
taiyang-li
99dee35b6e
parallel parsing of hive text format
2022-04-26 14:33:10 +08:00
Kruglov Pavel
34c342fdd3
Merge pull request #36205 from Avogar/improve-globs
...
Some refactoring around schema inference with globs
2022-04-25 13:14:46 +02:00
Amos Bird
e73d7fea31
Rename narrowBlockInputStream to narrowPipe
2022-04-24 18:33:48 +08:00
taiyang-li
f040c8ae65
merge master and solve conflict
2022-04-24 11:42:36 +08:00
taiyang-li
957f3f13f7
fix build error
2022-04-21 20:19:29 +08:00
taiyang-li
ad9e683356
fix code syle
2022-04-21 18:12:15 +08:00
taiyang-li
94d0358b15
fix code style
2022-04-21 17:40:55 +08:00
taiyang-li
169dae2a35
ready for review
2022-04-21 17:37:12 +08:00
taiyang-li
fb34a4ec01
ready for review
2022-04-21 17:27:41 +08:00
taiyang-li
1ea96a8732
merge master and solve conflict
2022-04-21 17:16:07 +08:00
Kruglov Pavel
813e228fcc
Merge branch 'master' into improve-globs
2022-04-20 16:31:47 +02:00
taiyang-li
3b722eea7a
profileing
2022-04-20 20:59:36 +08:00
taiyang-li
d533b569ad
debugging
2022-04-20 19:58:31 +08:00
taiyang-li
56fe6fa608
finish dev
2022-04-20 17:49:53 +08:00
taiyang-li
fb6a56d4b0
finish debug
2022-04-20 16:24:18 +08:00
avogar
1f252cedfe
Make better
2022-04-19 19:16:47 +00:00
Anton Popov
bee4ca9b62
add more tests for error diagnostics in files
2022-04-19 15:56:34 +00:00
taiyang-li
e2c51d581c
Merge branch 'fix_hdfs_seek' into async_hdfs_read_buffer
2022-04-19 15:50:01 +08:00
zhanglistar
5670dcf750
Fix crash in debug mode of ReadBufferFromHDFS.
2022-04-18 17:52:59 +08:00
taiyang-li
27fa2a019a
commit again
2022-04-18 15:56:49 +08:00
taiyang-li
cd83fd5f8a
tobe debug
2022-04-16 18:41:18 +08:00
taiyang-li
f20e80c318
wip: deving
2022-04-16 16:19:48 +08:00
zhanglistar
fe93fd83f5
Merge branch 'ClickHouse:master' into fixhdfsreadbuffer
2022-04-15 16:27:20 +08:00
zhanglistar
e22a175afb
Fix ReadBufferFromHDFS crash in debug mode
2022-04-15 14:04:30 +08:00
avogar
1c065f8c7a
Some refactoring around schema inference with globs
2022-04-13 17:02:48 +00:00
taiyang-li
b4cd8561a5
wip
2022-04-13 16:06:19 +08:00
taiyang-li
9d31c44d39
fix bug of read buffer from hdfs
2022-04-12 12:01:51 +08:00
lgbo-ustc
759e4a9a05
Merge remote-tracking branch 'ck/master' into s3_cluster_join
2022-04-08 09:25:30 +08:00
shuchaome
a565a93740
reduce mutex scope when setenv LIBHDFS3_CONF
2022-04-02 06:52:39 +08:00
Alexey Milovidov
5c6b84c398
Merge pull request #35586 from kssenii/fix-virtual-columns-s3
...
Fix s3 engine getting virtual columns
2022-04-01 04:43:32 +03:00
Nikita Mikhaylov
5f27f690f4
Revert "Added support for schema inference for hdfsCluster
( #35602 )"
...
This reverts commit f6bfdcc0c9
.
2022-03-31 14:19:56 +02:00
Nikita Mikhaylov
f6bfdcc0c9
Added support for schema inference for hdfsCluster
( #35602 )
2022-03-31 12:47:36 +02:00
lgbo-ustc
08e1fdc6b8
Fixed :a elimate AMBIGUOUS_COLUMN_NAME exception
...
elimate AMBIGUOUS_COLUMN_NAME exception in hdfs/s3 cluster when run a query with join
2022-03-30 17:06:38 +08:00
kssenii
4374415bd7
Fix clang-tidy, style check
2022-03-29 14:20:21 +02:00
kssenii
a2013ebe2b
Better
2022-03-28 22:40:27 +02:00
taiyang-li
fbadd8fc0e
only deduce absolute path of hdfs config when it's relative path do not exists
2022-03-25 22:28:47 +08:00
taiyang-li
c0eb9f73a9
remote std::cout
2022-03-24 21:17:43 +08:00
taiyang-li
6eccd09e2b
deduce hdfs config path
2022-03-24 18:04:29 +08:00
Anton Popov
36ec379aeb
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-14 16:28:35 +00:00
kssenii
5260822964
Merge master
2022-03-08 18:21:28 +01:00
Anton Popov
df3b07fe7c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-03 22:25:28 +00:00
kssenii
a594f388a4
Merge master
2022-03-01 19:43:45 +01:00
Anton Popov
c1fdcf7a64
Merge remote-tracking branch 'upstream/master' into HEAD
2022-03-01 20:21:39 +03:00
kssenii
092ec45b47
Merge master
2022-03-01 12:06:56 +01:00
kssenii
9b64a8fe39
Fix odbc bridge
2022-02-28 14:29:05 +01:00
Hongbin
c9bc442114
fix comments
2022-02-28 16:44:35 +08:00
Hongbin
99bd56e2de
Fix some code comments style
2022-02-28 08:15:37 +08:00
Anton Popov
fcdebea925
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-25 13:41:30 +03:00
kssenii
c637385dd0
Merge master
2022-02-22 13:17:51 +01:00
kssenii
47f94120da
Merge master
2022-02-16 14:43:28 +01:00
kssenii
3bd3e51aa0
Fix tests
2022-02-16 14:08:41 +01:00
Kruglov Pavel
dd863ca2a0
Merge branch 'master' into fix-url-globs
2022-02-16 12:45:31 +03:00
alesapin
bc2d0ee7c7
Merge pull request #34215 from ClickHouse/revert-34211-revert-34153-add_func_tests_over_s3
...
Add func tests run with s3 and fix several bugs
2022-02-15 19:07:11 +03:00
alesapin
89373155fc
Merge branch 'master' into revert-34211-revert-34153-add_func_tests_over_s3
2022-02-13 21:07:54 +03:00
mergify[bot]
8a47ed2b3f
Merge branch 'master' into static-files-disk-uploader-create-symlinks
2022-02-13 06:12:39 +00:00
Dmitry Novik
50131fd732
Fix cancelation for S3 and HDFS
2022-02-11 18:14:55 +00:00
alesapin
088f8cacf3
Small improvements
2022-02-10 19:50:21 +03:00
Anton Popov
298838f891
avoid unnecessary copying of Settings
2022-02-10 12:13:51 +03:00
Anton Popov
18940b8637
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-09 23:38:38 +03:00
avogar
84087c1554
Improve schema inference with globs
2022-02-09 19:16:15 +03:00
Alexey Milovidov
742620209a
static-files-disk-uploader: add mode to create symlinks
2022-02-09 03:50:50 +03:00
alexey-milovidov
43ee8ddb5b
Merge pull request #34376 from CurtizJ/fix-reading-empty-arrays
...
Fix consecutive backward seeks in seekable read buffers
2022-02-08 02:27:57 +03:00
Anton Popov
1b16db72c3
fix consecutive backward seeks in seekable read buffers
2022-02-07 17:20:26 +03:00
feng lv
6325d4d9b0
continue of #34317
...
fix
fix
2022-02-06 08:59:17 +00:00
Anton Popov
836a348a9c
Merge remote-tracking branch 'upstream/master' into HEAD
2022-02-01 15:23:07 +03:00
alesapin
5230c59f05
Smaller scope
2022-01-31 14:03:06 +03:00
alesapin
1adc7538fe
Fix benign race condition for storage HDFS, S3, URL
2022-01-31 13:50:28 +03:00
Anton Popov
78b9f15abb
Merge remote-tracking branch 'upstream/master' into HEAD
2022-01-30 03:24:37 +03:00
Anton Popov
e8ce091e68
Merge remote-tracking branch 'upstream/master' into HEAD
2022-01-21 20:11:18 +03:00
Kruglov Pavel
7bfb1231b9
Merge branch 'master' into formats-with-suffixes
2022-01-20 14:47:17 +03:00
alexey-milovidov
28a9d569cc
Merge pull request #33690 from azat/remove-unbundled-support
...
Remove unbundled support
2022-01-20 12:29:07 +03:00
liuneng1994
59f3fe515b
change null check position
...
Signed-off-by: neng.liu <neng.liu@kyligence.io>
2022-01-20 06:26:34 +00:00
Azat Khuzhin
aef6668cb4
Remove unbundled hdfs3 support
2022-01-20 08:47:16 +03:00
liuneng1994
4d3bb1584d
fix Dereference of null pointer error
...
Signed-off-by: neng.liu <neng.liu@kyligence.io>
2022-01-19 07:41:46 +00:00
avogar
4efadfad3c
Fix tests
2022-01-18 22:26:13 +03:00
liuneng1994
5e1e512bf2
fix check style
...
Signed-off-by: neng.liu <neng.liu@kyligence.io>
2022-01-18 14:42:28 +00:00
liuneng1994
15ea37c4ae
ignore file not found exception
...
Signed-off-by: neng.liu <neng.liu@kyligence.io>
2022-01-18 12:49:40 +00:00
mergify[bot]
df869c5b9c
Merge branch 'master' into fix_ignore_hdfs_error
2022-01-18 07:05:24 +00:00
liuneng1994
c6e205b2ad
add error check
2022-01-18 06:21:40 +00:00
avogar
af54ca4866
Fix partition write
2022-01-14 21:21:36 +03:00
Kruglov Pavel
a7df9cd53a
Merge branch 'master' into formats-with-suffixes
2022-01-14 21:03:49 +03:00
Kruglov Pavel
d2e9f37bee
Merge branch 'master' into format-by-extention
2022-01-14 18:36:23 +03:00
avogar
89a181bd19
Make better
2022-01-14 18:16:18 +03:00
Kruglov Pavel
5a908e8edd
Merge branch 'master' into formats-with-suffixes
2022-01-14 16:45:20 +03:00
avogar
8884c4a3fe
Fix tests and build
2022-01-14 16:27:57 +03:00
avogar
2d7b1bfa5e
Detect format in S3/HDFS/URL table engines
2022-01-13 16:14:18 +03:00
taiyang-li
1e102bc1b2
merge master
2022-01-01 09:01:06 +08:00
avogar
131d49e6be
Remove comments
2021-12-29 21:26:23 +03:00
avogar
97788b9c21
Allow to create new files on insert for File/S3/HDFS engines
2021-12-29 21:19:13 +03:00
avogar
8436638a89
Fix style
2021-12-29 12:21:01 +03:00
avogar
26abf7aa62
Remove code duplication, use simdjson and rapidjson instead of Poco
2021-12-29 12:21:01 +03:00
avogar
8112a71233
Implement schema inference for most input formats
2021-12-29 12:18:56 +03:00
kssenii
1f6ca619b7
Allow some killing
2021-12-27 22:42:56 +03:00
taiyang-li
9036b18c2f
merge master
2021-12-27 15:12:48 +08:00
Alexey Milovidov
f03cb4c762
Cleanup trash from Kafka and HDFS
2021-12-25 06:10:59 +03:00
Nikita Mikhaylov
29fdc422d0
Merge pull request #32974 from nikitamikhaylov/hdfsCluster
...
Merging #32400
2021-12-24 16:02:54 +03:00
Nikita Mikhaylov
d9c13900f5
Fix build + style
2021-12-20 14:31:56 +00:00
Zhichang Yu
869cdbeb44
added table function hdfsCluster
2021-12-20 13:14:44 +00:00
avogar
e31c18e9f8
Fix hdfs url check
2021-12-20 16:09:48 +03:00
Anton Popov
99ebabd822
Merge remote-tracking branch 'upstream/master' into HEAD
2021-12-17 19:02:29 +03:00
李扬
8675086104
Merge branch 'master' into hive_table
2021-12-12 09:01:46 -06:00
Kseniia Sumarokova
7e261245d3
Merge pull request #32256 from taiyang-li/default_for_hdfs
...
Support default expression for storage hdfs and optimize fetching when source is column oriented
2021-12-06 23:32:01 +03:00
leosunli
6f611eec91
Fix bug
...
Signed-off-by: leosunli <lisheng.sun08@gmail.com>
2021-12-06 14:32:32 +08:00
Kseniia Sumarokova
4017bb8f78
Update WriteBufferFromHDFS.h
2021-12-06 14:32:32 +08:00
leosunli
e81a1dbe6a
Make HDFS replication configurable in WriteBufferFromHDFSImpl#WriteBufferFromHDFSImpl
...
Signed-off-by: leosunli <lisheng.sun08@gmail.com>
2021-12-06 14:32:31 +08:00
leosunli
5be58a8839
Make HDFS replication configurable in WriteBufferFromHDFSImpl#WriteBufferFromHDFSImpl
...
Signed-off-by: leosunli <lisheng.sun08@gmail.com>
2021-12-06 14:32:31 +08:00
sunlisheng
ef77f45f59
Make HDFS replication configurable in WriteBufferFromHDFSImpl#WriteBufferFromHDFSImpl
...
Signed-off-by: sunlisheng <lisheng.sls@alibaba-inc.com>
2021-12-06 14:32:31 +08:00
taiyang-li
9d1e63be0f
1. support default expression for storage hdfs; 2. optimize storage hdfs when format is column oriented
2021-12-06 11:54:45 +08:00
Anton Popov
6f4d9a53b2
Merge remote-tracking branch 'origin/sparse-serialization' into HEAD
2021-12-01 15:54:33 +03:00
taiyang-li
72f60cceb9
Merge branch 'master' into hive_table
2021-11-25 17:33:26 +08:00
Kseniia Sumarokova
93cf66df12
Merge pull request #30936 from kssenii/seekable-read-buffers
...
Reduce memory usage for some formats when reading with s3/url/hdfs
2021-11-25 11:19:24 +03:00
kssenii
46d2fdc5e6
Update
2021-11-24 21:53:53 +03:00
taiyang-li
89dcef69d5
merge master
2021-11-24 14:38:04 +08:00
Anton Popov
ccd78e3838
Merge remote-tracking branch 'upstream/master' into HEAD
2021-11-22 17:19:35 +03:00
Kruglov Pavel
d9c1a0c8ec
Merge branch 'master' into fix-write-buffers
2021-11-20 17:48:24 +03:00
kssenii
ff969b4605
Merge branch 'master' of github.com:ClickHouse/ClickHouse into seekable-read-buffers
2021-11-20 15:03:13 +03:00
taiyang-li
e8644807fe
merge master and solve conflict
2021-11-19 15:01:58 +08:00
Kruglov Pavel
f27fcf8372
Merge pull request #31004 from Avogar/output-formats
...
Small refactoring in formats
2021-11-17 20:44:46 +03:00
kssenii
f18dcd2287
Merge branch 'master' of github.com:ClickHouse/ClickHouse into seekable-read-buffers
2021-11-13 14:38:57 +03:00
avogar
e5661b9b62
Minor change
2021-11-11 21:09:21 +03:00
avogar
c521a9131a
Small refactoring of WriteBiffer-s
2021-11-11 02:11:18 +03:00
taiyang-li
79cf9fb7ce
fix code style
2021-11-10 11:41:20 +08:00
taiyang-li
26988ab757
Merge branch 'master' into hive_table
2021-11-10 11:01:03 +08:00
Anton Popov
a20922b2d3
Merge remote-tracking branch 'origin/sparse-serialization' into HEAD
2021-11-09 15:36:25 +03:00
taiyang-li
36ca0b296b
implement hive table engine
2021-11-05 19:55:30 +08:00
avogar
2dd4393ca1
Small refactoring in formats
2021-11-03 20:07:05 +03:00
kssenii
ec11179f91
Merge branch 'master' of github.com:ClickHouse/ClickHouse into seekable-read-buffers
2021-11-03 14:33:31 +03:00
kssenii
45ea820297
Reduce memory usage for some formats
2021-11-03 14:30:03 +03:00
Kruglov Pavel
77f5ef476a
Merge branch 'master' into hdfs-bad-url
2021-11-03 14:08:55 +03:00
avogar
3839d521a0
Add additional hdfs url check
2021-11-03 14:01:50 +03:00
Kseniia Sumarokova
065106c299
Merge pull request #30690 from kssenii/partitioned-write
...
Partitioned write for file, hdfs, url storages and table functions
2021-11-02 22:58:16 +03:00
kssenii
7e2ea97e3c
Merge branch 'master' of github.com:ClickHouse/ClickHouse into disk-async-read
2021-10-28 18:46:21 +03:00
Alexey Milovidov
8b4a6a2416
Remove cruft
2021-10-28 02:10:39 +03:00
kssenii
20772fde04
For storage
2021-10-27 10:04:17 +03:00
kssenii
2ba3ee830e
Partitioned write part 2
2021-10-26 14:00:41 +03:00
kssenii
1d743b9259
Partitoned write
2021-10-25 19:23:44 +03:00
kssenii
39b9e9c258
Merge branch 'master' of github.com:ClickHouse/ClickHouse into disk-async-read
2021-10-22 15:40:41 +03:00
kssenii
e4e157688d
Review fixes
2021-10-21 01:57:43 +03:00
kssenii
4a0bdd296d
Merge branch 'master' of github.com:ClickHouse/ClickHouse into disk-async-read
2021-10-16 18:35:48 +03:00
Nikolai Kochetov
a08c98d760
Move some files.
2021-10-16 17:03:50 +03:00
kssenii
881ae8617e
Merge branch 'master' of github.com:ClickHouse/ClickHouse into disk-async-read
2021-10-15 15:09:56 +03:00
kssenii
3995506d37
Adjust range reader for remote fs reads
2021-10-15 11:36:26 +03:00
Nikolai Kochetov
2957971ee3
Remove some last streams.
2021-10-13 21:22:02 +03:00
Nikolai Kochetov
ec18340351
Remove streams from formats.
2021-10-11 19:11:50 +03:00
Nikolai Kochetov
78e1db209f
Remove more data streams ( #29491 )
...
* Remove more streams.
* Fixing build.
* Fixing build.
* Rename files.
* Fix fast test.
* Fix StorageKafka.
* Try fix kafka test.
* Move createBuffer to KafkaSource ctor.
* Revert "Move createBuffer to KafkaSource ctor."
This reverts commit 81fa94d27e
.
* Revert "Try fix kafka test."
This reverts commit 2107e54969
.
* Comment some rows in test.
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
2021-10-07 11:26:08 +03:00
Alexey Milovidov
fe6b7c77c7
Rename "common" to "base"
2021-10-02 10:13:14 +03:00
Nikolai Kochetov
236d71ea94
Merge pull request #28582 from ClickHouse/rewrite-pushing-to-views
...
Rewrite PushingToViews
2021-09-27 21:19:11 +03:00
kssenii
d71b6ad657
Fix
2021-09-23 07:21:08 +00:00
Nikolai Kochetov
f569a3e3f7
Merge branch 'master' into rewrite-pushing-to-views
2021-09-09 20:30:23 +03:00
Anton Popov
4c388e3d84
Merge remote-tracking branch 'origin/sparse-serialization' into HEAD
2021-09-09 14:10:16 +03:00
Nikolai Kochetov
66a76ab70f
Rewrite PushingToViewsBlockOutputStream part 6
2021-09-03 20:29:36 +03:00
Kseniia Sumarokova
00bea71610
Merge pull request #25251 from kssenii/disk-over-web-server
...
ClickHouse over web server with static files
2021-08-30 16:18:59 +03:00
Zhichang Yu
ef1994f420
set env LIBHDFS3_CONF, refers to ClickHouse#8159
2021-08-28 09:16:48 +08:00
kssenii
073d7fdd5e
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into disk-over-web-server
2021-08-06 19:42:29 +00:00
Anton Popov
e36736b50c
Merge remote-tracking branch 'origin/sparse-serialization' into HEAD
2021-08-02 22:52:02 +03:00
Nikolai Kochetov
9c92f43359
Update storages.
2021-07-23 22:33:59 +03:00
Anton Popov
f99374cca6
Merge remote-tracking branch 'origin/sparse-serialization' into HEAD
2021-07-20 18:20:21 +03:00
Alexey Milovidov
261a220227
Remove some code
2021-07-17 21:06:46 +03:00
Anton Popov
3ed7f5a6cc
dynamic subcolumns: add snapshot for storage
2021-07-09 06:15:41 +03:00
Kseniia Sumarokova
f3b0f11b59
Update StorageHDFS.cpp
2021-06-21 23:04:58 +03:00
kssenii
ac0f86cdbf
Truncate for s3
2021-06-21 15:44:36 +00:00
kssenii
4688f9e038
hdfs truncate table
2021-06-21 13:59:57 +00:00
kssenii
f33387837e
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into disk-over-web-server
2021-06-18 14:27:48 +00:00
Vladimir
4de0d1a593
Merge pull request #24885 from Jokser/disk-s3-read-error-fix
2021-06-18 13:38:59 +03:00
Maksim Kita
67e9b85951
Merge ext into common
2021-06-16 23:28:41 +03:00
kssenii
cd885ee750
Fix build, ya make
2021-06-14 11:52:20 +00:00
Maksim Kita
30c78c0a23
Fix arcadia
2021-06-11 14:26:33 +03:00
Pavel Kovalenko
bf5190cd34
Fix reading from ReadBufferFromHDFS
2021-06-03 14:34:47 +03:00
kssenii
b28157a0df
Merge branch 'master' of github.com:ClickHouse/ClickHouse into poco-file-to-std-fs
2021-05-27 00:01:25 +03:00
kssenii
3dee003f9b
Merge branch 'master' of github.com:ClickHouse/ClickHouse into poco-file-to-std-fs
2021-05-20 19:20:09 +03:00
kssenii
ba41d01b78
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into hdfs-idisk
2021-05-19 11:47:05 +00:00
kssenii
9b8df78fdd
Merge branch 'master' of github.com:ClickHouse/ClickHouse into poco-file-to-std-fs
2021-05-17 17:42:05 +03:00
kssenii
71fcbefdc7
Add final to class
2021-05-17 14:38:48 +00:00
kssenii
26fa920ad5
Fix clang tidy, slightly improve writeSuffix
2021-05-17 12:32:48 +00:00
kssenii
a8748a55df
Fix abnormal termination when hdfs goes away
2021-05-17 09:55:23 +00:00
kssenii
850b6f02cb
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into hdfs-idisk
2021-05-11 14:22:30 +00:00
kssenii
02288359c5
Less manual concatenation of paths
2021-05-08 13:59:55 +03:00
feng lv
39f68bf5ff
fix conflict
2021-05-02 16:33:45 +00:00
kssenii
c416fc3441
Fix hdfs read with Values format
2021-04-29 11:31:23 +00:00
feng lv
0f5c05854c
fix conflict
2021-04-23 12:22:51 +00:00
feng lv
4ffe199d39
Implement table comments
2021-04-23 12:18:23 +00:00
Kseniia Sumarokova
0b6b30887c
Fix style check
2021-04-21 01:35:13 +03:00
kssenii
87b4843134
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into hdfs-idisk
2021-04-20 21:40:01 +00:00
kssenii
a07d44e438
More common
2021-04-20 11:41:32 +00:00
kssenii
da7a4ac1ea
Better
2021-04-20 07:53:55 +00:00
kssenii
cc5f49fbee
Fix
2021-04-19 19:43:22 +00:00
kssenii
5df0fcc114
Fixes and tests
2021-04-19 15:33:33 +00:00
kssenii
387494f0c8
Common interface
2021-04-18 13:37:23 +00:00
kssenii
ab06ca3144
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into hdfs-idisk
2021-04-12 07:07:48 +00:00
Ivan
495c6e03aa
Replace all Context references with std::weak_ptr ( #22297 )
...
* Replace all Context references with std::weak_ptr
* Fix shared context captured by value
* Fix build
* Fix Context with named sessions
* Fix copy context
* Fix gcc build
* Merge with master and fix build
* Fix gcc-9 build
2021-04-11 02:33:54 +03:00
kssenii
908d872b76
Add seek method
2021-04-05 09:32:06 +00:00
kssenii
388a71e6b1
Fix build, ya check
2021-04-04 07:33:07 +00:00
kssenii
8cbfbb5de7
Small refactoring, add hdfs to test_disk_types
2021-04-03 12:02:29 +00:00
Alexey Milovidov
177a017c5c
Minor modification #22115
2021-03-25 20:51:32 +03:00
Nikita Mikhaylov
1f92c8ce58
parallel formatting everywhere
2021-03-15 14:51:24 +03:00
Nikita Mikhailov
2f04cb5ebe
abort() instead of std::terminate() + cleanup
2020-12-30 08:31:45 +03:00
Nikita Mikhailov
c5f92e5096
better formatfactory
2020-12-30 06:07:30 +03:00
Nikita Mikhailov
6a55e018e4
Merge branch 'master' of github.com:ClickHouse/ClickHouse into parallel-parsing-input-format
2020-12-18 02:25:32 +03:00
Ilya Golshtein
09d1e89674
yandex/clickhouse-kerberized-hadoop + missed header
2020-12-16 14:48:33 +03:00
Ilya Golshtein
407565b47f
fix - wrong error code
2020-12-16 14:48:33 +03:00
Ilya Golshtein
9701d6a2d5
style fix per code review, doc improvement, params consistency check
2020-12-16 14:48:33 +03:00
Ilya Golshtein
815856bc3d
pass config, not context to createHDFSBuilder
2020-12-16 14:48:33 +03:00
Ilya Golshtein
7e97814859
cleanup, fixes, new submodules, ShellCommand, WriteBufferFromString
2020-12-16 14:48:33 +03:00