ClickHouse/src/Processors/Formats/Impl
Kruglov Pavel 3396ff6c3a
Merge pull request #40516 from zjial/record_errors_for_import_by_csv
Record errors while reading text formats (CSV, TSV).
2022-09-14 12:52:32 +02:00
..
ArrowBlockInputFormat.cpp Merge branch 'master' into formats-with-names 2022-05-23 12:14:20 +02:00
ArrowBlockInputFormat.h merge master 2022-01-01 09:01:06 +08:00
ArrowBlockOutputFormat.cpp Allow to use String type instead of Binary in Arrow/Parquet/ORC formats 2022-05-18 14:51:21 +00:00
ArrowBlockOutputFormat.h Minor change 2021-11-11 21:09:21 +03:00
ArrowBufferedStreams.cpp Fixing build. 2022-07-20 20:30:16 +00:00
ArrowBufferedStreams.h remove some test code 2022-06-09 09:55:50 +08:00
ArrowColumnToCHColumn.cpp Merge pull request #40173 from Avogar/arrow-dict 2022-08-18 20:54:55 +02:00
ArrowColumnToCHColumn.h Fix 2022-08-15 11:38:28 +00:00
AvroRowInputFormat.cpp Fix half of trash 2022-09-10 04:08:16 +02:00
AvroRowInputFormat.h Support reading Array(Record) into flatten nested table in Avro 2022-08-23 11:05:02 +00:00
AvroRowOutputFormat.cpp Fix build 2022-07-18 07:58:59 +00:00
AvroRowOutputFormat.h Support Maps and Records in Avro format 2022-05-25 11:20:28 +00:00
BinaryRowInputFormat.cpp Stop removing UTF-8 BOM in RowBinary* formats 2022-06-01 13:12:55 +08:00
BinaryRowInputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
BinaryRowOutputFormat.cpp Support parallel formatting for all text output formats 2021-11-23 19:56:43 +03:00
BinaryRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
CapnProtoRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
CapnProtoRowInputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
CapnProtoRowOutputFormat.cpp clang-tidy check performance-move-const-arg fix 2022-03-02 18:15:27 +00:00
CapnProtoRowOutputFormat.h Fix clang-tidy warnings in Parsers, Processors, QueryPipeline folders 2022-03-14 18:17:35 +00:00
CHColumnToArrowColumn.cpp Fix 2022-08-15 11:38:28 +00:00
CHColumnToArrowColumn.h Allow to use String type instead of Binary in Arrow/Parquet/ORC formats 2022-05-18 14:51:21 +00:00
CMakeLists.txt dbms/ → src/ 2020-04-03 18:14:31 +03:00
ConstantExpressionTemplate.cpp Update sort to pdqsort 2022-01-30 19:49:48 +00:00
ConstantExpressionTemplate.h Fix clang-tidy warnings in Parsers, Processors, QueryPipeline folders 2022-03-14 18:17:35 +00:00
CSVRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
CSVRowInputFormat.h Allow to skip some of the first rows in CSV/TSV formats 2022-05-25 15:00:11 +00:00
CSVRowOutputFormat.cpp Small refactoring in formats 2021-11-03 20:07:05 +03:00
CSVRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
CustomSeparatedRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
CustomSeparatedRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
CustomSeparatedRowOutputFormat.cpp Fix 2022-01-14 19:17:06 +03:00
CustomSeparatedRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
HiveTextRowInputFormat.cpp fix 2 bugs: 1. select count(1) from hive_table; 2. select _file, _path from hive_table 2022-05-31 17:39:02 +08:00
HiveTextRowInputFormat.h parallel parsing of hive text format 2022-04-26 14:33:10 +08:00
JSONAsStringRowInputFormat.cpp Make better 2022-05-18 09:25:26 +00:00
JSONAsStringRowInputFormat.h add parallel parsing and schema inference for type Object 2022-03-24 17:51:35 +00:00
JSONColumnsBlockInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
JSONColumnsBlockInputFormat.h Make better 2022-05-20 12:07:29 +00:00
JSONColumnsBlockInputFormatBase.cpp Fix 2022-08-23 11:42:57 +00:00
JSONColumnsBlockInputFormatBase.h Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
JSONColumnsBlockOutputFormat.cpp Make better 2022-05-20 12:07:29 +00:00
JSONColumnsBlockOutputFormat.h Make better 2022-05-20 12:07:29 +00:00
JSONColumnsBlockOutputFormatBase.cpp Make better 2022-05-20 12:07:29 +00:00
JSONColumnsBlockOutputFormatBase.h Make better 2022-05-20 12:07:29 +00:00
JSONColumnsWithMetadataBlockOutputFormat.cpp Make better 2022-05-20 12:07:29 +00:00
JSONColumnsWithMetadataBlockOutputFormat.h Make JSONColumns frormats mono block by default 2022-05-09 11:13:44 +00:00
JSONCompactColumnsBlockInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
JSONCompactColumnsBlockInputFormat.h Make better 2022-05-20 12:07:29 +00:00
JSONCompactColumnsBlockOutputFormat.cpp Make better 2022-05-20 12:07:29 +00:00
JSONCompactColumnsBlockOutputFormat.h Make better 2022-05-20 12:07:29 +00:00
JSONCompactEachRowRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
JSONCompactEachRowRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
JSONCompactEachRowRowOutputFormat.cpp fix build fail with gcc --fix warnings without disabling some parameters 2022-03-11 21:59:19 +08:00
JSONCompactEachRowRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
JSONCompactRowOutputFormat.cpp Make better 2022-05-18 09:25:26 +00:00
JSONCompactRowOutputFormat.h Add columnar JSON formats 2022-05-06 16:48:48 +00:00
JSONEachRowRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
JSONEachRowRowInputFormat.h Allow to specify structure hints in schema inference 2022-08-16 09:46:57 +00:00
JSONEachRowRowOutputFormat.cpp Add aliases JSONLines and NDJSON for JSONEachRow 2022-04-16 06:01:07 +00:00
JSONEachRowRowOutputFormat.h use application/x-ndjson for streaming JSON (#32223) 2021-12-06 10:49:14 +03:00
JSONEachRowWithProgressRowOutputFormat.cpp Fix race in JSONEachRowWithProgressRowOutputFormat 2021-11-24 22:29:43 +03:00
JSONEachRowWithProgressRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
JSONRowOutputFormat.cpp Make better 2022-05-18 09:25:26 +00:00
JSONRowOutputFormat.h Fix double escaping in the metadata of FORMAT JSON 2022-07-30 23:56:41 +02:00
LineAsStringRowInputFormat.cpp Add columnar JSON formats 2022-05-06 16:48:48 +00:00
LineAsStringRowInputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
MarkdownRowOutputFormat.cpp Detect format by file name in file/hdfs/s3/url table functions 2022-01-12 18:29:31 +03:00
MarkdownRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
MsgPackRowInputFormat.cpp Fix 2022-08-23 11:42:57 +00:00
MsgPackRowInputFormat.h Make better 2022-02-07 17:51:26 +03:00
MsgPackRowOutputFormat.cpp Less usage of StringRef 2022-07-24 18:33:52 +00:00
MsgPackRowOutputFormat.h Support UUID in MsgPack format 2022-02-07 17:11:44 +03:00
MySQLDumpRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
MySQLDumpRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
MySQLOutputFormat.cpp Remove inherited create() method + disallow copying 2022-05-02 08:46:52 +02:00
MySQLOutputFormat.h Minor change 2021-11-11 21:09:21 +03:00
NativeFormat.cpp remove some dead and commented code 2022-07-21 15:05:48 +02:00
NullFormat.cpp Move formats to Impl 2021-10-13 13:01:08 +03:00
NullFormat.h Move formats to Impl 2021-10-13 13:01:08 +03:00
ODBCDriver2BlockOutputFormat.cpp Add columnar JSON formats 2022-05-06 16:48:48 +00:00
ODBCDriver2BlockOutputFormat.h Add columnar JSON formats 2022-05-06 16:48:48 +00:00
ORCBlockInputFormat.cpp Merge branch 'master' into formats-with-names 2022-05-23 12:14:20 +02:00
ORCBlockInputFormat.h suppoort skip splits in orc and parquet 2022-04-06 16:40:22 +08:00
ORCBlockOutputFormat.cpp First try at reducing the use of StringRef 2022-07-17 17:26:02 +00:00
ORCBlockOutputFormat.h Fix named tuples output in ORC/Arrow/Parquet formats 2022-05-23 14:21:08 +00:00
ParallelFormattingOutputFormat.cpp Don't use std::unique_lock unless we have to 2022-06-28 19:19:06 +00:00
ParallelFormattingOutputFormat.h Don't use std::unique_lock unless we have to 2022-06-28 19:19:06 +00:00
ParallelParsingInputFormat.cpp new implementation 2022-09-06 20:39:54 +08:00
ParallelParsingInputFormat.h base should not depend on Common 2022-04-29 10:26:35 +08:00
ParquetBlockInputFormat.cpp Use FileReader::GetRecordBatchReader instead of FileReader::ReadRowGroup to parse Parquet 2022-08-22 08:21:32 -03:00
ParquetBlockInputFormat.h suppoort skip splits in orc and parquet 2022-04-06 16:40:22 +08:00
ParquetBlockOutputFormat.cpp Allow to use String type instead of Binary in Arrow/Parquet/ORC formats 2022-05-18 14:51:21 +00:00
ParquetBlockOutputFormat.h Minor change 2021-11-11 21:09:21 +03:00
PostgreSQLOutputFormat.cpp Updated additional cases 2021-12-20 15:55:07 +03:00
PostgreSQLOutputFormat.h Small refactoring in formats 2021-11-03 20:07:05 +03:00
PrettyBlockOutputFormat.cpp Fix comments 2022-08-08 12:41:00 +00:00
PrettyBlockOutputFormat.h Fix comments 2022-08-08 12:41:00 +00:00
PrettyCompactBlockOutputFormat.cpp Add more Pretty formats 2022-07-27 11:37:02 +00:00
PrettyCompactBlockOutputFormat.h Add more Pretty formats 2022-07-27 11:37:02 +00:00
PrettySpaceBlockOutputFormat.cpp Add more Pretty formats 2022-07-27 11:37:02 +00:00
PrettySpaceBlockOutputFormat.h Add more Pretty formats 2022-07-27 11:37:02 +00:00
PrometheusTextOutputFormat.cpp Fix integration test 2022-08-27 22:28:38 +02:00
PrometheusTextOutputFormat.h Fixup PrometheusTextOutputFormat 2022-04-26 14:57:37 +00:00
ProtobufListInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
ProtobufListInputFormat.h Add settings to skip fields with unsupported types in Protobuf/CapnProto schema inference 2022-07-20 11:16:25 +00:00
ProtobufListOutputFormat.cpp Adding two settings in place of one, improvements to the test clarity 2022-04-29 10:01:51 +02:00
ProtobufListOutputFormat.h Adding two settings in place of one, improvements to the test clarity 2022-04-29 10:01:51 +02:00
ProtobufRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
ProtobufRowInputFormat.h Add settings to skip fields with unsupported types in Protobuf/CapnProto schema inference 2022-07-20 11:16:25 +00:00
ProtobufRowOutputFormat.cpp Adding two settings in place of one, improvements to the test clarity 2022-04-29 10:01:51 +02:00
ProtobufRowOutputFormat.h Clean up some header includes and make formatting more consistent 2022-03-13 20:24:12 +01:00
RawBLOBRowInputFormat.cpp Fix linkage 2022-02-04 00:26:44 +03:00
RawBLOBRowInputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
RawBLOBRowOutputFormat.cpp Remove strange code (#40195) 2022-09-12 16:29:23 +02:00
RawBLOBRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
RegexpRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
RegexpRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
SQLInsertRowOutputFormat.cpp Put column names in quotes 2022-06-30 16:14:30 +00:00
SQLInsertRowOutputFormat.h Put column names in quotes 2022-06-30 16:14:30 +00:00
TabSeparatedRowInputFormat.cpp Fix bug in settings input_format_tsv_skip_first_lines of format TSV 2022-08-24 10:02:57 +03:00
TabSeparatedRowInputFormat.h Allow to skip some of the first rows in CSV/TSV formats 2022-05-25 15:00:11 +00:00
TabSeparatedRowOutputFormat.cpp Remove TabSeparatedSorted 2022-07-12 20:22:35 +02:00
TabSeparatedRowOutputFormat.h Remove TabSeparatedSorted 2022-07-12 20:22:35 +02:00
TemplateBlockOutputFormat.cpp Add columnar JSON formats 2022-05-06 16:48:48 +00:00
TemplateBlockOutputFormat.h Better naming 2021-11-25 15:09:13 +03:00
TemplateRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
TemplateRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
TSKVRowInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
TSKVRowInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
TSKVRowOutputFormat.cpp Add columnar JSON formats 2022-05-06 16:48:48 +00:00
TSKVRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
ValuesBlockInputFormat.cpp Make schema inference cache better, respect format settings that can change the schema 2022-08-19 16:39:13 +00:00
ValuesBlockInputFormat.h Add new features in schema inference 2022-07-13 15:57:55 +00:00
ValuesRowOutputFormat.cpp Support parallel formatting for all text output formats 2021-11-23 19:56:43 +03:00
ValuesRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
VerticalRowOutputFormat.cpp fix build fail with gcc --fix warnings without disabling some parameters 2022-03-11 21:59:19 +08:00
VerticalRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00
XMLRowOutputFormat.cpp Add columnar JSON formats 2022-05-06 16:48:48 +00:00
XMLRowOutputFormat.h Small improvements 2022-02-06 07:14:01 +03:00