Kruglov Pavel
96700abbe1
Merge pull request #45678 from azat/formats/json-parse-tupels
...
Add ability to ignore unknown keys in JSON object for named tuples
2023-01-27 21:11:05 +01:00
Azat Khuzhin
1a8437f2c9
Add ability to ignore unknown keys in JSON object for named tuples
...
This can be useful in case your input JSON is complex, while you need
only few fields in it.
This behaviour is controlled by the
input_format_json_ignore_unknown_keys_in_named_tuple setting name, that
is turned OFF by default.
This will, almost, allow to parse gharchive dataset without jq. "almost"
because of two things:
- Tuple cannot be Nullable, so such keys with Tuple type in ClickHouse
cannot be `null` in JSON
- You cannot use dot.dot notation to extract columns for file() engine,
only tupleElement()
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-27 10:01:08 +01:00
Alexander Tokmakov
a584ad0eb1
forbid runtime strings
2023-01-26 10:52:47 +01:00
Alexander Tokmakov
3744fa2c63
format more messages
2023-01-25 21:16:42 +01:00
Alexander Tokmakov
d1baa7300c
reformat ParsingException
2023-01-24 23:21:29 +01:00
Alexander Tokmakov
dd57215934
Merge branch 'master' into exception_message_patterns4
2023-01-24 17:03:12 +01:00
Kruglov Pavel
4bd3f0e5ef
Merge pull request #44953 from Avogar/tsv-csv-detect-header
...
Detect header in CSV/TSV/CustomSeparated files automatically
2023-01-24 15:13:52 +01:00
Alexander Tokmakov
6ecae8388e
Merge branch 'master' into exception_message_patterns4
2023-01-24 14:42:36 +01:00
Maksim Kita
2f1092e6d2
Merge pull request #45425 from ClickHouse/fix-ipv6-parser
...
Fix ipv6 parser
2023-01-24 14:59:47 +03:00
Kruglov Pavel
cd1cd904a7
Merge branch 'master' into tsv-csv-detect-header
2023-01-23 23:49:56 +01:00
Alexander Tokmakov
bb4c8e169f
check number of parameters in format string
2023-01-23 23:16:16 +01:00
Alexander Tokmakov
3f6594f4c6
forbid old ctor of Exception
2023-01-23 22:18:05 +01:00
Alexander Tokmakov
70d1adfe4b
Better formatting for exception messages ( #45449 )
...
* save format string for NetException
* format exceptions
* format exceptions 2
* format exceptions 3
* format exceptions 4
* format exceptions 5
* format exceptions 6
* fix
* format exceptions 7
* format exceptions 8
* Update MergeTreeIndexGin.cpp
* Update AggregateFunctionMap.cpp
* Update AggregateFunctionMap.cpp
* fix
2023-01-24 00:13:58 +03:00
Kruglov Pavel
3b53b195be
Fix special build
2023-01-23 15:27:36 +01:00
Kruglov Pavel
28ddcc2432
Merge branch 'master' into tsv-csv-detect-header
2023-01-20 15:08:38 +01:00
avogar
c34c0aa22e
Fix comments
2023-01-19 16:03:46 +00:00
Yakov Olkhovskiy
c6ee4c3908
Merge pull request #44686 from Algunenano/fix_uuid_parsing_in_values
...
Don't parse beyond the quotes when reading UUIDs
2023-01-18 19:30:53 -05:00
Yakov Olkhovskiy
1d58ded72b
fix IP parsers to treat input as not whole string
2023-01-19 00:08:20 +00:00
Kruglov Pavel
4183f6082f
Fix special build
2023-01-17 15:18:39 +01:00
Kruglov Pavel
e9d6590926
Merge branch 'master' into tsv-csv-detect-header
2023-01-16 17:50:24 +01:00
avogar
87b934c472
Insert default values in case of missing tuple elements in JSONEachRow
2023-01-12 16:36:44 +00:00
Kruglov Pavel
05a11ff4a4
Merge branch 'master' into tsv-csv-detect-header
2023-01-12 12:35:18 +01:00
Raúl Marín
4e8b2a2d26
Merge branch 'master' into fix_uuid_parsing_in_values
2023-01-11 15:30:49 +01:00
Kruglov Pavel
50eb9fca67
Merge pull request #44696 from Avogar/schema-inference-uint
...
Infer UInt64 in case of Int64 overflow
2023-01-11 14:24:42 +01:00
Yakov Olkhovskiy
9e327306a3
Merge pull request #45043 from ClickHouse/fix-ip-function-hash
...
Fix hash functions for IPv4
2023-01-10 07:48:12 -05:00
Yakov Olkhovskiy
2559c49810
make only IPv4 as numeric, fix hash functions for IPv6, add more hash functions to test
2023-01-10 02:03:06 +00:00
Maksim Kita
0ba597240e
JIT compilation float to bool conversion fix
2023-01-09 16:35:06 +01:00
Yakov Olkhovskiy
6ec1592a9f
fix hash functions for IPv4
2023-01-08 20:58:09 +00:00
avogar
ee72799121
Fix tests, make better
2023-01-06 20:46:43 +00:00
avogar
7fcdb08ec6
Detect header in CSV/TSV/CustomSeparated files automatically
2023-01-05 22:57:25 +00:00
Yakov Olkhovskiy
7a5a36cbed
Merge branch 'master' into refactoring-ip-types
2023-01-04 11:11:06 -05:00
Kruglov Pavel
7062054d60
Merge branch 'master' into schema-inference-uint
2023-01-04 14:50:01 +01:00
Alexey Milovidov
e855d3519a
Merge branch 'master' into refactoring-ip-types
2023-01-02 21:58:53 +03:00
Kruglov Pavel
0a43976977
Merge branch 'master' into validate-types
2023-01-02 16:10:14 +01:00
Azat Khuzhin
b9125bb351
Introduce groupArrayLast() (useful to store last X values) ( #44521 )
...
* Cleanup DataTypeCustomSimpleAggregateFunction::checkSupportedFunctions()
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
* Remove unused GroupArrayGeneralListImpl
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
* Introduce groupArrayLast() (useful to store last X values)
Also do some refactoring to make code cleaner:
- rename insert() to insertWithSampler() (since it is used only for
groupArraySample())
- split merge methods into Last/RNG/...
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2023-01-02 15:20:21 +01:00
Kruglov Pavel
69b9842bc6
Merge branch 'master' into schema-inference-uint
2022-12-30 18:16:00 +01:00
Kruglov Pavel
4982d132fb
Merge branch 'master' into validate-types
2022-12-30 17:52:13 +01:00
Raúl Marín
f5ca0c0c3c
Win back some performance
2022-12-30 17:20:09 +01:00
Raúl Marín
19e056b70b
Merge remote-tracking branch 'blessed/master' into fix_uuid_parsing_in_values
2022-12-30 12:06:32 +01:00
Kruglov Pavel
150a699dda
Merge pull request #44546 from Avogar/better-object-as-string-inference
...
Improve json object as string inference
2022-12-29 21:58:46 +01:00
avogar
1ce69371fb
Infer UInt64 in case of Int64 overflow
2022-12-28 21:46:08 +00:00
Raúl Marín
6377b1675b
Don't parse beyond the quotes when reading UUIDs
2022-12-28 21:35:20 +01:00
avogar
411f98306a
Merge branch 'master' of github.com:ClickHouse/ClickHouse into validate-types
2022-12-27 19:24:15 +00:00
Alexey Milovidov
611bb44288
Merge branch 'master' into fix-30975
2022-12-27 17:15:12 +03:00
Alexey Milovidov
c5155b8d32
Merge pull request #44442 from Avogar/csv-bool-crlf
...
Fix reading bool value with CRLF in CSV format
2022-12-26 21:25:18 +03:00
avogar
835412af8d
Fix
2022-12-23 21:37:09 +00:00
avogar
8dfe90a6c1
Improve json object as string inference
2022-12-23 19:44:13 +00:00
Yakov Olkhovskiy
a8cb29da4b
Merge branch 'master' into refactoring-ip-types
2022-12-21 23:56:24 -05:00
avogar
4ab3e90382
Validate types in table function arguments/CAST function arguments/JSONAsObject schema inference
2022-12-21 21:21:30 +00:00
Alexey Milovidov
ae7a7bc850
Merge branch 'master' into fix-30975
2022-12-21 01:41:29 +03:00