41 Commits

Author SHA1 Message Date
avogar
9a05461680 Better exception message 2024-03-04 17:49:33 +00:00
avogar
70abdf7a41 Small improvements in JSON schema inference 2024-03-04 17:32:22 +00:00
Kruglov Pavel
4f8ee992b6 Apply suggestions from code review (Co-authored-by: Sergei Trifonov <svtrifonov@gmail.com>) 2024-02-14 12:22:21 +01:00
avogar
1bff525666 Fix tests and docs 2024-01-23 18:59:40 +00:00
avogar
f05174e441 Fix style 2024-01-23 18:59:40 +00:00
avogar
617cc514b7 Try to detect file format automatically during schema inference if it's unknown 2024-01-23 18:59:39 +00:00
avogar
7e5ba62017 Allow to read Bool values into String in JSON input formats 2024-01-05 20:33:30 +00:00
Kruglov Pavel
6567fb2c08 Merge pull request #56859 from Avogar/csv-infer-numbers-from-strings: Allow to infer numbers from strings in CSV format 2023-12-12 17:09:02 +01:00
avogar
1db000dae5 Add docs 2023-12-11 18:02:05 +00:00
Kruglov Pavel
91632e3554 Fix typo in docs 2023-10-24 14:30:05 +02:00
avogar
cfa510ea0a Add more documentation, fix build 2023-10-23 14:38:34 +00:00
avogar
e2d7c543eb Don't try to infer numbers from strings in JSON formats by default to avoid parsing errors, add docs for setting input_format_json_try_infer_numbers_from_strings 2023-09-28 22:15:26 +00:00
avogar
42ca897f2d Better schema inference for JSON formats 2023-09-25 15:42:59 +00:00
avogar
ba307c7466 Allow to infer named Tuples from JSON objects under a setting in JSON formats 2023-09-07 19:41:19 +00:00
avogar
b4145aeddc Cache number of rows in files for count in file/s3/url/hdfs/azure functions 2023-08-22 11:59:59 +00:00
Kruglov Pavel
607f337d67 Merge pull request #50592 from Avogar/max-bytes-to-read-in-schema-inference: Add setting to limit the number of bytes to read in schema inference 2023-06-13 16:47:57 +02:00
avogar
028e48dfa7 Update docs 2023-06-05 17:33:10 +00:00
avogar
33e51d4f3b Add setting to limit the number of bytes to read in schema inference 2023-06-05 15:22:04 +00:00
Kruglov Pavel
a5628a1905 Fix typo 2023-03-10 17:11:48 +01:00
avogar
b6c91d9ace Support Decimals and Date32 in Avro format 2023-03-10 15:48:18 +00:00
Kruglov Pavel
d81065ea64 Fix headers in schema inference docs 2023-03-09 16:14:54 +01:00
Derek Chia
bc4418bfb6 Update schema-inference.md 2023-02-27 23:14:22 +08:00
Peignon Melvyn
7c5bd4a5f1 Update schema-inference.md 2023-02-27 13:53:44 +01:00
Peignon Melvyn
e680cd7b12 Update schema-inference.md: Fixed the example 2023-02-27 13:48:54 +01:00
Dan Roscigno
1cc3708092 Merge branch 'master' into update-settings-url 2023-01-24 19:51:39 -05:00
DanRoscigno
77ae27f26c update for split of format settings 2023-01-24 19:37:55 -05:00
Kruglov Pavel
9820beae68 Apply suggestions from code review (Co-authored-by: Antonio Andelic <antonio2368@users.noreply.github.com>) 2023-01-19 16:11:13 +01:00
avogar
1c0941d72a Add docs and examples 2023-01-16 16:46:41 +00:00
Ivan Blinkov
61c2f23713 Remove leftover empty lines at the end of markdown files 2023-01-09 15:15:18 +01:00
Ivan Blinkov
b7e082d033 Remove "Original article links" 2023-01-09 15:13:36 +01:00
Kruglov Pavel
027b525820 Slightly better docs 2022-12-31 15:06:45 +01:00
Kruglov Pavel
791e8a90cf Fix 2022-12-30 23:15:45 +01:00
Kruglov Pavel
7e14ee6eef Fix incorrect docs 2022-12-30 23:14:14 +01:00
DanRoscigno
f6c94d8645 remove table of contents 2022-12-30 10:19:09 -05:00
DanRoscigno
0df0fe2cf0 edits 2022-12-30 09:55:03 -05:00
DanRoscigno
925ce4b96c edits 2022-12-30 09:21:12 -05:00
DanRoscigno
0902db3fe0 edits 2022-12-29 22:34:25 -05:00
avogar
f7168ce46b Add missing new line 2022-12-29 21:31:52 +00:00
avogar
ced668fafb Better 2022-12-29 21:21:37 +00:00
avogar
ae715b9d00 Finish docs 2022-12-29 20:42:03 +00:00
avogar
46b7ec4209 Add detailed documentation about schema inference 2022-12-29 13:42:56 +00:00