Commit Graph

274 Commits

Author SHA1 Message Date
justindeguzman
0dd4735e07 [Docs] Fix style check errors 2024-06-28 19:49:26 -07:00
Dale Mcdiarmid
20385161e4 add diagram 2024-06-12 12:33:18 +01:00
Dale Mcdiarmid
b4e12156dd stackoverflow dataset 2024-06-11 18:55:49 +01:00
Nikita Mikhaylov
ab7aa8c1ee
Merge pull request #62429 from peter279k/improve_recipes_dataset_doc
Validating zip and fix query result in Recipes doc
2024-04-26 11:23:32 +00:00
Yarik Briukhovetskyi
2ad89ed96c
add language tags to parts of code 2024-04-18 17:23:34 +01:00
Peter
8caa2dfb22
Be consistency for creating and inserting SQL 2024-04-18 23:49:23 +08:00
peter279k
a91da66961
Add the loading data with cleaning approach 2024-04-16 19:57:18 +08:00
Yarik Briukhovetskyi
17d3d57f9f
fix flaky result 2024-04-09 18:01:12 +02:00
Peter
42a906dca9
Remove useless param, fix typo and query result 2024-04-09 22:17:00 +08:00
peter279k
45b2619f7a
Validating zip and fix query result in Recipes doc 2024-04-09 13:29:34 +08:00
peter279k
057893c310
Add checksum to validate the downloaded archive 2024-04-08 17:57:46 +08:00
robot-ch-test-poll4
661cd7eca4
Merge pull request #61414 from peter279k/fix_geo_dataset_info
Correct the GEO datasets information
2024-03-22 01:28:59 +01:00
robot-clickhouse-ci-1
b6cc44bd3b
Merge pull request #61054 from peter279k/add_tw_weather_dataset
Add another example dataset for presenting usage
2024-03-19 18:21:32 +01:00
Peter
9599e0a13e
Merge branch 'master' of ClickHouse 2024-03-17 00:55:23 +08:00
Peter
6b0f77bbfc
Merge branch 'master' of orign ClickHouse 2024-03-17 00:29:50 +08:00
Peter
88f146aa51
Merge branch 'master' of https://github.com/ClickHouse/ClickHouse into add_tw_weather_dataset 2024-03-17 00:29:20 +08:00
peter279k
978fb78351
Correct Criteo example dataset instruction section 2024-03-15 11:18:21 +08:00
peter279k
e5f0079fc6
Correct the GEO datasets information 2024-03-15 10:27:10 +08:00
Peter
7c5ef07c7b
Add checksum validating before extracting archive 2024-03-15 00:26:32 +08:00
peter279k
0b522f5fb3
Fix issue #61351 2024-03-14 14:46:22 +08:00
Peter
f4fc65449c
Add another example dataset for presenting usage 2024-03-08 01:20:50 +08:00
Peter
7dd37f08d7
Correct the COVID-19 open dataset date column type 2024-03-03 19:50:23 +08:00
Peter
f2e325851d
Remove unavailable referenced tag link 2024-03-02 01:31:33 +08:00
Nikolai Fedorovskikh
a98af159b5 [Docs] fix some typos and missing commas 2024-02-13 02:10:41 +01:00
Alexey Milovidov
1c1e1512bf
Update noaa.md 2024-01-15 01:29:38 +03:00
Alexey Milovidov
5ba6def57d
Update noaa.md 2024-01-14 07:29:28 +03:00
Dale Mcdiarmid
1dacfc53ff weather data 2024-01-12 17:28:45 +00:00
Alexey Milovidov
385f4da819
Update youtube-dislikes.md 2023-12-30 21:30:16 +03:00
johnnymatthews
ecc012cc4d Removes duplicated slugs in docs. 2023-11-09 12:41:39 -04:00
pppeace
986154d6fb
Update wikistat.md
align the create sql and data load script about the column `size`.
2023-11-08 22:24:45 +08:00
rfraposa
a993683a5b Update amazon-reviews.md 2023-10-28 19:34:04 -06:00
rfraposa
11f2dd1d10 Update amazon-reviews.md 2023-10-28 13:13:24 -06:00
rfraposa
7ab492edad Update amazon-reviews.md 2023-10-28 12:58:47 -06:00
Robert Schulze
2746aa87bb
Various fixups 2023-08-31 19:18:42 +00:00
Michael Kolupaev
33c03eda3d
Add warnings about ingestion script speed and memory usage in Laion dataset instructions
The command given in the instructions would run 100 instances of a script that take 41 GB each. I'm not sure how the author of the instructions was able to run it successfully.
2023-08-31 12:07:51 -07:00
Robert Schulze
43367f99fb
Fix style 2023-08-29 12:35:56 +02:00
Robert Schulze
b4219886b4
Dataset docs: Update + fix LAION-400M tutorial 2023-08-29 10:17:13 +00:00
Rich Raposa
a89c129c49
Update nyc-taxi.md
Use gcs function (instead of s3) for the GCS files
2023-06-06 15:54:57 -06:00
Robert Schulze
a22bb07fbd
Merge remote-tracking branch 'rschu1ze/master' into fix-typo-check-on-nested-docs 2023-06-02 12:33:16 +00:00
Robert Schulze
65cc92a78d
CI: Fix aspell on nested docs 2023-06-02 12:24:41 +00:00
Dan Roscigno
c70aa9592b
Merge pull request #50419 from ClickHouse/reddit-fixes
Reddit dataset fixes
2023-06-01 10:30:56 -04:00
rfraposa
86e97f5f5c Update reddit-comments.md 2023-06-01 03:19:23 -06:00
rfraposa
bed7443181 Fixes 2023-05-31 09:31:46 -06:00
rfraposa
308db6784c Update environmental-sensors.md 2023-05-30 08:50:58 -05:00
rfraposa
6a136897e3 Create reddit-comments.md 2023-05-17 13:23:53 -06:00
Ivan Takarlikov
8873856ce5 Fix some grammar mistakes in documentation, code and tests 2023-05-04 13:35:18 -03:00
Robert Schulze
cdf28f9b71
Minor fixups 2023-04-19 16:16:51 +00:00
Robert Schulze
c406663442
Docs: Replace annoying three spaces in enumerations by a single space 2023-04-19 15:56:55 +00:00
rfraposa
d98bee8ea3 Update youtube-dislikes.md 2023-04-14 14:32:55 -06:00
rfraposa
42554e2671 Update youtube-dislikes.md 2023-04-14 14:30:36 -06:00