Merge pull request #40914 from DanRoscigno/update-uk

Update uk price paid example
Rich Raposa 2022-09-02 09:54:21 -06:00 committed by GitHub
commit 88ef639629
GPG Key ID: 4AEE18F83AFDEB23


@@ -6,7 +6,6 @@ title: "UK Property Price Paid"
---
The dataset contains data about prices paid for real-estate property in England and Wales. The data has been available since 1995.
The size of the dataset in uncompressed form is about 4 GiB and it will take about 270 MiB in ClickHouse.
Source: https://www.gov.uk/government/statistical-data-sets/price-paid-data-downloads <br/>
Description of the fields: https://www.gov.uk/guidance/about-the-price-paid-data
@@ -94,8 +93,6 @@ FROM url(
SETTINGS max_http_get_redirects=1;
```
It will take about 2 minutes depending on where you are in the world, and where your ClickHouse servers are. Almost all of the time is the download time of the CSV file from the UK government server.
## Validate the Data {#validate-data}
Query:
@@ -112,26 +109,6 @@ Result:
└──────────┘
```
The size of the dataset in ClickHouse is about 540 MiB. Check it:
Query:
```sql
SELECT formatReadableSize(total_bytes) FROM system.tables WHERE name = 'uk_price_paid';
```
Result:
```text
┌─formatReadableSize(total_bytes)─┐
│ 545.04 MiB │
└─────────────────────────────────┘
```
:::note
The size above is for a replicated table. If you are using this dataset with a single instance, the size will be half that.
:::
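As a rough cross-check, the compressed and uncompressed sizes can also be read from `system.parts`. This is a minimal sketch and not part of the original example; it assumes the table is named `uk_price_paid` and lives in the current database.

```sql
-- Sketch: sum the sizes of the table's active data parts.
-- Assumes the table is in the current database.
SELECT
    formatReadableSize(sum(data_compressed_bytes)) AS compressed,
    formatReadableSize(sum(data_uncompressed_bytes)) AS uncompressed
FROM system.parts
WHERE database = currentDatabase()
  AND table = 'uk_price_paid'
  AND active;
```

Only active parts are counted, so the figures reflect the data currently in use rather than parts that were merged away and are awaiting cleanup.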
## Run Some Queries {#run-queries}
### Query 1. Average Price Per Year {#average-price}