# What is ClickHouse?

ClickHouse is a column-oriented database management system (DBMS) for online analytical processing of queries (OLAP).

In a "normal" row-oriented DBMS, data is stored in this order:

| Row | WatchID | JavaEnable | Title | GoodEvent | EventTime |
| ------ | ------------------- | ---------- | ------------------ | --------- | ------------------- |
| #0 | 89354350662 | 1 | Investor Relations | 1 | 2016-05-18 05:19:20 |
| #1 | 90329509958 | 0 | Contact us | 1 | 2016-05-18 08:10:20 |
| #2 | 89953706054 | 1 | Mission | 1 | 2016-05-18 07:38:00 |
| #N | ... | ... | ... | ... | ... |

In other words, all the values related to a row are physically stored next to each other.

Examples of a row-oriented DBMS are MySQL, Postgres, and MS SQL Server.
{: .grey }

In a column-oriented DBMS, data is stored like this:

| Row: | #0 | #1 | #2 | #N |
| ----------- | ------------------- | ------------------- | ------------------- | ------------------- |
| WatchID: | 89354350662 | 90329509958 | 89953706054 | ... |
| JavaEnable: | 1 | 0 | 1 | ... |
| Title: | Investor Relations | Contact us | Mission | ... |
| GoodEvent: | 1 | 1 | 1 | ... |
| EventTime: | 2016-05-18 05:19:20 | 2016-05-18 08:10:20 | 2016-05-18 07:38:00 | ... |

These examples only show the order in which data is arranged.
The values from different columns are stored separately, and data from the same column is stored together.
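
As a rough illustration of the two layouts, the three sample records from the tables above can be held either row by row or column by column. This is a minimal Python sketch, not how any DBMS actually serializes data; the names `rows`, `column_names`, and `columns` are made up for this example.

```python
# The three sample records from the tables above, stored row by row:
# all values of one row sit next to each other.
rows = [
    (89354350662, 1, "Investor Relations", 1, "2016-05-18 05:19:20"),
    (90329509958, 0, "Contact us", 1, "2016-05-18 08:10:20"),
    (89953706054, 1, "Mission", 1, "2016-05-18 07:38:00"),
]
column_names = ["WatchID", "JavaEnable", "Title", "GoodEvent", "EventTime"]

# The same data stored column by column:
# all values of one column sit next to each other.
columns = {
    name: [row[i] for row in rows]
    for i, name in enumerate(column_names)
}

# Reading a single column touches only that column's values...
java_enabled_total = sum(columns["JavaEnable"])

# ...while the row layout has to walk every full row for the same answer.
java_enabled_total_from_rows = sum(row[1] for row in rows)

print(columns["Title"])
print(java_enabled_total, java_enabled_total_from_rows)
```

Both layouts hold identical data; only the grouping of values differs, which determines how much has to be scanned to answer a query about one column.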

Examples of a column-oriented DBMS: Vertica, Paraccel (Actian Matrix and Amazon Redshift), Sybase IQ, Exasol, Infobright, InfiniDB, MonetDB (VectorWise and Actian Vector), LucidDB, SAP HANA, Google Dremel, Google PowerDrill, Druid, and kdb+.
{: .grey }

Different orders for storing data are better suited to different scenarios.
The data access scenario refers to which queries are made, how often, and in what proportion; how much data is read for each type of query (rows, columns, and bytes); the ratio of reads to updates; the size of the working data set and how locally it is used; whether transactions are used, and how isolated they are; the requirements for data replication and logical integrity; and the requirements for latency and throughput for each type of query.

The higher the load on the system, the more important it is to customize the system setup to match the requirements of the usage scenario, and the more fine-grained this customization becomes. There is no system that is equally well-suited to significantly different scenarios. If a system is adaptable to a wide set of scenarios, then under a high load it will either handle all the scenarios equally poorly or work well for just one or a few of the possible scenarios.

## Key Properties of the OLAP Scenario

- The vast majority of requests are for read access.
- Data is updated in fairly large batches (> 1000 rows), not by single rows; or it is not updated at all.
- Data is added to the DB but is not modified.
- For reads, quite a large number of rows are extracted from the DB, but only a small subset of columns.
- Tables are "wide," meaning they contain a large number of columns.
- Queries are relatively rare (usually hundreds of queries per second per server, or fewer).
- For simple queries, latencies around 50 ms are allowed.
- Column values are fairly small: numbers and short strings (for example, 60 bytes per URL).
- High throughput is required when processing a single query (up to billions of rows per second per server).
- Transactions are not necessary.
- Low requirements for data consistency.
- There is one large table per query. All tables are small, except for one.
- A query result is significantly smaller than the source data. In other words, data is filtered or aggregated, so the result fits in a single server's RAM.

It is easy to see that the OLAP scenario is very different from other popular scenarios (such as OLTP or Key-Value access). So it doesn't make sense to use an OLTP or Key-Value DB for processing analytical queries if you want decent performance. For example, if you try to use MongoDB or Redis for analytics, you will get very poor performance compared to OLAP databases.

## Why Column-Oriented Databases Work Better in the OLAP Scenario

Column-oriented databases are better suited to OLAP scenarios: they are at least 100 times faster in processing most queries. The reasons are explained in detail below, but the fact is easier to demonstrate visually:

**Row-oriented DBMS**

![Row-oriented](images/row_oriented.gif#)

**Column-oriented DBMS**

![Column-oriented](images/column_oriented.gif#)

See the difference?

### Input/output

1. For an analytical query, only a small number of table columns need to be read. In a column-oriented database, you can read just the data you need. For example, if you need 5 columns out of 100, you can expect a 20-fold reduction in I/O.
2. Since data is read in packets, it is easier to compress. Data in columns is also easier to compress. This further reduces the I/O volume.
3. Due to the reduced I/O, more data fits in the system cache.
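
The 20-fold figure in the first point can be checked with a back-of-the-envelope calculation. The sketch below assumes, purely for illustration, that every column has the same fixed width and that a full scan is needed; the function name `bytes_read` and the chosen sizes are hypothetical.

```python
def bytes_read(num_rows, columns_needed, columns_total, bytes_per_value, column_oriented):
    """Estimate bytes read for a full scan, ignoring compression and indexes."""
    # A row store has to read whole rows; a column store reads only the needed columns.
    columns_read = columns_needed if column_oriented else columns_total
    return num_rows * columns_read * bytes_per_value


num_rows = 1_000_000   # assumed table size
bytes_per_value = 8    # assumed fixed width of every column value

row_store = bytes_read(num_rows, 5, 100, bytes_per_value, column_oriented=False)
column_store = bytes_read(num_rows, 5, 100, bytes_per_value, column_oriented=True)

print(row_store, column_store)
print(row_store // column_store)  # 20
```

Real tables have columns of different widths, so the actual reduction depends on which columns a query touches.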

For example, the query "count the number of records for each advertising platform" requires reading one "advertising platform ID" column, which takes up 1 byte uncompressed. If most of the traffic was not from advertising platforms, you can expect at least 10-fold compression of this column. When using a quick compression algorithm, data decompression is possible at a speed of at least several gigabytes of uncompressed data per second. In other words, this query can be processed at a speed of approximately several billion rows per second on a single server. This speed is actually achieved in practice.
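
The estimate above can be written out as simple arithmetic. The decompression speed of 2 GB/s is an assumed stand-in for "several gigabytes per second"; the 1-byte column width and the 10-fold compression ratio come from the text.

```python
bytes_per_value = 1               # uncompressed size of one "advertising platform ID"
compression_ratio = 10            # "at least 10-fold" compression, from the text
decompression_speed = 2 * 10**9   # assumed: 2 GB of uncompressed data per second

# Each row contributes one byte of uncompressed data to this query.
rows_per_second = decompression_speed // bytes_per_value

# Compressed data that has to come from disk or cache to sustain that rate.
compressed_bytes_per_second = decompression_speed // compression_ratio

print(rows_per_second)              # 2000000000
print(compressed_bytes_per_second)  # 200000000
```

Under these assumptions, 2 billion rows per second require only about 200 MB/s of compressed input, which is why decompression speed rather than disk bandwidth tends to set the limit in this example.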

<details markdown="1"><summary>Example</summary>
```bash
$ clickhouse-client
ClickHouse client version 0.0.52053.
Connecting to localhost:9000.
Connected to ClickHouse server version 0.0.52053.
```
```sql
SELECT CounterID, count() FROM hits GROUP BY CounterID ORDER BY count() DESC LIMIT 20
```
```text
┌─CounterID─┬──count()─┐
│    114208 │ 56057344 │
│    115080 │ 51619590 │
│      3228 │ 44658301 │
│     38230 │ 42045932 │
│    145263 │ 42042158 │
│     91244 │ 38297270 │
│    154139 │ 26647572 │
│    150748 │ 24112755 │
│    242232 │ 21302571 │
│    338158 │ 13507087 │
│     62180 │ 12229491 │
│     82264 │ 12187441 │
│    232261 │ 12148031 │
│    146272 │ 11438516 │
│    168777 │ 11403636 │
│   4120072 │ 11227824 │
│  10938808 │ 10519739 │
│     74088 │  9047015 │
│    115079 │  8837972 │
│    337234 │  8205961 │
└───────────┴──────────┘
```

</details>

### CPU

Since executing a query requires processing a large number of rows, it helps to dispatch all operations for entire vectors instead of for separate rows, or to implement the query engine so that there is almost no dispatching cost. If you don't do this, with any half-decent disk subsystem, the query interpreter inevitably stalls the CPU.
It makes sense to both store data in columns and process it, when possible, by columns.

There are two ways to do this:

1. A vector engine. All operations are written for vectors, instead of for separate values. This means you don't need to call operations very often, and dispatching costs are negligible. Operation code contains an optimized inner loop.

2. Code generation. The code generated for the query has all the indirect calls inlined.
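
The difference in dispatching cost for the first approach can be sketched in a few lines of Python. This is only an illustration of the idea, not ClickHouse's implementation; the class `CountingOps` and the sample columns `price` and `quantity` are invented for the example.

```python
import operator


class CountingOps:
    """Looks up binary operations by name and counts how often one is dispatched."""

    def __init__(self):
        self.dispatches = 0
        self.table = {"+": operator.add, "*": operator.mul}

    def scalar(self, name, a, b):
        # One lookup-and-call per pair of values.
        self.dispatches += 1
        return self.table[name](a, b)

    def vector(self, name, xs, ys):
        # One lookup-and-call per pair of columns; the loop lives inside the operation.
        self.dispatches += 1
        op = self.table[name]
        return [op(x, y) for x, y in zip(xs, ys)]


price = [10, 20, 30, 40]
quantity = [1, 2, 3, 4]

# Row-at-a-time processing: dispatch once for every value.
row_ops = CountingOps()
row_result = [row_ops.scalar("*", p, q) for p, q in zip(price, quantity)]

# Vector-at-a-time processing: dispatch once for the whole block of values.
vec_ops = CountingOps()
vec_result = vec_ops.vector("*", price, quantity)

print(row_result == vec_result)                # True
print(row_ops.dispatches, vec_ops.dispatches)  # 4 1
```

In a real engine the loop inside the vector operation would be compiled native code, so the per-value cost shrinks as well; the sketch only shows that the number of dispatches drops from one per value to one per block.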

This is not done in "normal" databases, because it doesn't make sense when running simple queries. However, there are exceptions. For example, MemSQL uses code generation to reduce latency when processing SQL queries. (For comparison, analytical DBMSs require optimization of throughput, not latency.)
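
Code generation can be sketched in the same spirit. The example below interprets a small expression tree row by row, then generates and compiles a specialized function for the same expression once. It is a toy in Python under stated assumptions: the tree format and the helper names `interpret`, `to_source`, and `compile_expr` are hypothetical, and `eval` is used only because the generated source is fully controlled by the example itself.

```python
import operator

OPS = {"+": operator.add, "*": operator.mul}

# A tiny expression tree for `a * b + c`; leaves are column names.
expr = ("+", ("*", "a", "b"), "c")


def interpret(node, row):
    """Walk the tree for one row; every operator costs an indirect call via OPS."""
    if isinstance(node, str):
        return row[node]
    op, left, right = node
    return OPS[op](interpret(left, row), interpret(right, row))


def to_source(node):
    """Turn the tree into a Python expression with the operators written inline."""
    if isinstance(node, str):
        return node
    op, left, right = node
    return f"({to_source(left)} {op} {to_source(right)})"


def compile_expr(node, arg_names):
    """Generate and compile a function for this specific expression, once per query."""
    source = f"lambda {', '.join(arg_names)}: {to_source(node)}"
    return source, eval(source)


source, compiled = compile_expr(expr, ["a", "b", "c"])
row = {"a": 2, "b": 3, "c": 4}

print(source)                                  # lambda a, b, c: ((a * b) + c)
print(interpret(expr, row))                    # 10
print(compiled(row["a"], row["b"], row["c"]))  # 10
```

The compiled function no longer walks the tree or looks up operators per row; that work is done once, when the code for the query is generated.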

Note that for CPU efficiency, the query language must be declarative (SQL or MDX), or at least a vector language (J, K). The query should only contain implicit loops, allowing for optimization.

[Original article](https://clickhouse.tech/docs/en/) <!--hide-->