ClickHouse/docs/en/sql-reference/functions/index.md

---
slug: /en/sql-reference/functions/
sidebar_position: 1
sidebar_label: Overview
---

# Regular Functions

There are at least\* two types of functions - regular functions (they are just called “functions”) and aggregate functions. These are completely different concepts. Regular functions work as if they are applied to each row separately (for each row, the result of the function does not depend on the other rows). Aggregate functions accumulate a set of values from various rows (i.e. they depend on the entire set of rows).

In this section we discuss regular functions. For aggregate functions, see the section “Aggregate functions”.

\* - There is a third type of function that the ‘arrayJoin’ function belongs to; table functions can also be mentioned separately.\*

## Strong Typing

In contrast to standard SQL, ClickHouse has strong typing. In other words, it does not make implicit conversions between types. Each function works for a specific set of types. This means that sometimes you need to use type conversion functions.

## Common Subexpression Elimination

All expressions in a query that have the same AST (the same record or same result of syntactic parsing) are considered to have identical values. Such expressions are concatenated and executed once. Identical subqueries are also eliminated this way.

## Types of Results

All functions return a single return as the result (not several values, and not zero values). The type of result is usually defined only by the types of arguments, not by the values. Exceptions are the tupleElement function (the a.N operator), and the toFixedString function.

## Constants

For simplicity, certain functions can only work with constants for some arguments. For example, the right argument of the LIKE operator must be a constant.
Almost all functions return a constant for constant arguments. The exception is functions that generate random numbers.
The ‘now’ function returns different values for queries that were run at different times, but the result is considered a constant, since constancy is only important within a single query.
A constant expression is also considered a constant (for example, the right half of the LIKE operator can be constructed from multiple constants).

Functions can be implemented in different ways for constant and non-constant arguments (different code is executed). But the results for a constant and for a true column containing only the same value should match each other.

## NULL Processing

Functions have the following behaviors:

- If at least one of the arguments of the function is `NULL`, the function result is also `NULL`.
- Special behavior that is specified individually in the description of each function. In the ClickHouse source code, these functions have `UseDefaultImplementationForNulls=false`.

## Constancy

Functions can’t change the values of their arguments – any changes are returned as the result. Thus, the result of calculating separate functions does not depend on the order in which the functions are written in the query.

## Higher-order functions, `->` operator and lambda(params, expr) function

Higher-order functions can only accept lambda functions as their functional argument. To pass a lambda function to a higher-order function use `->` operator. The left side of the arrow has a formal parameter, which is any ID, or multiple formal parameters – any IDs in a tuple. The right side of the arrow has an expression that can use these formal parameters, as well as any table columns.

Examples:

```
x -> 2 * x
str -> str != Referer
```

A lambda function that accepts multiple arguments can also be passed to a higher-order function. In this case, the higher-order function is passed several arrays of identical length that these arguments will correspond to.

For some functions the first argument (the lambda function) can be omitted. In this case, identical mapping is assumed.

## SQL User Defined Functions

Custom functions from lambda expressions can be created using the [CREATE FUNCTION](../statements/create/function.md) statement. To delete these functions use the [DROP FUNCTION](../statements/drop.md#drop-function) statement.

## Executable User Defined Functions
ClickHouse can call any external executable program or script to process data.

The configuration of executable user defined functions can be located in one or more xml-files. The path to the configuration is specified in the [user_defined_executable_functions_config](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-user_defined_executable_functions_config) parameter.

A function configuration contains the following settings:

- `name` - a function name.
- `command` - script name to execute or command if `execute_direct` is false.
- `argument` - argument description with the `type`, and optional `name` of an argument. Each argument is described in a separate setting. Specifying name is necessary if argument names are part of serialization for user defined function format like [Native](../../interfaces/formats.md#native) or [JSONEachRow](../../interfaces/formats.md#jsoneachrow). Default argument name value is `c` + argument_number.
- `format` - a [format](../../interfaces/formats.md) in which arguments are passed to the command.
- `return_type` - the type of a returned value.
- `return_name` - name of retuned value. Specifying return name is necessary if return name is part of serialization for user defined function format like [Native](../../interfaces/formats.md#native) or [JSONEachRow](../../interfaces/formats.md#jsoneachrow). Optional. Default value is `result`.
- `type` - an executable type. If `type` is set to `executable` then single command is started. If it is set to `executable_pool` then a pool of commands is created.
- `max_command_execution_time` - maximum execution time in seconds for processing block of data. This setting is valid for `executable_pool` commands only. Optional. Default value is `10`.
- `command_termination_timeout` - time in seconds during which a command should finish after its pipe is closed. After that time `SIGTERM` is sent to the process executing the command. Optional. Default value is `10`.
- `command_read_timeout` - timeout for reading data from command stdout in milliseconds. Default value 10000. Optional parameter.
- `command_write_timeout` - timeout for writing data to command stdin in milliseconds. Default value 10000. Optional parameter.
- `pool_size` - the size of a command pool. Optional. Default value is `16`.
- `send_chunk_header` - controls whether to send row count before sending a chunk of data to process. Optional. Default value is `false`.
- `execute_direct` - If `execute_direct` = `1`, then `command` will be searched inside user_scripts folder specified by [user_scripts_path](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-user_scripts_path). Additional script arguments can be specified using whitespace separator. Example: `script_name arg1 arg2`. If `execute_direct` = `0`, `command` is passed as argument for `bin/sh -c`. Default value is `1`. Optional parameter.
- `lifetime` - the reload interval of a function in seconds. If it is set to `0` then the function is not reloaded. Default value is `0`. Optional parameter.

The command must read arguments from `STDIN` and must output the result to `STDOUT`. The command must process arguments iteratively. That is after processing a chunk of arguments it must wait for the next chunk.

**Example**

Creating `test_function` using XML configuration.
File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
```xml
<functions>
    <function>
        <type>executable</type>
        <name>test_function_python</name>
        <return_type>String</return_type>
        <argument>
            <type>UInt64</type>
            <name>value</name>
        </argument>
        <format>TabSeparated</format>
        <command>test_function.py</command>
    </function>
</functions>
```

Script file inside `user_scripts` folder `test_function.py` (`/var/lib/clickhouse/user_scripts/test_function.py` with default path settings).

```python
#!/usr/bin/python3

import sys

if __name__ == '__main__':
    for line in sys.stdin:
        print("Value " + line, end='')
        sys.stdout.flush()
```

Query:

``` sql
SELECT test_function_python(toUInt64(2));
```

Result:

``` text
┌─test_function_python(2)─┐
│ Value 2                 │
└─────────────────────────┘
```

Creating `test_function_sum` manually specifying `execute_direct` to `0` using XML configuration.
File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
```xml
<functions>
    <function>
        <type>executable</type>
        <name>test_function_sum</name>
        <return_type>UInt64</return_type>
        <argument>
            <type>UInt64</type>
            <name>lhs</name>
        </argument>
        <argument>
            <type>UInt64</type>
            <name>rhs</name>
        </argument>
        <format>TabSeparated</format>
        <command>cd /; clickhouse-local --input-format TabSeparated --output-format TabSeparated --structure 'x UInt64, y UInt64' --query "SELECT x + y FROM table"</command>
        <execute_direct>0</execute_direct>
    </function>
</functions>
```

Query:

``` sql
SELECT test_function_sum(2, 2);
```

Result:

``` text
┌─test_function_sum(2, 2)─┐
│                       4 │
└─────────────────────────┘
```

Creating `test_function_sum_json` with named arguments and format [JSONEachRow](../../interfaces/formats.md#jsoneachrow) using XML configuration.
File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
```xml
<functions>
    <function>
        <type>executable</type>
        <name>test_function_sum_json</name>
        <return_type>UInt64</return_type>
        <return_name>result_name</return_name>
        <argument>
            <type>UInt64</type>
            <name>argument_1</name>
        </argument>
        <argument>
            <type>UInt64</type>
            <name>argument_2</name>
        </argument>
        <format>JSONEachRow</format>
        <command>test_function_sum_json.py</command>
    </function>
</functions>
```

Script file inside `user_scripts` folder `test_function_sum_json.py` (`/var/lib/clickhouse/user_scripts/test_function_sum_json.py` with default path settings).

```python
#!/usr/bin/python3

import sys
import json

if __name__ == '__main__':
    for line in sys.stdin:
        value = json.loads(line)
        first_arg = int(value['argument_1'])
        second_arg = int(value['argument_2'])
        result = {'result_name': first_arg + second_arg}
        print(json.dumps(result), end='\n')
        sys.stdout.flush()
```

Query:

``` sql
SELECT test_function_sum_json(2, 2);
```

Result:

``` text
┌─test_function_sum_json(2, 2)─┐
│                            4 │
└──────────────────────────────┘
```

Executable user defined functions can take constant parameters configured in `command` setting (works only for user defined functions with `executable` type).
File `test_function_parameter_python.xml` (`/etc/clickhouse-server/test_function_parameter_python.xml` with default path settings).
```xml
<functions>
    <function>
        <type>executable</type>
        <name>test_function_parameter_python</name>
        <return_type>String</return_type>
        <argument>
            <type>UInt64</type>
        </argument>
        <format>TabSeparated</format>
        <command>test_function_parameter_python.py {test_parameter:UInt64}</command>
    </function>
</functions>
```

Script file inside `user_scripts` folder `test_function_parameter_python.py` (`/var/lib/clickhouse/user_scripts/test_function_parameter_python.py` with default path settings).

```python
#!/usr/bin/python3

import sys

if __name__ == "__main__":
    for line in sys.stdin:
        print("Parameter " + str(sys.argv[1]) + " value " + str(line), end="")
        sys.stdout.flush()
```

Query:

``` sql
SELECT test_function_parameter_python(1)(2);
```

Result:

``` text
┌─test_function_parameter_python(1)(2)─┐
│ Parameter 1 value 2                  │
└──────────────────────────────────────┘
```

## Error Handling

Some functions might throw an exception if the data is invalid. In this case, the query is canceled and an error text is returned to the client. For distributed processing, when an exception occurs on one of the servers, the other servers also attempt to abort the query.

## Evaluation of Argument Expressions

In almost all programming languages, one of the arguments might not be evaluated for certain operators. This is usually the operators `&&`, `||`, and `?:`.
But in ClickHouse, arguments of functions (operators) are always evaluated. This is because entire parts of columns are evaluated at once, instead of calculating each row separately.

## Performing Functions for Distributed Query Processing

For distributed query processing, as many stages of query processing as possible are performed on remote servers, and the rest of the stages (merging intermediate results and everything after that) are performed on the requestor server.

This means that functions can be performed on different servers.
For example, in the query `SELECT f(sum(g(x))) FROM distributed_table GROUP BY h(y),`

- if a `distributed_table` has at least two shards, the functions ‘g’ and ‘h’ are performed on remote servers, and the function ‘f’ is performed on the requestor server.
- if a `distributed_table` has only one shard, all the ‘f’, ‘g’, and ‘h’ functions are performed on this shard’s server.

The result of a function usually does not depend on which server it is performed on. However, sometimes this is important.
For example, functions that work with dictionaries use the dictionary that exists on the server they are running on.
Another example is the `hostName` function, which returns the name of the server it is running on in order to make `GROUP BY` by servers in a `SELECT` query.

If a function in a query is performed on the requestor server, but you need to perform it on remote servers, you can wrap it in an ‘any’ aggregate function or add it to a key in `GROUP BY`.


## Related Content

- [User-defined functions in ClickHouse Cloud](https://clickhouse.com/blog/user-defined-functions-clickhouse-udfs)
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												add slugs

											
										
										
											2022-08-28 14:53:34 +00:00
+								slug: /en/sql-reference/functions/
-												Docs: Sort functions in sidebar

											
										
										
											2023-04-19 17:05:55 +00:00
+								sidebar_position: 1
 								sidebar_label: Overview
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												Docs: Sort functions in sidebar

											
										
										
											2023-04-19 17:05:55 +00:00
+								# Regular Functions
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Avoid short syntax

											
										
										
											2021-05-27 19:44:11 +00:00
+								There are at least\* two types of functions - regular functions (they are just called “functions”) and aggregate functions. These are completely different concepts. Regular functions work as if they are applied to each row separately (for each row, the result of the function does not depend on the other rows). Aggregate functions accumulate a set of values from various rows (i.e. they depend on the entire set of rows).
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								In this section we discuss regular functions. For aggregate functions, see the section “Aggregate functions”.
-												CLICKHOUSE-2720: progress on website and reference (#886)

* update presentations

* CLICKHOUSE-2936: redirect from clickhouse.yandex.ru and clickhouse.yandex.com

* update submodule

* lost files

* CLICKHOUSE-2981: prefer sphinx docs over original reference

* CLICKHOUSE-2981: docs styles more similar to main website + add flags to switch language links

* update presentations

* Less confusing directory structure (docs -> doc/reference/)

* Minify sphinx docs too

* Website release script: fail fast + pass docker hash on deploy

* Do not underline links in docs

* shorter

* cleanup docker images

* tune nginx config

* CLICKHOUSE-3043: get rid of habrastorage links

* Lost translation

* CLICKHOUSE-2936: temporary client-side redirect

* behaves weird in test

* put redirect back

* CLICKHOUSE-3047: copy docs txts to public too

* move to proper file

* remove old pages to avoid confusion

* Remove reference redirect warning for now

* Refresh README.md

* Yellow buttons in docs

* Use svg flags instead of unicode ones in docs

* fix test website instance

* Put flags to separate files

* wrong flag

* Copy Yandex.Metrica introduction from main page to docs

* Yet another home page structure change, couple new blocks (CLICKHOUSE-3045)

* Update Contacts section

* CLICKHOUSE-2849: more detailed legal information

* CLICKHOUSE-2978 preparation - split by files

* More changes in Contacts block

* Tune texts on index page

* update presentations

* One more benchmark

* Add usage sections to index page, adapted from slides

* Get the roadmap started, based on slides from last ClickHouse Meetup

* CLICKHOUSE-2977: some rendering tuning

* Get rid of excessive section in the end of getting started

* Make headers linkable

* CLICKHOUSE-2981: links to editing reference - https://github.com/yandex/ClickHouse/issues/849

* CLICKHOUSE-2981: fix mobile styles in docs

* Ban crawling of duplicating docs

* Open some external links in new tab

* Ban old docs too

* Lots of trivial fixes in english docs

* Lots of trivial fixes in russian docs

* Remove getting started copies in markdown

* Add Yandex.Webmaster

* Fix some sphinx warnings

* More warnings fixed in english docs

* More sphinx warnings fixed

* Add code-block:: text

* More code-block:: text

* These headers look not that well

* Better switch between documentation languages

* merge use_case.rst into ya_metrika_task.rst

* Edit the agg_functions.rst texts

* Add lost empty lines

* Lost blank lines

* Add new logo sizes

* update presentations

* Next step in migrating to new documentation

* Fix all warnings in en reference

* Fix all warnings in ru reference

* Re-arrange existing reference

* Move operation tips to main reference

* Fix typos noticed by milovidov@

* Get rid of zookeeper.md

* Looks like duplicate of tutorial.html

* Fix some mess with html tags in tutorial

* No idea why nobody noticed this before, but it was completely not clear whet to get the data

* Match code block styling between main and tutorial pages (in favor of the latter)

* Get rid of some copypaste in tutorial

* Normalize header styles

* Move example_datasets to sphinx

* Move presentations submodule to website

* Move and update README.md

* No point in duplicating articles from habrahabr here

* Move development-related docs as is for now

* doc/reference/ -> docs/ (to match the URL on website)

* Adapt links to match the previous commit

* Adapt development docs to rst (still lacks translation and strikethrough support)

* clean on release

* blacklist presentations in gulp

* strikethrough support in sphinx

* just copy development folder for now

* fix weird introduction in style article

* Style guide translation (WIP)

* Finish style guide translation to English

* gulp clean separately

* Update year in LICENSE

* Initial CONTRIBUTING.md

* Fix remaining links to old docs in tutorial

* Some tutorial fixes

* Typo

* Another typo

* Update list of authors from yandex-team accoding to git log

											
										
										
											2017-06-20 14:19:03 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								\* - There is a third type of function that the ‘arrayJoin’ function belongs to; table functions can also be mentioned separately.\*
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Strong Typing
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Avoid short syntax

											
										
										
											2021-05-27 19:44:11 +00:00
+								In contrast to standard SQL, ClickHouse has strong typing. In other words, it does not make implicit conversions between types. Each function works for a specific set of types. This means that sometimes you need to use type conversion functions.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Common Subexpression Elimination
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								All expressions in a query that have the same AST (the same record or same result of syntactic parsing) are considered to have identical values. Such expressions are concatenated and executed once. Identical subqueries are also eliminated this way.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Types of Results
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								All functions return a single return as the result (not several values, and not zero values). The type of result is usually defined only by the types of arguments, not by the values. Exceptions are the tupleElement function (the a.N operator), and the toFixedString function.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Constants
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								For simplicity, certain functions can only work with constants for some arguments. For example, the right argument of the LIKE operator must be a constant.
 								Almost all functions return a constant for constant arguments. The exception is functions that generate random numbers.
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								The ‘now’ function returns different values for queries that were run at different times, but the result is considered a constant, since constancy is only important within a single query.
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								A constant expression is also considered a constant (for example, the right half of the LIKE operator can be constructed from multiple constants).
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								Functions can be implemented in different ways for constant and non-constant arguments (different code is executed). But the results for a constant and for a true column containing only the same value should match each other.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## NULL Processing
-												Update of english documentation (#2918)

* Updating of english translation.

* Some bugs are fixed.

											
										
										
											2018-09-04 11:18:59 +00:00
 								Functions have the following behaviors:
-												Docs: Replace annoying three spaces in enumerations by a single space

											
										
										
											2023-04-19 15:55:29 +00:00
+								- If at least one of the arguments of the function is `NULL`, the function result is also `NULL`.
 								- Special behavior that is specified individually in the description of each function. In the ClickHouse source code, these functions have `UseDefaultImplementationForNulls=false`.
-												Update of english documentation (#2918)

* Updating of english translation.

* Some bugs are fixed.

											
										
										
											2018-09-04 11:18:59 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Constancy
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								Functions can’t change the values of their arguments – any changes are returned as the result. Thus, the result of calculating separate functions does not depend on the order in which the functions are written in the query.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Higher-order functions, `->` operator and lambda(params, expr) function
-												DOCSUP-924: Higher-order functions moved to Array functions (#14092)

* Higher-order functions description moved to Array functions (English).

* Bad anchor fixed.

* Update docs/en/sql-reference/functions/array-functions.md

Co-authored-by: Ivan Blinkov <github@blinkov.ru>

* Higher-order functions description moved to Array functions (Russian).

* Update array-functions.md

Minor fixes in Russian text.

Co-authored-by: Olga Revyakina <revolg@yandex.ru>
Co-authored-by: Ivan Blinkov <github@blinkov.ru>
											
										
										
											2020-09-08 15:55:00 +00:00
 								Higher-order functions can only accept lambda functions as their functional argument. To pass a lambda function to a higher-order function use `->` operator. The left side of the arrow has a formal parameter, which is any ID, or multiple formal parameters – any IDs in a tuple. The right side of the arrow has an expression that can use these formal parameters, as well as any table columns.
-												Remove trailing whitespaces from docs

											
										
										
											2021-07-29 15:20:55 +00:00
+								Examples:
-												DOCSUP-924: Higher-order functions moved to Array functions (#14092)

* Higher-order functions description moved to Array functions (English).

* Bad anchor fixed.

* Update docs/en/sql-reference/functions/array-functions.md

Co-authored-by: Ivan Blinkov <github@blinkov.ru>

* Higher-order functions description moved to Array functions (Russian).

* Update array-functions.md

Minor fixes in Russian text.

Co-authored-by: Olga Revyakina <revolg@yandex.ru>
Co-authored-by: Ivan Blinkov <github@blinkov.ru>
											
										
										
											2020-09-08 15:55:00 +00:00
 								```
 								x -> 2 * x
 								str -> str != Referer
 								```
 								A lambda function that accepts multiple arguments can also be passed to a higher-order function. In this case, the higher-order function is passed several arrays of identical length that these arguments will correspond to.
 								For some functions the first argument (the lambda function) can be omitted. In this case, identical mapping is assumed.
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## SQL User Defined Functions
-												Links to CREATE FUNCTION and DROP FUNCTION from function Introduction

											
										
										
											2021-09-07 19:32:12 +00:00
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								Custom functions from lambda expressions can be created using the [CREATE FUNCTION](../statements/create/function.md) statement. To delete these functions use the [DROP FUNCTION](../statements/drop.md#drop-function) statement.
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Executable User Defined Functions
-												Fix documentation for executable user defined functions configuration

											
										
										
											2022-04-28 16:01:10 +00:00
+								ClickHouse can call any external executable program or script to process data.
 								The configuration of executable user defined functions can be located in one or more xml-files. The path to the configuration is specified in the [user_defined_executable_functions_config](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-user_defined_executable_functions_config) parameter.
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
-												Improved text

											
										
										
											2021-09-26 19:44:53 +00:00
+								A function configuration contains the following settings:
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
-												Docs: Replace annoying three spaces in enumerations by a single space

											
										
										
											2023-04-19 15:55:29 +00:00
+								- `name` - a function name.
 								- `command` - script name to execute or command if `execute_direct` is false.
 								- `argument` - argument description with the `type`, and optional `name` of an argument. Each argument is described in a separate setting. Specifying name is necessary if argument names are part of serialization for user defined function format like [Native](../../interfaces/formats.md#native) or [JSONEachRow](../../interfaces/formats.md#jsoneachrow). Default argument name value is `c` + argument_number.
 								- `format` - a [format](../../interfaces/formats.md) in which arguments are passed to the command.
 								- `return_type` - the type of a returned value.
 								- `return_name` - name of retuned value. Specifying return name is necessary if return name is part of serialization for user defined function format like [Native](../../interfaces/formats.md#native) or [JSONEachRow](../../interfaces/formats.md#jsoneachrow). Optional. Default value is `result`.
 								- `type` - an executable type. If `type` is set to `executable` then single command is started. If it is set to `executable_pool` then a pool of commands is created.
 								- `max_command_execution_time` - maximum execution time in seconds for processing block of data. This setting is valid for `executable_pool` commands only. Optional. Default value is `10`.
 								- `command_termination_timeout` - time in seconds during which a command should finish after its pipe is closed. After that time `SIGTERM` is sent to the process executing the command. Optional. Default value is `10`.
 								- `command_read_timeout` - timeout for reading data from command stdout in milliseconds. Default value 10000. Optional parameter.
 								- `command_write_timeout` - timeout for writing data to command stdin in milliseconds. Default value 10000. Optional parameter.
 								- `pool_size` - the size of a command pool. Optional. Default value is `16`.
 								- `send_chunk_header` - controls whether to send row count before sending a chunk of data to process. Optional. Default value is `false`.
 								- `execute_direct` - If `execute_direct` = `1`, then `command` will be searched inside user_scripts folder specified by [user_scripts_path](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-user_scripts_path). Additional script arguments can be specified using whitespace separator. Example: `script_name arg1 arg2`. If `execute_direct` = `0`, `command` is passed as argument for `bin/sh -c`. Default value is `1`. Optional parameter.
 								- `lifetime` - the reload interval of a function in seconds. If it is set to `0` then the function is not reloaded. Default value is `0`. Optional parameter.
-												Improved text

											
										
										
											2021-09-26 19:44:53 +00:00
-												Update docs/en/sql-reference/functions/index.md

Co-authored-by: Anna <42538400+adevyatova@users.noreply.github.com>
											
										
										
											2021-09-29 05:25:44 +00:00
+								The command must read arguments from `STDIN` and must output the result to `STDOUT`. The command must process arguments iteratively. That is after processing a chunk of arguments it must wait for the next chunk.
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
 								**Example**
-												Fix documentation for executable user defined functions configuration

											
										
										
											2022-04-28 16:01:10 +00:00
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								Creating `test_function` using XML configuration.
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								```xml
 								<functions>
 								    <function>
 								        <type>executable</type>
 								        <name>test_function_python</name>
 								        <return_type>String</return_type>
 								        <argument>
 								            <type>UInt64</type>
-												Updated documenttation

											
										
										
											2022-02-16 15:49:27 +00:00
+								            <name>value</name>
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								        </argument>
 								        <format>TabSeparated</format>
 								        <command>test_function.py</command>
 								    </function>
 								</functions>
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								```
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								Script file inside `user_scripts` folder `test_function.py` (`/var/lib/clickhouse/user_scripts/test_function.py` with default path settings).
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
 								```python
 								#!/usr/bin/python3
 								import sys
 								if __name__ == '__main__':
 								    for line in sys.stdin:
 								        print("Value " + line, end='')
 								        sys.stdout.flush()
 								```
 								Query:
 								``` sql
 								SELECT test_function_python(toUInt64(2));
 								```
 								Result:
 								``` text
 								┌─test_function_python(2)─┐
 								│ Value 2                 │
 								└─────────────────────────┘
 								```
 								Creating `test_function_sum` manually specifying `execute_direct` to `0` using XML configuration.
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								```xml
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								<functions>
 								    <function>
-												en fixes

											
										
										
											2021-09-27 20:15:15 +00:00
+								        <type>executable</type>
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								        <name>test_function_sum</name>
-												en fixes

											
										
										
											2021-09-27 20:15:15 +00:00
+								        <return_type>UInt64</return_type>
 								        <argument>
 								            <type>UInt64</type>
-												Updated documenttation

											
										
										
											2022-02-16 15:49:27 +00:00
+								            <name>lhs</name>
-												en fixes

											
										
										
											2021-09-27 20:15:15 +00:00
+								        </argument>
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								        <argument>
-												en fixes

											
										
										
											2021-09-27 20:15:15 +00:00
+								            <type>UInt64</type>
-												Updated documenttation

											
										
										
											2022-02-16 15:49:27 +00:00
+								            <name>rhs</name>
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								        </argument>
 								        <format>TabSeparated</format>
-												en fixes

											
										
										
											2021-09-27 20:15:15 +00:00
+								        <command>cd /; clickhouse-local --input-format TabSeparated --output-format TabSeparated --structure 'x UInt64, y UInt64' --query "SELECT x + y FROM table"</command>
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								        <execute_direct>0</execute_direct>
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								    </function>
 								</functions>
 								```
 								Query:
 								``` sql
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								SELECT test_function_sum(2, 2);
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								```
 								Result:
 								``` text
-												UserDefinedExecutableFunction fix documentation

											
										
										
											2022-01-02 13:46:11 +00:00
+								┌─test_function_sum(2, 2)─┐
 								│                       4 │
 								└─────────────────────────┘
-												en draft

											
										
										
											2021-09-24 11:44:29 +00:00
+								```
-												Updated documentation

											
										
										
											2022-02-18 15:35:18 +00:00
+								Creating `test_function_sum_json` with named arguments and format [JSONEachRow](../../interfaces/formats.md#jsoneachrow) using XML configuration.
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								File `test_function.xml` (`/etc/clickhouse-server/test_function.xml` with default path settings).
-												Updated documentation

											
										
										
											2022-02-18 15:35:18 +00:00
+								```xml
-												ExecutableUserDefinedFunctions added parameters documentation

											
										
										
											2022-06-08 14:59:41 +00:00
+								<functions>
 								    <function>
 								        <type>executable</type>
 								        <name>test_function_sum_json</name>
 								        <return_type>UInt64</return_type>
 								        <return_name>result_name</return_name>
 								        <argument>
 								            <type>UInt64</type>
 								            <name>argument_1</name>
 								        </argument>
 								        <argument>
 								            <type>UInt64</type>
 								            <name>argument_2</name>
 								        </argument>
 								        <format>JSONEachRow</format>
 								        <command>test_function_sum_json.py</command>
 								    </function>
 								</functions>
-												Updated documentation

											
										
										
											2022-02-18 15:35:18 +00:00
+								```
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								Script file inside `user_scripts` folder `test_function_sum_json.py` (`/var/lib/clickhouse/user_scripts/test_function_sum_json.py` with default path settings).
-												Updated documentation

											
										
										
											2022-02-18 15:35:18 +00:00
 								```python
 								#!/usr/bin/python3
 								import sys
 								import json
 								if __name__ == '__main__':
 								    for line in sys.stdin:
 								        value = json.loads(line)
 								        first_arg = int(value['argument_1'])
 								        second_arg = int(value['argument_2'])
 								        result = {'result_name': first_arg + second_arg}
 								        print(json.dumps(result), end='\n')
 								        sys.stdout.flush()
 								```
 								Query:
 								``` sql
 								SELECT test_function_sum_json(2, 2);
 								```
 								Result:
 								``` text
 								┌─test_function_sum_json(2, 2)─┐
 								│                            4 │
 								└──────────────────────────────┘
-												ExecutableUserDefinedFunctions added parameters documentation

											
										
										
											2022-06-08 14:59:41 +00:00
+								```
 								Executable user defined functions can take constant parameters configured in `command` setting (works only for user defined functions with `executable` type).
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								File `test_function_parameter_python.xml` (`/etc/clickhouse-server/test_function_parameter_python.xml` with default path settings).
-												ExecutableUserDefinedFunctions added parameters documentation

											
										
										
											2022-06-08 14:59:41 +00:00
+								```xml
 								<functions>
 								    <function>
 								        <type>executable</type>
 								        <name>test_function_parameter_python</name>
 								        <return_type>String</return_type>
 								        <argument>
 								            <type>UInt64</type>
 								        </argument>
 								        <format>TabSeparated</format>
 								        <command>test_function_parameter_python.py {test_parameter:UInt64}</command>
 								    </function>
 								</functions>
 								```
-												Update index.md
											
										
										
											2023-03-30 20:01:16 +00:00
+								Script file inside `user_scripts` folder `test_function_parameter_python.py` (`/var/lib/clickhouse/user_scripts/test_function_parameter_python.py` with default path settings).
-												ExecutableUserDefinedFunctions added parameters documentation

											
										
										
											2022-06-08 14:59:41 +00:00
 								```python
 								#!/usr/bin/python3
 								import sys
 								if __name__ == "__main__":
 								    for line in sys.stdin:
 								        print("Parameter " + str(sys.argv[1]) + " value " + str(line), end="")
 								        sys.stdout.flush()
 								```
 								Query:
 								``` sql
 								SELECT test_function_parameter_python(1)(2);
 								```
 								Result:
 								``` text
 								┌─test_function_parameter_python(1)(2)─┐
 								│ Parameter 1 value 2                  │
 								└──────────────────────────────────────┘
-												Updated documentation

											
										
										
											2022-02-18 15:35:18 +00:00
+								```
-												Links to CREATE FUNCTION and DROP FUNCTION from function Introduction

											
										
										
											2021-09-07 19:32:12 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Error Handling
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								Some functions might throw an exception if the data is invalid. In this case, the query is canceled and an error text is returned to the client. For distributed processing, when an exception occurs on one of the servers, the other servers also attempt to abort the query.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Evaluation of Argument Expressions
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								In almost all programming languages, one of the arguments might not be evaluated for certain operators. This is usually the operators `&&`, `||`, and `?:`.
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								But in ClickHouse, arguments of functions (operators) are always evaluated. This is because entire parts of columns are evaluated at once, instead of calculating each row separately.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								## Performing Functions for Distributed Query Processing
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								For distributed query processing, as many stages of query processing as possible are performed on remote servers, and the rest of the stages (merging intermediate results and everything after that) are performed on the requestor server.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								This means that functions can be performed on different servers.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								For example, in the query `SELECT f(sum(g(x))) FROM distributed_table GROUP BY h(y),`
-												Docs: Replace annoying three spaces in enumerations by a single space

											
										
										
											2023-04-19 15:55:29 +00:00
+								- if a `distributed_table` has at least two shards, the functions ‘g’ and ‘h’ are performed on remote servers, and the function ‘f’ is performed on the requestor server.
 								- if a `distributed_table` has only one shard, all the ‘f’, ‘g’, and ‘h’ functions are performed on this shard’s server.
-												Initial commit if EN docs

											
										
										
											2017-04-03 19:49:50 +00:00
-												Avoid short syntax

											
										
										
											2021-05-27 19:44:11 +00:00
+								The result of a function usually does not depend on which server it is performed on. However, sometimes this is important.
-												Functions

											
										
										
											2017-04-26 19:16:38 +00:00
+								For example, functions that work with dictionaries use the dictionary that exists on the server they are running on.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
+								Another example is the `hostName` function, which returns the name of the server it is running on in order to make `GROUP BY` by servers in a `SELECT` query.
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								If a function in a query is performed on the requestor server, but you need to perform it on remote servers, you can wrap it in an ‘any’ aggregate function or add it to a key in `GROUP BY`.
-												WIP on docs/website (#3383)

* CLICKHOUSE-4063: less manual html @ index.md

* CLICKHOUSE-4063: recommend markdown="1" in README.md

* CLICKHOUSE-4003: manually purge custom.css for now

* CLICKHOUSE-4064: expand <details> before any print (including to pdf)

* CLICKHOUSE-3927: rearrange interfaces/formats.md a bit

* CLICKHOUSE-3306: add few http headers

* Remove copy-paste introduced in #3392

* Hopefully better chinese fonts #3392

* get rid of tabs @ custom.css

* Apply comments and patch from #3384

* Add jdbc.md to ToC and some translation, though it still looks badly incomplete

* minor punctuation

* Add some backlinks to official website from mirrors that just blindly take markdown sources

* Do not make fonts extra light

* find . -name '*.md' -type f | xargs -I{} perl -pi -e 's//g' {}

* find . -name '*.md' -type f | xargs -I{} perl -pi -e 's/ sql/g' {}

* Remove outdated stuff from roadmap.md

* Not so light font on front page too

* Refactor Chinese formats.md to match recent changes in other languages

											
										
										
											2018-10-16 10:47:17 +00:00
-												cross link dictionaries + udf posts£

											
										
										
											2022-12-14 15:01:15 +00:00
 								## Related Content
 								- [User-defined functions in ClickHouse Cloud](https://clickhouse.com/blog/user-defined-functions-clickhouse-udfs)