2020-04-03 13:23:32 +00:00
---
2022-08-28 14:53:34 +00:00
slug: /en/sql-reference/functions/other-functions
2022-04-09 13:29:05 +00:00
sidebar_position: 67
sidebar_label: Other
2020-04-03 13:23:32 +00:00
---
2022-06-02 10:55:18 +00:00
# Other Functions
2017-12-28 15:13:23 +00:00
2022-06-02 10:55:18 +00:00
## hostName()
2017-12-28 15:13:23 +00:00
Returns a string with the name of the host that this function was performed on. For distributed processing, this is the name of the remote server host, if the function is performed on a remote server.
2021-10-06 20:13:46 +00:00
If it is executed in the context of a distributed table, then it generates a normal column with values relevant to each shard. Otherwise it produces a constant value.
2017-12-28 15:13:23 +00:00
2022-06-02 10:55:18 +00:00
## getMacro
2020-05-02 18:01:13 +00:00
Gets a named value from the [macros ](../../operations/server-configuration-parameters/settings.md#macros ) section of the server configuration.
2020-06-18 08:24:31 +00:00
**Syntax**
2020-05-02 18:01:13 +00:00
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 18:01:13 +00:00
getMacro(name);
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-05-02 18:01:13 +00:00
2020-06-18 08:24:31 +00:00
- `name` — Name to retrieve from the `macros` section. [String ](../../sql-reference/data-types/string.md#string ).
2020-05-02 18:01:13 +00:00
**Returned value**
2020-06-18 08:24:31 +00:00
- Value of the specified macro.
2020-05-02 18:01:13 +00:00
Type: [String ](../../sql-reference/data-types/string.md ).
**Example**
The example `macros` section in the server configuration file:
2020-06-18 08:24:31 +00:00
``` xml
2020-05-02 18:01:13 +00:00
< macros >
< test > Value< / test >
< / macros >
```
Query:
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 18:01:13 +00:00
SELECT getMacro('test');
```
Result:
2020-06-18 08:24:31 +00:00
``` text
2020-05-02 18:01:13 +00:00
┌─getMacro('test')─┐
│ Value │
└──────────────────┘
```
An alternative way to get the same value:
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 18:01:13 +00:00
SELECT * FROM system.macros
WHERE macro = 'test';
```
2020-06-18 08:24:31 +00:00
``` text
2020-05-02 18:01:13 +00:00
┌─macro─┬─substitution─┐
│ test │ Value │
└───────┴──────────────┘
```
2022-06-02 10:55:18 +00:00
## FQDN
2019-11-26 08:40:55 +00:00
2019-12-28 23:11:02 +00:00
Returns the fully qualified domain name.
2019-11-26 08:40:55 +00:00
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2019-11-26 08:40:55 +00:00
fqdn();
```
This function is case-insensitive.
**Returned value**
2020-03-21 04:11:51 +00:00
- String with the fully qualified domain name.
2019-11-26 08:40:55 +00:00
Type: `String` .
**Example**
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-26 08:40:55 +00:00
SELECT FQDN();
```
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-11-26 08:40:55 +00:00
┌─FQDN()──────────────────────────┐
│ clickhouse.ru-central1.internal │
└─────────────────────────────────┘
```
2019-10-12 07:17:30 +00:00
2022-06-02 10:55:18 +00:00
## basename
2019-05-06 08:25:46 +00:00
2019-06-14 11:26:46 +00:00
Extracts the trailing part of a string after the last slash or backslash. This function if often used to extract the filename from a path.
2019-05-06 08:25:46 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-05-06 08:25:46 +00:00
basename( expr )
```
2021-02-15 21:22:10 +00:00
**Arguments**
2019-05-06 08:25:46 +00:00
2020-04-30 18:19:18 +00:00
- `expr` — Expression resulting in a [String ](../../sql-reference/data-types/string.md ) type value. All the backslashes must be escaped in the resulting value.
2019-05-06 08:25:46 +00:00
2019-05-20 15:06:11 +00:00
**Returned Value**
2019-05-06 08:25:46 +00:00
2019-06-14 11:26:46 +00:00
A string that contains:
2019-05-06 08:25:46 +00:00
2020-03-21 04:11:51 +00:00
- The trailing part of a string after the last slash or backslash.
2019-05-07 15:07:29 +00:00
2020-03-21 04:11:51 +00:00
If the input string contains a path ending with slash or backslash, for example, `/` or `c:\` , the function returns an empty string.
2019-05-07 15:07:29 +00:00
2020-03-21 04:11:51 +00:00
- The original string if there are no slashes or backslashes.
2019-05-06 08:25:46 +00:00
**Example**
2020-03-20 10:10:48 +00:00
``` sql
2019-05-07 15:07:29 +00:00
SELECT 'some/long/path/to/file' AS a, basename(a)
2019-05-06 08:25:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-05-07 15:07:29 +00:00
┌─a──────────────────────┬─basename('some\\long\\path\\to\\file')─┐
│ some\long\path\to\file │ file │
└────────────────────────┴────────────────────────────────────────┘
2019-05-06 08:25:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` sql
2019-05-07 15:07:29 +00:00
SELECT 'some\\long\\path\\to\\file' AS a, basename(a)
2019-05-07 14:56:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-05-07 15:07:29 +00:00
┌─a──────────────────────┬─basename('some\\long\\path\\to\\file')─┐
│ some\long\path\to\file │ file │
└────────────────────────┴────────────────────────────────────────┘
2019-05-07 14:56:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` sql
2019-05-07 15:07:29 +00:00
SELECT 'some-file-name' AS a, basename(a)
2019-05-06 08:25:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-05-07 15:07:29 +00:00
┌─a──────────────┬─basename('some-file-name')─┐
│ some-file-name │ some-file-name │
└────────────────┴────────────────────────────┘
2019-05-06 08:25:46 +00:00
```
2022-06-02 10:55:18 +00:00
## visibleWidth(x)
2017-12-28 15:13:23 +00:00
Calculates the approximate width when outputting values to the console in text format (tab-separated).
This function is used by the system for implementing Pretty formats.
2018-09-04 11:18:59 +00:00
`NULL` is represented as a string corresponding to `NULL` in `Pretty` formats.
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
SELECT visibleWidth(NULL)
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─visibleWidth(NULL)─┐
│ 4 │
└────────────────────┘
```
2022-06-02 10:55:18 +00:00
## toTypeName(x)
2017-12-28 15:13:23 +00:00
Returns a string containing the type name of the passed argument.
2018-09-04 11:18:59 +00:00
If `NULL` is passed to the function as input, then it returns the `Nullable(Nothing)` type, which corresponds to an internal `NULL` representation in ClickHouse.
2022-06-02 10:55:18 +00:00
## blockSize()
2017-12-28 15:13:23 +00:00
Gets the size of the block.
In ClickHouse, queries are always run on blocks (sets of column parts). This function allows getting the size of the block that you called it for.
2022-06-02 10:55:18 +00:00
## byteSize
2021-01-28 11:38:24 +00:00
Returns estimation of uncompressed byte size of its arguments in memory.
**Syntax**
```sql
2021-01-28 14:13:41 +00:00
byteSize(argument [, ...])
2021-01-28 11:38:24 +00:00
```
2021-02-15 21:22:10 +00:00
**Arguments**
2021-01-28 11:38:24 +00:00
- `argument` — Value.
**Returned value**
- Estimation of byte size of the arguments in memory.
2021-01-28 12:14:57 +00:00
Type: [UInt64 ](../../sql-reference/data-types/int-uint.md ).
2021-01-28 11:38:24 +00:00
2021-01-31 18:23:57 +00:00
**Examples**
For [String ](../../sql-reference/data-types/string.md ) arguments the funtion returns the string length + 9 (terminating zero + length).
2021-01-28 11:38:24 +00:00
Query:
```sql
SELECT byteSize('string');
```
Result:
2021-01-31 18:23:57 +00:00
```text
2021-01-28 11:38:24 +00:00
┌─byteSize('string')─┐
│ 15 │
└────────────────────┘
2021-01-31 18:23:57 +00:00
```
2021-01-28 11:38:24 +00:00
Query:
```sql
CREATE TABLE test
(
`key` Int32,
`u8` UInt8,
`u16` UInt16,
`u32` UInt32,
`u64` UInt64,
`i8` Int8,
`i16` Int16,
`i32` Int32,
`i64` Int64,
`f32` Float32,
`f64` Float64
)
ENGINE = MergeTree
ORDER BY key;
2021-01-31 18:23:57 +00:00
INSERT INTO test VALUES(1, 8, 16, 32, 64, -8, -16, -32, -64, 32.32, 64.64);
2021-01-28 11:38:24 +00:00
2021-02-02 02:54:38 +00:00
SELECT key, byteSize(u8) AS `byteSize(UInt8)` , byteSize(u16) AS `byteSize(UInt16)` , byteSize(u32) AS `byteSize(UInt32)` , byteSize(u64) AS `byteSize(UInt64)` , byteSize(i8) AS `byteSize(Int8)` , byteSize(i16) AS `byteSize(Int16)` , byteSize(i32) AS `byteSize(Int32)` , byteSize(i64) AS `byteSize(Int64)` , byteSize(f32) AS `byteSize(Float32)` , byteSize(f64) AS `byteSize(Float64)` FROM test ORDER BY key ASC FORMAT Vertical;
2021-01-28 11:38:24 +00:00
```
Result:
``` text
2021-01-31 18:23:57 +00:00
Row 1:
──────
key: 1
2021-02-02 02:54:38 +00:00
byteSize(UInt8): 1
byteSize(UInt16): 2
byteSize(UInt32): 4
byteSize(UInt64): 8
byteSize(Int8): 1
byteSize(Int16): 2
byteSize(Int32): 4
byteSize(Int64): 8
2021-01-31 18:23:57 +00:00
byteSize(Float32): 4
byteSize(Float64): 8
2021-01-28 11:38:24 +00:00
```
2021-02-02 02:54:38 +00:00
If the function takes multiple arguments, it returns their combined byte size.
2021-01-28 11:38:24 +00:00
Query:
```sql
SELECT byteSize(NULL, 1, 0.3, '');
```
Result:
```text
┌─byteSize(NULL, 1, 0.3, '')─┐
│ 19 │
└────────────────────────────┘
```
2020-12-28 17:55:47 +00:00
2022-06-02 10:55:18 +00:00
## materialize(x)
2017-12-28 15:13:23 +00:00
Turns a constant into a full column containing just one value.
In ClickHouse, full columns and constants are represented differently in memory. Functions work differently for constant arguments and normal arguments (different code is executed), although the result is almost always the same. This function is for debugging this behavior.
2022-06-02 10:55:18 +00:00
## ignore(…)
2017-12-28 15:13:23 +00:00
2018-09-04 11:18:59 +00:00
Accepts any arguments, including `NULL` . Always returns 0.
2017-12-28 15:13:23 +00:00
However, the argument is still evaluated. This can be used for benchmarks.
2022-06-02 10:55:18 +00:00
## sleep(seconds)
2017-12-28 15:13:23 +00:00
2020-03-20 10:10:48 +00:00
Sleeps ‘ seconds’ seconds on each data block. You can specify an integer or a floating-point number.
2017-12-28 15:13:23 +00:00
2022-06-02 10:55:18 +00:00
## sleepEachRow(seconds)
2019-01-30 10:39:46 +00:00
2020-03-20 10:10:48 +00:00
Sleeps ‘ seconds’ seconds on each row. You can specify an integer or a floating-point number.
2019-01-30 10:39:46 +00:00
2022-06-02 10:55:18 +00:00
## currentDatabase()
2017-12-28 15:13:23 +00:00
Returns the name of the current database.
You can use this function in table engine parameters in a CREATE TABLE query where you need to specify the database.
2022-06-02 10:55:18 +00:00
## currentUser()
2019-09-29 11:15:26 +00:00
Returns the login of current user. Login of user, that initiated query, will be returned in case distibuted query.
2020-03-20 10:10:48 +00:00
``` sql
2019-10-04 07:24:41 +00:00
SELECT currentUser();
2019-09-29 11:15:26 +00:00
```
Alias: `user()` , `USER()` .
**Returned values**
2020-03-21 04:11:51 +00:00
- Login of current user.
- Login of user that initiated query in case of disributed query.
2019-09-29 11:15:26 +00:00
Type: `String` .
**Example**
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-10-04 07:24:41 +00:00
SELECT currentUser();
2019-09-29 11:15:26 +00:00
```
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-09-29 11:15:26 +00:00
┌─currentUser()─┐
│ default │
└───────────────┘
```
2019-08-13 16:08:12 +00:00
2022-06-02 10:55:18 +00:00
## isConstant
2020-05-02 15:14:10 +00:00
Checks whether the argument is a constant expression.
2020-06-18 08:24:31 +00:00
A constant expression means an expression whose resulting value is known at the query analysis (i.e. before execution). For example, expressions over [literals ](../../sql-reference/syntax.md#literals ) are constant expressions.
2020-05-02 15:14:10 +00:00
The function is intended for development, debugging and demonstration.
**Syntax**
``` sql
isConstant(x)
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-05-02 15:14:10 +00:00
2020-06-18 08:24:31 +00:00
- `x` — Expression to check.
2020-05-02 15:14:10 +00:00
**Returned values**
2020-06-18 08:24:31 +00:00
- `1` — `x` is constant.
- `0` — `x` is non-constant.
2020-05-02 15:14:10 +00:00
2020-06-18 08:24:31 +00:00
Type: [UInt8 ](../../sql-reference/data-types/int-uint.md ).
2020-05-02 15:14:10 +00:00
**Examples**
Query:
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 15:14:10 +00:00
SELECT isConstant(x + 1) FROM (SELECT 43 AS x)
```
Result:
2020-06-18 08:24:31 +00:00
``` text
2020-05-02 15:14:10 +00:00
┌─isConstant(plus(x, 1))─┐
│ 1 │
└────────────────────────┘
```
Query:
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 15:14:10 +00:00
WITH 3.14 AS pi SELECT isConstant(cos(pi))
```
Result:
2020-06-18 08:24:31 +00:00
``` text
2020-05-02 15:14:10 +00:00
┌─isConstant(cos(pi))─┐
│ 1 │
└─────────────────────┘
```
Query:
2020-06-18 08:24:31 +00:00
``` sql
2020-05-02 15:14:10 +00:00
SELECT isConstant(number) FROM numbers(1)
```
Result:
2020-06-18 08:24:31 +00:00
``` text
2020-05-02 15:14:10 +00:00
┌─isConstant(number)─┐
│ 0 │
└────────────────────┘
```
2022-06-02 10:55:18 +00:00
## isFinite(x)
2017-12-28 15:13:23 +00:00
Accepts Float32 and Float64 and returns UInt8 equal to 1 if the argument is not infinite and not a NaN, otherwise 0.
2022-06-02 10:55:18 +00:00
## isInfinite(x)
2017-12-28 15:13:23 +00:00
Accepts Float32 and Float64 and returns UInt8 equal to 1 if the argument is infinite, otherwise 0. Note that 0 is returned for a NaN.
2022-06-02 10:55:18 +00:00
## ifNotFinite
2020-03-17 15:24:11 +00:00
Checks whether floating point value is finite.
2020-03-20 10:10:48 +00:00
**Syntax**
2020-03-17 15:24:11 +00:00
2020-03-21 04:11:51 +00:00
ifNotFinite(x,y)
2020-03-17 15:24:11 +00:00
2021-02-15 21:22:10 +00:00
**Arguments**
2020-03-20 10:10:48 +00:00
2020-04-30 18:19:18 +00:00
- `x` — Value to be checked for infinity. Type: [Float\* ](../../sql-reference/data-types/float.md ).
- `y` — Fallback value. Type: [Float\* ](../../sql-reference/data-types/float.md ).
2020-03-17 15:24:11 +00:00
**Returned value**
2020-03-21 04:11:51 +00:00
- `x` if `x` is finite.
- `y` if `x` is not finite.
2020-03-17 15:24:11 +00:00
**Example**
Query:
2020-03-21 04:11:51 +00:00
SELECT 1/0 as infimum, ifNotFinite(infimum,42)
2020-03-17 15:24:11 +00:00
Result:
2020-03-21 04:11:51 +00:00
┌─infimum─┬─ifNotFinite(divide(1, 0), 42)─┐
│ inf │ 42 │
└─────────┴───────────────────────────────┘
2020-03-17 15:24:11 +00:00
2020-06-18 08:24:31 +00:00
You can get similar result by using [ternary operator ](../../sql-reference/functions/conditional-functions.md#ternary-operator ): `isFinite(x) ? x : y` .
2020-03-17 15:24:11 +00:00
2022-06-02 10:55:18 +00:00
## isNaN(x)
2017-12-28 15:13:23 +00:00
Accepts Float32 and Float64 and returns UInt8 equal to 1 if the argument is a NaN, otherwise 0.
2022-06-02 10:55:18 +00:00
## hasColumnInTable(\[‘ hostname’ \[, ‘ username’ \[, ‘ password’ \]\],\] ‘ database’ , ‘ table’ , ‘ column’ )
2017-12-28 15:13:23 +00:00
Accepts constant strings: database name, table name, and column name. Returns a UInt8 constant expression equal to 1 if there is a column, otherwise 0. If the hostname parameter is set, the test will run on a remote server.
The function throws an exception if the table does not exist.
For elements in a nested data structure, the function checks for the existence of a column. For the nested data structure itself, the function returns 0.
2022-06-02 10:55:18 +00:00
## bar
2017-12-28 15:13:23 +00:00
Allows building a unicode-art diagram.
2018-09-04 11:18:59 +00:00
`bar(x, min, max, width)` draws a band with a width proportional to `(x - min)` and equal to `width` characters when `x = max` .
2018-04-28 11:45:37 +00:00
2021-02-16 11:21:23 +00:00
**Arguments**
2018-04-28 11:45:37 +00:00
2020-03-21 04:11:51 +00:00
- `x` — Size to display.
- `min, max` — Integer constants. The value must fit in `Int64` .
- `width` — Constant, positive integer, can be fractional.
2017-12-28 15:13:23 +00:00
The band is drawn with accuracy to one eighth of a symbol.
Example:
2020-03-20 10:10:48 +00:00
``` sql
2017-12-28 15:13:23 +00:00
SELECT
toHour(EventTime) AS h,
count() AS c,
bar(c, 0, 600000, 20) AS bar
FROM test.hits
GROUP BY h
ORDER BY h ASC
```
2020-03-20 10:10:48 +00:00
``` text
2017-12-28 15:13:23 +00:00
┌──h─┬──────c─┬─bar────────────────┐
│ 0 │ 292907 │ █████████▋ │
│ 1 │ 180563 │ ██████ │
│ 2 │ 114861 │ ███▋ │
│ 3 │ 85069 │ ██▋ │
│ 4 │ 68543 │ ██▎ │
│ 5 │ 78116 │ ██▌ │
│ 6 │ 113474 │ ███▋ │
│ 7 │ 170678 │ █████▋ │
│ 8 │ 278380 │ █████████▎ │
│ 9 │ 391053 │ █████████████ │
│ 10 │ 457681 │ ███████████████▎ │
│ 11 │ 493667 │ ████████████████▍ │
│ 12 │ 509641 │ ████████████████▊ │
│ 13 │ 522947 │ █████████████████▍ │
│ 14 │ 539954 │ █████████████████▊ │
│ 15 │ 528460 │ █████████████████▌ │
│ 16 │ 539201 │ █████████████████▊ │
│ 17 │ 523539 │ █████████████████▍ │
│ 18 │ 506467 │ ████████████████▊ │
│ 19 │ 520915 │ █████████████████▎ │
│ 20 │ 521665 │ █████████████████▍ │
│ 21 │ 542078 │ ██████████████████ │
│ 22 │ 493642 │ ████████████████▍ │
│ 23 │ 400397 │ █████████████▎ │
└────┴────────┴────────────────────┘
```
2022-06-02 10:55:18 +00:00
## transform
2017-12-28 15:13:23 +00:00
Transforms a value according to the explicitly defined mapping of some elements to other ones.
There are two variations of this function:
2022-06-02 10:55:18 +00:00
### transform(x, array_from, array_to, default)
2017-12-28 15:13:23 +00:00
`x` – What to transform.
`array_from` – Constant array of values for converting.
2020-03-20 10:10:48 +00:00
`array_to` – Constant array of values to convert the values in ‘ from’ to.
2017-12-28 15:13:23 +00:00
2020-03-20 10:10:48 +00:00
`default` – Which value to use if ‘ x’ is not equal to any of the values in ‘ from’ .
2017-12-28 15:13:23 +00:00
`array_from` and `array_to` – Arrays of the same size.
Types:
`transform(T, Array(T), Array(U), U) -> U`
`T` and `U` can be numeric, string, or Date or DateTime types.
Where the same letter is indicated (T or U), for numeric types these might not be matching types, but types that have a common type.
2019-05-05 17:38:05 +00:00
For example, the first argument can have the Int64 type, while the second has the Array(UInt16) type.
2017-12-28 15:13:23 +00:00
2020-10-13 17:23:29 +00:00
If the ‘ x’ value is equal to one of the elements in the ‘ array_from’ array, it returns the existing element (that is numbered the same) from the ‘ array_to’ array. Otherwise, it returns ‘ default’ . If there are multiple matching elements in ‘ array_from’ , it returns one of the matches.
2017-12-28 15:13:23 +00:00
Example:
2020-03-20 10:10:48 +00:00
``` sql
2017-12-28 15:13:23 +00:00
SELECT
2018-03-25 02:04:22 +00:00
transform(SearchEngineID, [2, 3], ['Yandex', 'Google'], 'Other') AS title,
2017-12-28 15:13:23 +00:00
count() AS c
FROM test.hits
WHERE SearchEngineID != 0
GROUP BY title
ORDER BY c DESC
```
2020-03-20 10:10:48 +00:00
``` text
2017-12-28 15:13:23 +00:00
┌─title─────┬──────c─┐
│ Yandex │ 498635 │
│ Google │ 229872 │
│ Other │ 104472 │
└───────────┴────────┘
```
2022-06-02 10:55:18 +00:00
### transform(x, array_from, array_to)
2017-12-28 15:13:23 +00:00
2020-03-20 10:10:48 +00:00
Differs from the first variation in that the ‘ default’ argument is omitted.
2020-10-13 17:23:29 +00:00
If the ‘ x’ value is equal to one of the elements in the ‘ array_from’ array, it returns the matching element (that is numbered the same) from the ‘ array_to’ array. Otherwise, it returns ‘ x’ .
2017-12-28 15:13:23 +00:00
Types:
`transform(T, Array(T), Array(T)) -> T`
Example:
2020-03-20 10:10:48 +00:00
``` sql
2017-12-28 15:13:23 +00:00
SELECT
2022-10-21 17:08:44 +00:00
transform(domain(Referer), ['yandex.ru', 'google.ru', 'vkontakte.ru'], ['www.yandex', 'example.com', 'vk.com']) AS s,
2017-12-28 15:13:23 +00:00
count() AS c
FROM test.hits
GROUP BY domain(Referer)
ORDER BY count() DESC
LIMIT 10
```
2020-03-20 10:10:48 +00:00
``` text
2017-12-28 15:13:23 +00:00
┌─s──────────────┬───────c─┐
│ │ 2906259 │
│ www.yandex │ 867767 │
│ ███████.ru │ 313599 │
│ mail.yandex.ru │ 107147 │
│ ██████.ru │ 100355 │
│ █████████.ru │ 65040 │
│ news.yandex.ru │ 64515 │
│ ██████.net │ 59141 │
│ example.com │ 57316 │
└────────────────┴─────────┘
```
2022-11-01 09:41:49 +00:00
## formatReadableDecimalSize(x)
Accepts the size (number of bytes). Returns a rounded size with a suffix (KB, MB, etc.) as a string.
Example:
``` sql
SELECT
arrayJoin([1, 1024, 1024*1024, 192851925]) AS filesize_bytes,
formatReadableDecimalSize(filesize_bytes) AS filesize
```
``` text
┌─filesize_bytes─┬─filesize───┐
│ 1 │ 1.00 B │
2022-11-01 21:33:18 +00:00
│ 1024 │ 1.02 KB │
│ 1048576 │ 1.05 MB │
│ 192851925 │ 192.85 MB │
2022-11-01 09:41:49 +00:00
└────────────────┴────────────┘
```
2022-06-02 10:55:18 +00:00
## formatReadableSize(x)
2017-12-28 15:13:23 +00:00
Accepts the size (number of bytes). Returns a rounded size with a suffix (KiB, MiB, etc.) as a string.
Example:
2020-03-20 10:10:48 +00:00
``` sql
2017-12-28 15:13:23 +00:00
SELECT
arrayJoin([1, 1024, 1024*1024, 192851925]) AS filesize_bytes,
formatReadableSize(filesize_bytes) AS filesize
```
2020-03-20 10:10:48 +00:00
``` text
2017-12-28 15:13:23 +00:00
┌─filesize_bytes─┬─filesize───┐
│ 1 │ 1.00 B │
│ 1024 │ 1.00 KiB │
│ 1048576 │ 1.00 MiB │
│ 192851925 │ 183.92 MiB │
└────────────────┴────────────┘
```
2022-06-02 10:55:18 +00:00
## formatReadableQuantity(x)
2020-09-10 17:34:23 +00:00
Accepts the number. Returns a rounded number with a suffix (thousand, million, billion, etc.) as a string.
It is useful for reading big numbers by human.
Example:
``` sql
SELECT
arrayJoin([1024, 1234 * 1000, (4567 * 1000) * 1000, 98765432101234]) AS number,
formatReadableQuantity(number) AS number_for_humans
```
``` text
┌─────────number─┬─number_for_humans─┐
│ 1024 │ 1.02 thousand │
│ 1234000 │ 1.23 million │
│ 4567000000 │ 4.57 billion │
│ 98765432101234 │ 98.77 trillion │
└────────────────┴───────────────────┘
```
2022-06-02 10:55:18 +00:00
## formatReadableTimeDelta
2020-10-01 02:55:46 +00:00
Accepts the time delta in seconds. Returns a time delta with (year, month, day, hour, minute, second) as a string.
2020-10-03 22:38:51 +00:00
**Syntax**
``` sql
formatReadableTimeDelta(column[, maximum_unit])
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-10-03 22:38:51 +00:00
- `column` — A column with numeric time delta.
2020-10-27 11:04:03 +00:00
- `maximum_unit` — Optional. Maximum unit to show. Acceptable values seconds, minutes, hours, days, months, years.
2020-10-03 22:38:51 +00:00
2020-10-01 02:55:46 +00:00
Example:
``` sql
SELECT
2020-10-03 22:38:51 +00:00
arrayJoin([100, 12345, 432546534]) AS elapsed,
formatReadableTimeDelta(elapsed) AS time_delta
2020-10-01 02:55:46 +00:00
```
``` text
2020-10-03 22:38:51 +00:00
┌────elapsed─┬─time_delta ─────────────────────────────────────────────────────┐
2020-10-01 02:55:46 +00:00
│ 100 │ 1 minute and 40 seconds │
│ 12345 │ 3 hours, 25 minutes and 45 seconds │
│ 432546534 │ 13 years, 8 months, 17 days, 7 hours, 48 minutes and 54 seconds │
└────────────┴─────────────────────────────────────────────────────────────────┘
```
2020-10-03 22:38:51 +00:00
``` sql
SELECT
arrayJoin([100, 12345, 432546534]) AS elapsed,
formatReadableTimeDelta(elapsed, 'minutes') AS time_delta
```
``` text
┌────elapsed─┬─time_delta ─────────────────────────────────────────────────────┐
│ 100 │ 1 minute and 40 seconds │
│ 12345 │ 205 minutes and 45 seconds │
│ 432546534 │ 7209108 minutes and 54 seconds │
└────────────┴─────────────────────────────────────────────────────────────────┘
```
2022-07-10 11:39:45 +00:00
## parseTimeDelta
Parse a sequence of numbers followed by something resembling a time unit.
**Syntax**
```sql
parseTimeDelta(timestr)
```
**Arguments**
- `timestr` — A sequence of numbers followed by something resembling a time unit.
**Returned value**
- A floating-point number with the number of seconds.
**Example**
```sql
SELECT parseTimeDelta('11s+22min')
```
```text
┌─parseTimeDelta('11s+22min')─┐
│ 1331 │
└─────────────────────────────┘
```
```sql
SELECT parseTimeDelta('1yr2mo')
```
```text
┌─parseTimeDelta('1yr2mo')─┐
│ 36806400 │
└──────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## least(a, b)
2017-12-28 15:13:23 +00:00
Returns the smallest value from a and b.
2022-06-02 10:55:18 +00:00
## greatest(a, b)
2017-12-28 15:13:23 +00:00
Returns the largest value of a and b.
2022-06-02 10:55:18 +00:00
## uptime()
2017-12-28 15:13:23 +00:00
2020-03-20 10:10:48 +00:00
Returns the server’ s uptime in seconds.
2021-10-06 20:13:46 +00:00
If it is executed in the context of a distributed table, then it generates a normal column with values relevant to each shard. Otherwise it produces a constant value.
2017-12-28 15:13:23 +00:00
2022-06-02 10:55:18 +00:00
## version()
2017-12-28 15:13:23 +00:00
Returns the version of the server as a string.
2021-10-06 20:13:46 +00:00
If it is executed in the context of a distributed table, then it generates a normal column with values relevant to each shard. Otherwise it produces a constant value.
2017-12-28 15:13:23 +00:00
2022-06-02 10:55:18 +00:00
## buildId()
2021-10-08 05:05:12 +00:00
2021-10-09 18:37:28 +00:00
Returns the build ID generated by a compiler for the running ClickHouse server binary.
2021-10-08 05:05:12 +00:00
If it is executed in the context of a distributed table, then it generates a normal column with values relevant to each shard. Otherwise it produces a constant value.
2022-06-02 10:55:18 +00:00
## blockNumber
2019-01-30 10:39:46 +00:00
Returns the sequence number of the data block where the row is located.
2022-06-02 10:55:18 +00:00
## rowNumberInBlock
2019-01-30 10:39:46 +00:00
Returns the ordinal number of the row in the data block. Different data blocks are always recalculated.
2022-06-02 10:55:18 +00:00
## rowNumberInAllBlocks()
2017-12-28 15:13:23 +00:00
Returns the ordinal number of the row in the data block. This function only considers the affected data blocks.
2022-06-02 10:55:18 +00:00
## neighbor
2019-08-13 13:11:24 +00:00
2019-11-08 13:15:45 +00:00
The window function that provides access to a row at a specified offset which comes before or after the current row of a given column.
2019-08-13 13:11:24 +00:00
2019-11-08 13:15:45 +00:00
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2019-11-12 08:01:46 +00:00
neighbor(column, offset[, default_value])
2019-11-08 13:15:45 +00:00
```
2019-08-13 13:11:24 +00:00
The result of the function depends on the affected data blocks and the order of data in the block.
2020-10-28 14:13:27 +00:00
2022-04-09 13:29:05 +00:00
:::warning
It can reach the neighbor rows only inside the currently processed data block.
:::
2020-10-28 14:13:27 +00:00
The rows order used during the calculation of `neighbor` can differ from the order of rows returned to the user.
2021-02-28 17:10:44 +00:00
To prevent that you can make a subquery with [ORDER BY ](../../sql-reference/statements/select/order-by.md ) and call the function from outside the subquery.
2019-08-13 13:11:24 +00:00
2021-02-15 21:22:10 +00:00
**Arguments**
2019-11-08 13:15:45 +00:00
2020-03-21 04:11:51 +00:00
- `column` — A column name or scalar expression.
2020-04-30 18:19:18 +00:00
- `offset` — The number of rows forwards or backwards from the current row of `column` . [Int64 ](../../sql-reference/data-types/int-uint.md ).
2020-03-21 04:11:51 +00:00
- `default_value` — Optional. The value to be returned if offset goes beyond the scope of the block. Type of data blocks affected.
2019-11-08 13:15:45 +00:00
2019-11-12 08:01:46 +00:00
**Returned values**
2019-11-08 13:15:45 +00:00
2020-03-21 04:11:51 +00:00
- Value for `column` in `offset` distance from current row if `offset` value is not outside block bounds.
- Default value for `column` if `offset` value is outside block bounds. If `default_value` is given, then it will be used.
2019-11-08 13:15:45 +00:00
2019-11-12 08:01:46 +00:00
Type: type of data blocks affected or default value type.
2019-11-08 13:15:45 +00:00
**Example**
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-08 13:15:45 +00:00
SELECT number, neighbor(number, 2) FROM system.numbers LIMIT 10;
```
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-11-08 13:15:45 +00:00
┌─number─┬─neighbor(number, 2)─┐
│ 0 │ 2 │
│ 1 │ 3 │
│ 2 │ 4 │
│ 3 │ 5 │
│ 4 │ 6 │
│ 5 │ 7 │
│ 6 │ 8 │
│ 7 │ 9 │
│ 8 │ 0 │
│ 9 │ 0 │
└────────┴─────────────────────┘
```
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-08 13:15:45 +00:00
SELECT number, neighbor(number, 2, 999) FROM system.numbers LIMIT 10;
```
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-11-08 13:15:45 +00:00
┌─number─┬─neighbor(number, 2, 999)─┐
│ 0 │ 2 │
│ 1 │ 3 │
│ 2 │ 4 │
│ 3 │ 5 │
│ 4 │ 6 │
│ 5 │ 7 │
│ 6 │ 8 │
│ 7 │ 9 │
│ 8 │ 999 │
│ 9 │ 999 │
└────────┴──────────────────────────┘
```
2019-08-13 13:11:24 +00:00
This function can be used to compute year-over-year metric value:
2019-11-08 13:15:45 +00:00
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-08-13 13:11:24 +00:00
WITH toDate('2018-01-01') AS start_date
SELECT
toStartOfMonth(start_date + (number * 32)) AS month,
toInt32(month) % 100 AS money,
2019-08-23 01:31:04 +00:00
neighbor(money, -12) AS prev_year,
2019-08-13 13:11:24 +00:00
round(prev_year / money, 2) AS year_over_year
FROM numbers(16)
```
2019-11-08 13:15:45 +00:00
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-08-13 13:11:24 +00:00
┌──────month─┬─money─┬─prev_year─┬─year_over_year─┐
│ 2018-01-01 │ 32 │ 0 │ 0 │
│ 2018-02-01 │ 63 │ 0 │ 0 │
│ 2018-03-01 │ 91 │ 0 │ 0 │
│ 2018-04-01 │ 22 │ 0 │ 0 │
│ 2018-05-01 │ 52 │ 0 │ 0 │
│ 2018-06-01 │ 83 │ 0 │ 0 │
│ 2018-07-01 │ 13 │ 0 │ 0 │
│ 2018-08-01 │ 44 │ 0 │ 0 │
│ 2018-09-01 │ 75 │ 0 │ 0 │
│ 2018-10-01 │ 5 │ 0 │ 0 │
│ 2018-11-01 │ 36 │ 0 │ 0 │
│ 2018-12-01 │ 66 │ 0 │ 0 │
│ 2019-01-01 │ 97 │ 32 │ 0.33 │
│ 2019-02-01 │ 28 │ 63 │ 2.25 │
│ 2019-03-01 │ 56 │ 91 │ 1.62 │
│ 2019-04-01 │ 87 │ 22 │ 0.25 │
└────────────┴───────┴───────────┴────────────────┘
```
2022-06-02 10:55:18 +00:00
## runningDifference(x)
2017-12-28 15:13:23 +00:00
Calculates the difference between successive row values in the data block.
Returns 0 for the first row and the difference from the previous row for each subsequent row.
2022-04-09 13:29:05 +00:00
:::warning
It can reach the previous row only inside the currently processed data block.
:::
2021-06-22 10:14:24 +00:00
2017-12-28 15:13:23 +00:00
The result of the function depends on the affected data blocks and the order of data in the block.
2020-10-28 14:13:27 +00:00
The rows order used during the calculation of `runningDifference` can differ from the order of rows returned to the user.
2021-02-28 17:10:44 +00:00
To prevent that you can make a subquery with [ORDER BY ](../../sql-reference/statements/select/order-by.md ) and call the function from outside the subquery.
2017-12-28 15:13:23 +00:00
Example:
2020-03-20 10:10:48 +00:00
``` sql
2017-12-28 15:13:23 +00:00
SELECT
EventID,
EventTime,
runningDifference(EventTime) AS delta
FROM
(
SELECT
EventID,
EventTime
FROM events
WHERE EventDate = '2016-11-24'
ORDER BY EventTime ASC
LIMIT 5
)
```
2020-03-20 10:10:48 +00:00
``` text
2017-12-28 15:13:23 +00:00
┌─EventID─┬───────────EventTime─┬─delta─┐
│ 1106 │ 2016-11-24 00:00:04 │ 0 │
│ 1107 │ 2016-11-24 00:00:05 │ 1 │
│ 1108 │ 2016-11-24 00:00:05 │ 0 │
│ 1109 │ 2016-11-24 00:00:09 │ 4 │
│ 1110 │ 2016-11-24 00:00:10 │ 1 │
└─────────┴─────────────────────┴───────┘
```
2019-09-02 20:15:40 +00:00
Please note - block size affects the result. With each new block, the `runningDifference` state is reset.
2020-03-20 10:10:48 +00:00
``` sql
2019-09-02 20:15:40 +00:00
SELECT
number,
runningDifference(number + 1) AS diff
FROM numbers(100000)
WHERE diff != 1
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-09-02 20:15:40 +00:00
┌─number─┬─diff─┐
│ 0 │ 0 │
└────────┴──────┘
┌─number─┬─diff─┐
│ 65536 │ 0 │
└────────┴──────┘
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` sql
2019-12-28 23:11:02 +00:00
set max_block_size=100000 -- default value is 65536!
2019-09-02 20:15:40 +00:00
SELECT
number,
runningDifference(number + 1) AS diff
FROM numbers(100000)
WHERE diff != 1
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-09-02 20:15:40 +00:00
┌─number─┬─diff─┐
│ 0 │ 0 │
└────────┴──────┘
```
2022-06-02 10:55:18 +00:00
## runningDifferenceStartingWithFirstValue
2019-01-30 10:39:46 +00:00
2021-03-14 18:03:56 +00:00
Same as for [runningDifference ](./other-functions.md#other_functions-runningdifference ), the difference is the value of the first row, returned the value of the first row, and each subsequent row returns the difference from the previous row.
2019-01-30 10:39:46 +00:00
2022-06-02 10:55:18 +00:00
## runningConcurrency
2020-12-21 03:08:37 +00:00
2021-03-14 16:27:58 +00:00
Calculates the number of concurrent events.
2021-06-22 10:14:24 +00:00
Each event has a start time and an end time. The start time is included in the event, while the end time is excluded. Columns with a start time and an end time must be of the same data type.
2021-03-14 16:27:58 +00:00
The function calculates the total number of active (concurrent) events for each event start time.
2020-12-21 03:08:37 +00:00
2021-03-14 17:33:12 +00:00
2022-04-09 13:29:05 +00:00
:::warning
Events must be ordered by the start time in ascending order. If this requirement is violated the function raises an exception. Every data block is processed separately. If events from different data blocks overlap then they can not be processed correctly.
:::
2021-03-14 17:33:12 +00:00
2020-12-21 03:08:37 +00:00
**Syntax**
``` sql
2021-03-08 19:44:45 +00:00
runningConcurrency(start, end)
2020-12-21 03:08:37 +00:00
```
2021-03-08 19:44:45 +00:00
**Arguments**
2020-12-21 03:08:37 +00:00
2021-03-10 20:46:29 +00:00
- `start` — A column with the start time of events. [Date ](../../sql-reference/data-types/date.md ), [DateTime ](../../sql-reference/data-types/datetime.md ), or [DateTime64 ](../../sql-reference/data-types/datetime64.md ).
- `end` — A column with the end time of events. [Date ](../../sql-reference/data-types/date.md ), [DateTime ](../../sql-reference/data-types/datetime.md ), or [DateTime64 ](../../sql-reference/data-types/datetime64.md ).
2020-12-21 03:08:37 +00:00
**Returned values**
2021-03-08 19:44:45 +00:00
- The number of concurrent events at each event start time.
2020-12-21 03:08:37 +00:00
Type: [UInt32 ](../../sql-reference/data-types/int-uint.md )
**Example**
2021-03-10 20:46:29 +00:00
Consider the table:
2020-12-21 03:08:37 +00:00
``` text
2021-03-08 19:44:45 +00:00
┌──────start─┬────────end─┐
│ 2021-03-03 │ 2021-03-11 │
│ 2021-03-06 │ 2021-03-12 │
│ 2021-03-07 │ 2021-03-08 │
│ 2021-03-11 │ 2021-03-12 │
└────────────┴────────────┘
2020-12-21 03:08:37 +00:00
```
Query:
``` sql
2021-03-08 19:44:45 +00:00
SELECT start, runningConcurrency(start, end) FROM example_table;
2020-12-21 03:08:37 +00:00
```
Result:
``` text
2021-03-08 19:44:45 +00:00
┌──────start─┬─runningConcurrency(start, end)─┐
│ 2021-03-03 │ 1 │
│ 2021-03-06 │ 2 │
│ 2021-03-07 │ 3 │
│ 2021-03-11 │ 2 │
└────────────┴────────────────────────────────┘
2020-12-21 03:08:37 +00:00
```
2022-06-02 10:55:18 +00:00
## MACNumToString(num)
2017-12-28 15:13:23 +00:00
Accepts a UInt64 number. Interprets it as a MAC address in big endian. Returns a string containing the corresponding MAC address in the format AA:BB:CC:DD:EE:FF (colon-separated numbers in hexadecimal form).
2022-06-02 10:55:18 +00:00
## MACStringToNum(s)
2017-12-28 15:13:23 +00:00
The inverse function of MACNumToString. If the MAC address has an invalid format, it returns 0.
2022-06-02 10:55:18 +00:00
## MACStringToOUI(s)
2017-12-28 15:13:23 +00:00
Accepts a MAC address in the format AA:BB:CC:DD:EE:FF (colon-separated numbers in hexadecimal form). Returns the first three octets as a UInt64 number. If the MAC address has an invalid format, it returns 0.
2018-09-04 11:18:59 +00:00
2022-06-02 10:55:18 +00:00
## getSizeOfEnumType
2018-09-04 11:18:59 +00:00
2020-04-30 18:19:18 +00:00
Returns the number of fields in [Enum ](../../sql-reference/data-types/enum.md ).
2018-09-04 11:18:59 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
getSizeOfEnumType(value)
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2018-09-04 11:18:59 +00:00
2020-03-21 04:11:51 +00:00
- `value` — Value of type `Enum` .
2018-09-04 11:18:59 +00:00
**Returned values**
2020-03-21 04:11:51 +00:00
- The number of fields with `Enum` input values.
- An exception is thrown if the type is not `Enum` .
2018-09-04 11:18:59 +00:00
**Example**
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
SELECT getSizeOfEnumType( CAST('a' AS Enum8('a' = 1, 'b' = 2) ) ) AS x
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─x─┐
│ 2 │
└───┘
```
2022-06-02 10:55:18 +00:00
## blockSerializedSize
2020-02-01 19:41:35 +00:00
Returns size on disk (without taking into account compression).
2020-03-20 10:10:48 +00:00
``` sql
2020-02-01 19:41:35 +00:00
blockSerializedSize(value[, value[, ...]])
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-02-01 19:41:35 +00:00
2020-03-21 04:11:51 +00:00
- `value` — Any value.
2020-02-01 19:41:35 +00:00
**Returned values**
2020-03-21 04:11:51 +00:00
- The number of bytes that will be written to disk for block of values (without compression).
2020-02-01 19:41:35 +00:00
**Example**
2020-06-29 09:48:18 +00:00
Query:
2020-03-20 10:10:48 +00:00
``` sql
2020-02-01 19:41:35 +00:00
SELECT blockSerializedSize(maxState(1)) as x
```
2020-03-20 10:10:48 +00:00
2020-07-09 15:10:35 +00:00
Result:
2020-06-29 09:48:18 +00:00
2020-03-20 10:10:48 +00:00
``` text
2020-02-01 19:41:35 +00:00
┌─x─┐
│ 2 │
└───┘
```
2022-06-02 10:55:18 +00:00
## toColumnTypeName
2018-09-04 11:18:59 +00:00
Returns the name of the class that represents the data type of the column in RAM.
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
toColumnTypeName(value)
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2018-09-04 11:18:59 +00:00
2020-03-21 04:11:51 +00:00
- `value` — Any type of value.
2018-09-04 11:18:59 +00:00
**Returned values**
2020-03-21 04:11:51 +00:00
- A string with the name of the class that is used for representing the `value` data type in RAM.
2018-09-04 11:18:59 +00:00
2020-03-20 10:10:48 +00:00
**Example of the difference between`toTypeName ' and ' toColumnTypeName`**
2018-09-04 11:18:59 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-09-23 15:31:46 +00:00
SELECT toTypeName(CAST('2018-01-01 01:02:03' AS DateTime))
2018-09-04 11:18:59 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─toTypeName(CAST('2018-01-01 01:02:03', 'DateTime'))─┐
│ DateTime │
└─────────────────────────────────────────────────────┘
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` sql
2019-09-23 15:31:46 +00:00
SELECT toColumnTypeName(CAST('2018-01-01 01:02:03' AS DateTime))
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─toColumnTypeName(CAST('2018-01-01 01:02:03', 'DateTime'))─┐
│ Const(UInt32) │
└───────────────────────────────────────────────────────────┘
```
The example shows that the `DateTime` data type is stored in memory as `Const(UInt32)` .
2022-06-02 10:55:18 +00:00
## dumpColumnStructure
2018-09-04 11:18:59 +00:00
Outputs a detailed description of data structures in RAM
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
dumpColumnStructure(value)
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2018-09-04 11:18:59 +00:00
2020-03-21 04:11:51 +00:00
- `value` — Any type of value.
2018-09-04 11:18:59 +00:00
**Returned values**
2020-03-21 04:11:51 +00:00
- A string describing the structure that is used for representing the `value` data type in RAM.
2018-09-04 11:18:59 +00:00
**Example**
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
SELECT dumpColumnStructure(CAST('2018-01-01 01:02:03', 'DateTime'))
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─dumpColumnStructure(CAST('2018-01-01 01:02:03', 'DateTime'))─┐
│ DateTime, Const(size = 1, UInt32(size = 1)) │
└──────────────────────────────────────────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## defaultValueOfArgumentType
2018-09-04 11:18:59 +00:00
Outputs the default value for the data type.
Does not include default values for custom columns set by the user.
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
defaultValueOfArgumentType(expression)
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2018-09-04 11:18:59 +00:00
2020-03-21 04:11:51 +00:00
- `expression` — Arbitrary type of value or an expression that results in a value of an arbitrary type.
2018-09-04 11:18:59 +00:00
**Returned values**
2020-03-21 04:11:51 +00:00
- `0` for numbers.
- Empty string for strings.
2020-04-30 18:19:18 +00:00
- `ᴺᵁᴸᴸ` for [Nullable ](../../sql-reference/data-types/nullable.md ).
2018-09-04 11:18:59 +00:00
**Example**
2020-03-20 10:10:48 +00:00
``` sql
2019-09-23 15:31:46 +00:00
SELECT defaultValueOfArgumentType( CAST(1 AS Int8) )
2018-09-04 11:18:59 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─defaultValueOfArgumentType(CAST(1, 'Int8'))─┐
│ 0 │
└─────────────────────────────────────────────┘
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` sql
2019-09-23 15:31:46 +00:00
SELECT defaultValueOfArgumentType( CAST(1 AS Nullable(Int8) ) )
```
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─defaultValueOfArgumentType(CAST(1, 'Nullable(Int8)'))─┐
│ ᴺᵁᴸᴸ │
└───────────────────────────────────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## defaultValueOfTypeName
2020-08-19 07:52:33 +00:00
Outputs the default value for given type name.
Does not include default values for custom columns set by the user.
``` sql
defaultValueOfTypeName(type)
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2020-08-19 07:52:33 +00:00
- `type` — A string representing a type name.
**Returned values**
- `0` for numbers.
- Empty string for strings.
- `ᴺᵁᴸᴸ` for [Nullable ](../../sql-reference/data-types/nullable.md ).
**Example**
``` sql
SELECT defaultValueOfTypeName('Int8')
```
``` text
┌─defaultValueOfTypeName('Int8')─┐
│ 0 │
└────────────────────────────────┘
```
``` sql
SELECT defaultValueOfTypeName('Nullable(Int8)')
```
``` text
┌─defaultValueOfTypeName('Nullable(Int8)')─┐
│ ᴺᵁᴸᴸ │
└──────────────────────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## indexHint
2021-02-28 07:25:56 +00:00
The function is intended for debugging and introspection purposes. The function ignores it's argument and always returns 1. Arguments are not even evaluated.
But for the purpose of index analysis, the argument of this function is analyzed as if it was present directly without being wrapped inside `indexHint` function. This allows to select data in index ranges by the corresponding condition but without further filtering by this condition. The index in ClickHouse is sparse and using `indexHint` will yield more data than specifying the same condition directly.
**Syntax**
```sql
SELECT * FROM table WHERE indexHint(< expression > )
```
**Returned value**
2021-09-22 00:22:57 +00:00
1. Type: [Uint8 ](https://clickhouse.com/docs/en/data_types/int_uint/#diapazony-uint ).
2021-02-28 07:25:56 +00:00
**Example**
2022-04-11 05:01:34 +00:00
Here is the example of test data from the table [ontime ](../../getting-started/example-datasets/ontime.md ).
2021-02-28 07:25:56 +00:00
Input table:
```sql
SELECT count() FROM ontime
```
```text
┌─count()─┐
│ 4276457 │
└─────────┘
```
The table has indexes on the fields `(FlightDate, (Year, FlightDate))` .
Create a query, where the index is not used.
Query:
```sql
SELECT FlightDate AS k, count() FROM ontime GROUP BY k ORDER BY k
```
ClickHouse processed the entire table (`Processed 4.28 million rows`).
Result:
```text
┌──────────k─┬─count()─┐
│ 2017-01-01 │ 13970 │
│ 2017-01-02 │ 15882 │
........................
│ 2017-09-28 │ 16411 │
│ 2017-09-29 │ 16384 │
│ 2017-09-30 │ 12520 │
└────────────┴─────────┘
```
To apply the index, select a specific date.
Query:
```sql
SELECT FlightDate AS k, count() FROM ontime WHERE k = '2017-09-15' GROUP BY k ORDER BY k
```
By using the index, ClickHouse processed a significantly smaller number of rows (`Processed 32.74 thousand rows`).
Result:
```text
┌──────────k─┬─count()─┐
│ 2017-09-15 │ 16428 │
└────────────┴─────────┘
```
Now wrap the expression `k = '2017-09-15'` into `indexHint` function.
Query:
```sql
SELECT
FlightDate AS k,
count()
FROM ontime
WHERE indexHint(k = '2017-09-15')
GROUP BY k
ORDER BY k ASC
```
ClickHouse used the index in the same way as the previous time (`Processed 32.74 thousand rows`).
The expression `k = '2017-09-15'` was not used when generating the result.
In examle the `indexHint` function allows to see adjacent dates.
Result:
```text
┌──────────k─┬─count()─┐
│ 2017-09-14 │ 7071 │
│ 2017-09-15 │ 16428 │
│ 2017-09-16 │ 1077 │
│ 2017-09-30 │ 8167 │
└────────────┴─────────┘
```
2022-06-02 10:55:18 +00:00
## replicate
2018-09-04 11:18:59 +00:00
Creates an array with a single value.
2020-06-18 08:24:31 +00:00
Used for internal implementation of [arrayJoin ](../../sql-reference/functions/array-join.md#functions_arrayjoin ).
2018-09-04 11:18:59 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-09-30 09:17:55 +00:00
SELECT replicate(x, arr);
2018-09-04 11:18:59 +00:00
```
2021-02-15 21:22:10 +00:00
**Arguments:**
2018-09-04 11:18:59 +00:00
2020-03-21 04:11:51 +00:00
- `arr` — Original array. ClickHouse creates a new array of the same length as the original and fills it with the value `x` .
- `x` — The value that the resulting array will be filled with.
2018-09-04 11:18:59 +00:00
2019-09-30 09:17:55 +00:00
**Returned value**
An array filled with the value `x` .
2018-09-04 11:18:59 +00:00
2019-09-30 09:17:55 +00:00
Type: `Array` .
2018-09-04 11:18:59 +00:00
**Example**
2019-09-30 09:17:55 +00:00
Query:
2020-03-20 10:10:48 +00:00
``` sql
2018-09-04 11:18:59 +00:00
SELECT replicate(1, ['a', 'b', 'c'])
2019-09-23 15:31:46 +00:00
```
2019-09-30 09:17:55 +00:00
Result:
2020-03-20 10:10:48 +00:00
``` text
2018-09-04 11:18:59 +00:00
┌─replicate(1, ['a', 'b', 'c'])─┐
│ [1,1,1] │
└───────────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## filesystemAvailable
2019-01-30 10:39:46 +00:00
2020-02-02 21:41:23 +00:00
Returns amount of remaining space on the filesystem where the files of the databases located. It is always smaller than total free space ([filesystemFree](#filesystemfree)) because some space is reserved for OS.
2019-01-30 10:39:46 +00:00
2019-10-07 19:32:18 +00:00
**Syntax**
2019-07-18 11:04:45 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-07-18 11:04:45 +00:00
filesystemAvailable()
```
2019-10-07 19:32:18 +00:00
**Returned value**
2019-07-18 11:04:45 +00:00
2020-03-21 04:11:51 +00:00
- The amount of remaining space available in bytes.
2019-07-18 11:04:45 +00:00
2020-04-30 18:19:18 +00:00
Type: [UInt64 ](../../sql-reference/data-types/int-uint.md ).
2019-07-18 11:04:45 +00:00
**Example**
2019-10-07 19:32:18 +00:00
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-11 11:41:33 +00:00
SELECT formatReadableSize(filesystemAvailable()) AS "Available space", toTypeName(filesystemAvailable()) AS "Type";
2019-07-18 11:04:45 +00:00
```
2019-10-07 19:32:18 +00:00
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-10-07 19:32:18 +00:00
┌─Available space─┬─Type───┐
2019-11-11 11:41:33 +00:00
│ 30.75 GiB │ UInt64 │
2019-10-07 19:32:18 +00:00
└─────────────────┴────────┘
```
2022-06-02 10:55:18 +00:00
## filesystemFree
2019-10-07 19:32:18 +00:00
2020-02-02 21:41:23 +00:00
Returns total amount of the free space on the filesystem where the files of the databases located. See also `filesystemAvailable`
2019-10-07 19:32:18 +00:00
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2019-11-11 11:41:33 +00:00
filesystemFree()
2019-10-07 19:32:18 +00:00
```
**Returned value**
2019-07-18 11:04:45 +00:00
2020-03-21 04:11:51 +00:00
- Amount of free space in bytes.
2019-07-18 11:04:45 +00:00
2020-04-30 18:19:18 +00:00
Type: [UInt64 ](../../sql-reference/data-types/int-uint.md ).
2019-07-18 11:04:45 +00:00
**Example**
2019-10-07 19:32:18 +00:00
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-11 11:41:33 +00:00
SELECT formatReadableSize(filesystemFree()) AS "Free space", toTypeName(filesystemFree()) AS "Type";
2019-07-18 11:04:45 +00:00
```
2019-01-30 10:39:46 +00:00
2019-10-07 19:32:18 +00:00
Result:
2019-01-30 10:39:46 +00:00
2020-03-20 10:10:48 +00:00
``` text
2019-11-11 11:41:33 +00:00
┌─Free space─┬─Type───┐
│ 32.39 GiB │ UInt64 │
└────────────┴────────┘
2019-07-18 11:04:45 +00:00
```
2019-01-30 10:39:46 +00:00
2022-06-02 10:55:18 +00:00
## filesystemCapacity
2019-10-07 19:32:18 +00:00
2020-04-30 18:19:18 +00:00
Returns the capacity of the filesystem in bytes. For evaluation, the [path ](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-path ) to the data directory must be configured.
2019-10-07 19:32:18 +00:00
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2019-11-11 11:42:53 +00:00
filesystemCapacity()
2019-10-07 19:32:18 +00:00
```
**Returned value**
2020-03-21 04:11:51 +00:00
- Capacity information of the filesystem in bytes.
2019-10-07 19:32:18 +00:00
2020-04-30 18:19:18 +00:00
Type: [UInt64 ](../../sql-reference/data-types/int-uint.md ).
2019-10-07 19:32:18 +00:00
**Example**
Query:
2020-03-20 10:10:48 +00:00
``` sql
2019-11-11 11:41:33 +00:00
SELECT formatReadableSize(filesystemCapacity()) AS "Capacity", toTypeName(filesystemCapacity()) AS "Type"
2019-07-18 11:04:45 +00:00
```
2019-01-30 10:39:46 +00:00
2019-10-07 19:32:18 +00:00
Result:
2019-01-30 10:39:46 +00:00
2020-03-20 10:10:48 +00:00
``` text
2019-11-11 11:41:33 +00:00
┌─Capacity──┬─Type───┐
│ 39.32 GiB │ UInt64 │
└───────────┴────────┘
2019-10-07 19:32:18 +00:00
```
2019-01-30 10:39:46 +00:00
2022-06-02 10:55:18 +00:00
## initializeAggregation
2021-06-22 10:14:24 +00:00
Calculates result of aggregate function based on single value. It is intended to use this function to initialize aggregate functions with combinator [-State ](../../sql-reference/aggregate-functions/combinators.md#agg-functions-combinator-state ). You can create states of aggregate functions and insert them to columns of type [AggregateFunction ](../../sql-reference/data-types/aggregatefunction.md#data-type-aggregatefunction ) or use initialized aggregates as default values.
**Syntax**
``` sql
initializeAggregation (aggregate_function, arg1, arg2, ..., argN)
```
**Arguments**
- `aggregate_function` — Name of the aggregation function to initialize. [String ](../../sql-reference/data-types/string.md ).
- `arg` — Arguments of aggregate function.
**Returned value(s)**
- Result of aggregation for every row passed to the function.
The return type is the same as the return type of function, that `initializeAgregation` takes as first argument.
**Example**
Query:
```sql
SELECT uniqMerge(state) FROM (SELECT initializeAggregation('uniqState', number % 3) AS state FROM numbers(10000));
```
Result:
```text
┌─uniqMerge(state)─┐
│ 3 │
└──────────────────┘
```
Query:
```sql
SELECT finalizeAggregation(state), toTypeName(state) FROM (SELECT initializeAggregation('sumState', number % 3) AS state FROM numbers(5));
```
Result:
```text
┌─finalizeAggregation(state)─┬─toTypeName(state)─────────────┐
│ 0 │ AggregateFunction(sum, UInt8) │
│ 1 │ AggregateFunction(sum, UInt8) │
│ 2 │ AggregateFunction(sum, UInt8) │
│ 0 │ AggregateFunction(sum, UInt8) │
│ 1 │ AggregateFunction(sum, UInt8) │
└────────────────────────────┴───────────────────────────────┘
```
Example with `AggregatingMergeTree` table engine and `AggregateFunction` column:
```sql
CREATE TABLE metrics
(
key UInt64,
value AggregateFunction(sum, UInt64) DEFAULT initializeAggregation('sumState', toUInt64(0))
)
ENGINE = AggregatingMergeTree
ORDER BY key
```
```sql
INSERT INTO metrics VALUES (0, initializeAggregation('sumState', toUInt64(42)))
```
**See Also**
- [arrayReduce ](../../sql-reference/functions/array-functions.md#arrayreduce )
2022-06-02 10:55:18 +00:00
## finalizeAggregation
2019-01-30 10:39:46 +00:00
2020-12-25 15:50:38 +00:00
Takes state of aggregate function. Returns result of aggregation (or finalized state when using[-State](../../sql-reference/aggregate-functions/combinators.md#agg-functions-combinator-state) combinator).
2020-12-23 01:24:05 +00:00
2021-06-22 10:14:24 +00:00
**Syntax**
2020-12-23 01:24:05 +00:00
``` sql
2020-12-24 08:25:47 +00:00
finalizeAggregation(state)
2020-12-23 01:24:05 +00:00
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-12-23 01:24:05 +00:00
2020-12-23 08:52:45 +00:00
- `state` — State of aggregation. [AggregateFunction ](../../sql-reference/data-types/aggregatefunction.md#data-type-aggregatefunction ).
2020-12-23 01:24:05 +00:00
**Returned value(s)**
- Value/values that was aggregated.
2021-06-22 10:14:24 +00:00
Type: Value of any types that was aggregated.
2020-12-23 01:24:05 +00:00
**Examples**
Query:
```sql
SELECT finalizeAggregation(( SELECT countState(number) FROM numbers(10)));
```
Result:
```text
┌─finalizeAggregation(_subquery16)─┐
│ 10 │
└──────────────────────────────────┘
```
Query:
```sql
SELECT finalizeAggregation(( SELECT sumState(number) FROM numbers(10)));
```
Result:
```text
┌─finalizeAggregation(_subquery20)─┐
│ 45 │
└──────────────────────────────────┘
```
2021-06-22 10:14:24 +00:00
Note that `NULL` values are ignored.
2020-12-23 01:24:05 +00:00
Query:
```sql
SELECT finalizeAggregation(arrayReduce('anyState', [NULL, 2, 3]));
```
Result:
```text
┌─finalizeAggregation(arrayReduce('anyState', [NULL, 2, 3]))─┐
│ 2 │
└────────────────────────────────────────────────────────────┘
```
Combined example:
Query:
```sql
WITH initializeAggregation('sumState', number) AS one_row_sum_state
SELECT
number,
finalizeAggregation(one_row_sum_state) AS one_row_sum,
runningAccumulate(one_row_sum_state) AS cumulative_sum
2020-12-24 08:25:47 +00:00
FROM numbers(10);
2020-12-23 01:24:05 +00:00
```
Result:
```text
┌─number─┬─one_row_sum─┬─cumulative_sum─┐
│ 0 │ 0 │ 0 │
│ 1 │ 1 │ 1 │
│ 2 │ 2 │ 3 │
│ 3 │ 3 │ 6 │
│ 4 │ 4 │ 10 │
│ 5 │ 5 │ 15 │
│ 6 │ 6 │ 21 │
│ 7 │ 7 │ 28 │
│ 8 │ 8 │ 36 │
│ 9 │ 9 │ 45 │
└────────┴─────────────┴────────────────┘
```
2019-01-30 10:39:46 +00:00
2021-06-22 10:14:24 +00:00
**See Also**
2020-12-23 09:05:59 +00:00
- [arrayReduce ](../../sql-reference/functions/array-functions.md#arrayreduce )
2021-06-22 10:14:24 +00:00
- [initializeAggregation ](#initializeaggregation )
2020-12-23 08:08:57 +00:00
2022-06-02 10:55:18 +00:00
## runningAccumulate
2019-01-30 10:39:46 +00:00
2020-07-09 15:10:35 +00:00
Accumulates states of an aggregate function for each row of a data block.
2020-07-01 13:36:41 +00:00
2022-04-09 13:29:05 +00:00
:::warning
The state is reset for each new data block.
:::
2020-07-01 13:36:41 +00:00
**Syntax**
2020-07-09 15:10:35 +00:00
``` sql
2020-07-01 13:36:41 +00:00
runningAccumulate(agg_state[, grouping]);
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-07-01 13:36:41 +00:00
2020-07-09 15:10:35 +00:00
- `agg_state` — State of the aggregate function. [AggregateFunction ](../../sql-reference/data-types/aggregatefunction.md#data-type-aggregatefunction ).
- `grouping` — Grouping key. Optional. The state of the function is reset if the `grouping` value is changed. It can be any of the [supported data types ](../../sql-reference/data-types/index.md ) for which the equality operator is defined.
2020-07-01 13:36:41 +00:00
**Returned value**
2020-07-09 15:10:35 +00:00
- Each resulting row contains a result of the aggregate function, accumulated for all the input rows from 0 to the current position. `runningAccumulate` resets states for each new data block or when the `grouping` value changes.
2020-07-01 13:36:41 +00:00
Type depends on the aggregate function used.
**Examples**
Consider how you can use `runningAccumulate` to find the cumulative sum of numbers without and with grouping.
Query:
2020-07-09 15:10:35 +00:00
``` sql
2020-07-01 13:36:41 +00:00
SELECT k, runningAccumulate(sum_k) AS res FROM (SELECT number as k, sumState(k) AS sum_k FROM numbers(10) GROUP BY k ORDER BY k);
```
Result:
2020-07-09 15:10:35 +00:00
``` text
2020-07-01 13:36:41 +00:00
┌─k─┬─res─┐
│ 0 │ 0 │
│ 1 │ 1 │
│ 2 │ 3 │
│ 3 │ 6 │
│ 4 │ 10 │
│ 5 │ 15 │
│ 6 │ 21 │
│ 7 │ 28 │
│ 8 │ 36 │
│ 9 │ 45 │
└───┴─────┘
```
2020-07-09 15:10:35 +00:00
The subquery generates `sumState` for every number from `0` to `9` . `sumState` returns the state of the [sum ](../../sql-reference/aggregate-functions/reference/sum.md ) function that contains the sum of a single number.
2019-01-30 10:39:46 +00:00
2020-07-01 13:36:41 +00:00
The whole query does the following:
2020-07-09 15:10:35 +00:00
1. For the first row, `runningAccumulate` takes `sumState(0)` and returns `0` .
2. For the second row, the function merges `sumState(0)` and `sumState(1)` resulting in `sumState(0 + 1)` , and returns `1` as a result.
3. For the third row, the function merges `sumState(0 + 1)` and `sumState(2)` resulting in `sumState(0 + 1 + 2)` , and returns `3` as a result.
4. The actions are repeated until the block ends.
2020-07-01 13:36:41 +00:00
The following example shows the `groupping` parameter usage:
Query:
2020-07-09 15:10:35 +00:00
``` sql
SELECT
2020-07-01 13:36:41 +00:00
grouping,
item,
runningAccumulate(state, grouping) AS res
2020-07-09 15:10:35 +00:00
FROM
2020-07-01 13:36:41 +00:00
(
2020-07-09 15:10:35 +00:00
SELECT
2020-07-01 13:36:41 +00:00
toInt8(number / 4) AS grouping,
number AS item,
sumState(number) AS state
FROM numbers(15)
GROUP BY item
ORDER BY item ASC
);
```
Result:
2020-07-09 15:10:35 +00:00
``` text
2020-07-01 13:36:41 +00:00
┌─grouping─┬─item─┬─res─┐
│ 0 │ 0 │ 0 │
│ 0 │ 1 │ 1 │
│ 0 │ 2 │ 3 │
│ 0 │ 3 │ 6 │
│ 1 │ 4 │ 4 │
│ 1 │ 5 │ 9 │
│ 1 │ 6 │ 15 │
│ 1 │ 7 │ 22 │
│ 2 │ 8 │ 8 │
│ 2 │ 9 │ 17 │
│ 2 │ 10 │ 27 │
│ 2 │ 11 │ 38 │
│ 3 │ 12 │ 12 │
│ 3 │ 13 │ 25 │
│ 3 │ 14 │ 39 │
└──────────┴──────┴─────┘
```
As you can see, `runningAccumulate` merges states for each group of rows separately.
2019-01-30 10:39:46 +00:00
2022-06-02 10:55:18 +00:00
## joinGet
2019-01-30 10:39:46 +00:00
2020-04-30 18:19:18 +00:00
The function lets you extract data from the table the same way as from a [dictionary ](../../sql-reference/dictionaries/index.md ).
2019-01-30 10:39:46 +00:00
2020-04-30 18:19:18 +00:00
Gets data from [Join ](../../engines/table-engines/special/join.md#creating-a-table ) tables using the specified join key.
2019-06-21 07:58:15 +00:00
2019-07-31 14:49:16 +00:00
Only supports tables created with the `ENGINE = Join(ANY, LEFT, <join_keys>)` statement.
2019-01-30 10:39:46 +00:00
2020-01-24 10:52:26 +00:00
**Syntax**
2019-10-27 16:33:47 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-10-27 16:33:47 +00:00
joinGet(join_storage_table_name, `value_column` , join_keys)
```
2021-02-15 21:22:10 +00:00
**Arguments**
2019-10-27 16:33:47 +00:00
2020-06-18 08:24:31 +00:00
- `join_storage_table_name` — an [identifier ](../../sql-reference/syntax.md#syntax-identifiers ) indicates where search is performed. The identifier is searched in the default database (see parameter `default_database` in the config file). To override the default database, use the `USE db_name` or specify the database and the table through the separator `db_name.db_table` , see the example.
2020-03-21 04:11:51 +00:00
- `value_column` — name of the column of the table that contains required data.
- `join_keys` — list of keys.
2019-10-27 16:33:47 +00:00
**Returned value**
Returns list of values corresponded to list of keys.
2021-05-27 19:44:11 +00:00
If certain does not exist in source table then `0` or `null` will be returned based on [join_use_nulls ](../../operations/settings/settings.md#join_use_nulls ) setting.
2019-10-27 16:33:47 +00:00
2020-04-30 18:19:18 +00:00
More info about `join_use_nulls` in [Join operation ](../../engines/table-engines/special/join.md ).
2020-01-24 10:52:26 +00:00
2019-10-27 16:33:47 +00:00
**Example**
Input table:
2020-01-20 09:48:34 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2020-01-20 09:48:34 +00:00
CREATE DATABASE db_test
2020-01-24 11:38:29 +00:00
CREATE TABLE db_test.id_val(`id` UInt32, `val` UInt32) ENGINE = Join(ANY, LEFT, id) SETTINGS join_use_nulls = 1
2020-01-20 09:48:34 +00:00
INSERT INTO db_test.id_val VALUES (1,11)(2,12)(4,13)
2019-10-27 16:33:47 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-10-27 16:33:47 +00:00
┌─id─┬─val─┐
│ 4 │ 13 │
│ 2 │ 12 │
│ 1 │ 11 │
└────┴─────┘
```
Query:
2020-03-20 10:10:48 +00:00
``` sql
2020-01-24 10:52:26 +00:00
SELECT joinGet(db_test.id_val,'val',toUInt32(number)) from numbers(4) SETTINGS join_use_nulls = 1
2019-10-27 16:33:47 +00:00
```
Result:
2020-03-20 10:10:48 +00:00
``` text
2020-01-20 09:48:34 +00:00
┌─joinGet(db_test.id_val, 'val', toUInt32(number))─┐
│ 0 │
│ 11 │
│ 12 │
│ 0 │
└──────────────────────────────────────────────────┘
2019-10-27 16:33:47 +00:00
```
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.
SQL syntax:
SELECT
catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
ACTION AS target
FROM amazon_train
LIMIT 10
Required configuration:
<catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>
*** Implementation Details ***
The internal protocol between the server and the library-bridge is
simple:
- HTTP GET on path "/extdict_ping":
A ping, used during the handshake to check if the library-bridge runs.
- HTTP POST on path "extdict_request"
(1) Send a "catboost_GetTreeCount" request from the server to the
bridge, containing a library path (e.g /home/user/libcatboost.so) and
a model path (e.g. /home/user/model.bin). Rirst, this unloads the
catboost library handler associated to the model path (if it was
loaded), then loads the catboost library handler associated to the
model path, then executes GetTreeCount() on the library handler and
finally sends the result back to the server. Step (1) is called once
by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
library path handler is unloaded in the beginning because it contains
state which may no longer be valid if the user runs
catboost("/path/to/model.bin", ...) more than once and if "model.bin"
was updated in between.
(2) Send "catboost_Evaluate" from the server to the bridge, containing
the model path and the features to run the interference on. Step (2)
is called multiple times (once per chunk) by the server from function
FunctionCatBoostEvaluate::executeImpl(). The library handler for the
given model path is expected to be already loaded by Step (1).
Fixes #27870
2022-08-05 07:53:06 +00:00
## catboostEvaluate(path_to_model, feature_1, feature_2, …, feature_n)
2023-02-16 19:19:25 +00:00
:::note
This function is not available in ClickHouse Cloud.
:::
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.
SQL syntax:
SELECT
catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
ACTION AS target
FROM amazon_train
LIMIT 10
Required configuration:
<catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>
*** Implementation Details ***
The internal protocol between the server and the library-bridge is
simple:
- HTTP GET on path "/extdict_ping":
A ping, used during the handshake to check if the library-bridge runs.
- HTTP POST on path "extdict_request"
(1) Send a "catboost_GetTreeCount" request from the server to the
bridge, containing a library path (e.g /home/user/libcatboost.so) and
a model path (e.g. /home/user/model.bin). Rirst, this unloads the
catboost library handler associated to the model path (if it was
loaded), then loads the catboost library handler associated to the
model path, then executes GetTreeCount() on the library handler and
finally sends the result back to the server. Step (1) is called once
by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
library path handler is unloaded in the beginning because it contains
state which may no longer be valid if the user runs
catboost("/path/to/model.bin", ...) more than once and if "model.bin"
was updated in between.
(2) Send "catboost_Evaluate" from the server to the bridge, containing
the model path and the features to run the interference on. Step (2)
is called multiple times (once per chunk) by the server from function
FunctionCatBoostEvaluate::executeImpl(). The library handler for the
given model path is expected to be already loaded by Step (1).
Fixes #27870
2022-08-05 07:53:06 +00:00
Evaluate external catboost model. [CatBoost ](https://catboost.ai ) is an open-source gradient boosting library developed by Yandex for machine learing.
Accepts a path to a catboost model and model arguments (features). Returns Float64.
``` sql
SELECT feat1, ..., feat_n, catboostEvaluate('/path/to/model.bin', feat_1, ..., feat_n) AS prediction
FROM data_table
```
**Prerequisites**
1. Build the catboost evaluation library
Before evaluating catboost models, the `libcatboostmodel.<so|dylib>` library must be made available. See [CatBoost documentation ](https://catboost.ai/docs/concepts/c-plus-plus-api_dynamic-c-pluplus-wrapper.html ) how to compile it.
Next, specify the path to `libcatboostmodel.<so|dylib>` in the clickhouse configuration:
``` xml
< clickhouse >
...
< catboost_lib_path > /path/to/libcatboostmodel.so< / catboost_lib_path >
...
< / clickhouse >
```
2022-12-02 10:52:01 +00:00
For security and isolation reasons, the model evaluation does not run in the server process but in the clickhouse-library-bridge process.
At the first execution of `catboostEvaluate()` , the server starts the library bridge process if it is not running already. Both processes
communicate using a HTTP interface. By default, port `9012` is used. A different port can be specified as follows - this is useful if port
`9012` is already assigned to a different service.
``` xml
< library_bridge >
< port > 9019< / port >
< / library_bridge >
```
feat: implement catboost in library-bridge
This commit moves the catboost model evaluation out of the server
process into the library-bridge binary. This serves two goals: On the
one hand, crashes / memory corruptions of the catboost library no longer
affect the server. On the other hand, we can forbid loading dynamic
libraries in the server (catboost was the last consumer of this
functionality), thus improving security.
SQL syntax:
SELECT
catboostEvaluate('/path/to/model.bin', FEAT_1, ..., FEAT_N) > 0 AS prediction,
ACTION AS target
FROM amazon_train
LIMIT 10
Required configuration:
<catboost_lib_path>/path/to/libcatboostmodel.so</catboost_lib_path>
*** Implementation Details ***
The internal protocol between the server and the library-bridge is
simple:
- HTTP GET on path "/extdict_ping":
A ping, used during the handshake to check if the library-bridge runs.
- HTTP POST on path "extdict_request"
(1) Send a "catboost_GetTreeCount" request from the server to the
bridge, containing a library path (e.g /home/user/libcatboost.so) and
a model path (e.g. /home/user/model.bin). Rirst, this unloads the
catboost library handler associated to the model path (if it was
loaded), then loads the catboost library handler associated to the
model path, then executes GetTreeCount() on the library handler and
finally sends the result back to the server. Step (1) is called once
by the server from FunctionCatBoostEvaluate::getReturnTypeImpl(). The
library path handler is unloaded in the beginning because it contains
state which may no longer be valid if the user runs
catboost("/path/to/model.bin", ...) more than once and if "model.bin"
was updated in between.
(2) Send "catboost_Evaluate" from the server to the bridge, containing
the model path and the features to run the interference on. Step (2)
is called multiple times (once per chunk) by the server from function
FunctionCatBoostEvaluate::executeImpl(). The library handler for the
given model path is expected to be already loaded by Step (1).
Fixes #27870
2022-08-05 07:53:06 +00:00
2. Train a catboost model using libcatboost
See [Training and applying models ](https://catboost.ai/docs/features/training.html#training ) for how to train catboost models from a training data set.
2022-08-17 20:13:23 +00:00
## throwIf(x\[, message\[, error_code\]\])
2019-01-30 10:39:46 +00:00
Throw an exception if the argument is non zero.
2022-08-17 20:13:23 +00:00
`message` - is an optional parameter: a constant string providing a custom error message
`error_code` - is an optional parameter: a constant integer providing a custom error code
To use the `error_code` argument, configuration parameter `allow_custom_error_code_in_throwif` must be enabled.
2019-09-03 01:27:48 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-09-03 01:27:48 +00:00
SELECT throwIf(number = 3, 'Too many') FROM numbers(10);
2019-09-23 15:31:46 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-09-03 01:27:48 +00:00
↙ Progress: 0.00 rows, 0.00 B (0.00 rows/s., 0.00 B/s.) Received exception from server (version 19.14.1):
Code: 395. DB::Exception: Received from localhost:9000. DB::Exception: Too many.
```
2018-10-16 10:47:17 +00:00
2022-06-02 10:55:18 +00:00
## identity
2019-10-22 19:14:56 +00:00
2021-05-27 19:44:11 +00:00
Returns the same value that was used as its argument. Used for debugging and testing, allows to cancel using index, and get the query performance of a full scan. When query is analyzed for possible use of index, the analyzer does not look inside `identity` functions. Also constant folding is not applied too.
2019-09-03 00:18:44 +00:00
2019-10-22 19:14:56 +00:00
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2020-02-02 21:25:51 +00:00
identity(x)
2019-10-22 19:14:56 +00:00
```
**Example**
Query:
2019-09-03 00:18:44 +00:00
2020-03-20 10:10:48 +00:00
``` sql
2019-09-03 00:18:44 +00:00
SELECT identity(42)
2019-09-23 15:31:46 +00:00
```
2019-10-22 19:14:56 +00:00
Result:
2020-03-20 10:10:48 +00:00
``` text
2019-09-03 00:18:44 +00:00
┌─identity(42)─┐
│ 42 │
└──────────────┘
```
2022-06-02 10:55:18 +00:00
## randomPrintableASCII
2019-12-25 17:26:40 +00:00
Generates a string with a random set of [ASCII ](https://en.wikipedia.org/wiki/ASCII#Printable_characters ) printable characters.
**Syntax**
2020-03-20 10:10:48 +00:00
``` sql
2019-12-28 23:11:02 +00:00
randomPrintableASCII(length)
2019-12-25 17:26:40 +00:00
```
2021-02-15 21:22:10 +00:00
**Arguments**
2019-12-25 17:26:40 +00:00
2020-03-21 04:11:51 +00:00
- `length` — Resulting string length. Positive integer.
2019-12-25 17:26:40 +00:00
2020-03-21 04:11:51 +00:00
If you pass `length < 0` , behavior of the function is undefined.
2019-12-25 17:26:40 +00:00
**Returned value**
2020-03-21 04:11:51 +00:00
- String with a random set of [ASCII ](https://en.wikipedia.org/wiki/ASCII#Printable_characters ) printable characters.
2019-12-25 17:26:40 +00:00
2020-04-30 18:19:18 +00:00
Type: [String ](../../sql-reference/data-types/string.md )
2019-12-25 17:26:40 +00:00
**Example**
2020-03-20 10:10:48 +00:00
``` sql
2019-12-28 23:11:02 +00:00
SELECT number, randomPrintableASCII(30) as str, length(str) FROM system.numbers LIMIT 3
2019-12-25 17:26:40 +00:00
```
2020-03-20 10:10:48 +00:00
``` text
2019-12-28 23:11:02 +00:00
┌─number─┬─str────────────────────────────┬─length(randomPrintableASCII(30))─┐
│ 0 │ SuiCOSTvC0csfABSw=UcSzp2.`rv8x │ 30 │
│ 1 │ 1Ag NlJ &RCN:*>HVPG; PE-nO"SUFD │ 30 │
│ 2 │ /"+< "wUTh:=LjJ Vm!c& hI*m#XTfzz │ 30 │
└────────┴────────────────────────────────┴──────────────────────────────────┘
2019-12-25 17:26:40 +00:00
```
2022-06-02 10:55:18 +00:00
## randomString
2020-06-15 18:44:05 +00:00
Generates a binary string of the specified length filled with random bytes (including zero bytes).
**Syntax**
``` sql
randomString(length)
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-06-15 18:44:05 +00:00
- `length` — String length. Positive integer.
**Returned value**
- String filled with random bytes.
Type: [String ](../../sql-reference/data-types/string.md ).
**Example**
Query:
``` sql
SELECT randomString(30) AS str, length(str) AS len FROM numbers(2) FORMAT Vertical;
```
Result:
``` text
Row 1:
──────
str: 3 G : pT ?w тi k aV f6
len: 30
Row 2:
──────
str: 9 ,] ^ ) ]?? 8
len: 30
```
**See Also**
- [generateRandom ](../../sql-reference/table-functions/generate.md#generaterandom )
- [randomPrintableASCII ](../../sql-reference/functions/other-functions.md#randomascii )
2020-07-21 10:14:34 +00:00
2022-06-02 10:55:18 +00:00
## randomFixedString
2020-07-23 12:19:14 +00:00
Generates a binary string of the specified length filled with random bytes (including zero bytes).
**Syntax**
``` sql
randomFixedString(length);
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-07-23 12:19:14 +00:00
- `length` — String length in bytes. [UInt64 ](../../sql-reference/data-types/int-uint.md ).
**Returned value(s)**
- String filled with random bytes.
Type: [FixedString ](../../sql-reference/data-types/fixedstring.md ).
**Example**
Query:
```sql
SELECT randomFixedString(13) as rnd, toTypeName(rnd)
```
Result:
```text
┌─rnd──────┬─toTypeName(randomFixedString(13))─┐
│ j▒h㋖HɨZ'▒ │ FixedString(13) │
└──────────┴───────────────────────────────────┘
```
2022-06-02 10:55:18 +00:00
## randomStringUTF8
2020-07-21 10:08:11 +00:00
2020-07-21 11:28:46 +00:00
Generates a random string of a specified length. Result string contains valid UTF-8 code points. The value of code points may be outside of the range of assigned Unicode.
2020-07-21 10:08:11 +00:00
**Syntax**
``` sql
randomStringUTF8(length);
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-07-21 10:08:11 +00:00
- `length` — Required length of the resulting string in code points. [UInt64 ](../../sql-reference/data-types/int-uint.md ).
**Returned value(s)**
- UTF-8 random string.
Type: [String ](../../sql-reference/data-types/string.md ).
**Example**
Query:
2020-08-19 07:52:33 +00:00
```sql
2020-07-21 10:08:11 +00:00
SELECT randomStringUTF8(13)
```
Result:
2020-08-19 07:52:33 +00:00
```text
2020-07-21 10:08:11 +00:00
┌─randomStringUTF8(13)─┐
│ 𘤗д兠庇 │
└──────────────────────┘
```
2022-06-02 10:55:18 +00:00
## getSetting
2020-09-28 02:59:01 +00:00
Returns the current value of a [custom setting ](../../operations/settings/index.md#custom_settings ).
2020-10-09 19:29:42 +00:00
**Syntax**
2020-09-28 02:59:01 +00:00
```sql
2020-10-09 19:29:42 +00:00
getSetting('custom_setting');
2020-09-28 02:59:01 +00:00
```
2020-10-09 19:29:42 +00:00
**Parameter**
2020-09-28 02:59:01 +00:00
- `custom_setting` — The setting name. [String ](../../sql-reference/data-types/string.md ).
**Returned value**
- The setting current value.
**Example**
```sql
SET custom_a = 123;
2020-10-09 19:29:42 +00:00
SELECT getSetting('custom_a');
2020-09-28 02:59:01 +00:00
```
**Result**
```
123
```
2020-10-09 19:29:42 +00:00
**See Also**
2020-09-28 02:59:01 +00:00
- [Custom Settings ](../../operations/settings/index.md#custom_settings )
2022-06-02 10:55:18 +00:00
## isDecimalOverflow
2020-10-07 18:13:01 +00:00
2020-10-09 19:29:42 +00:00
Checks whether the [Decimal ](../../sql-reference/data-types/decimal.md ) value is out of its (or specified) precision.
2020-10-07 18:13:01 +00:00
**Syntax**
``` sql
isDecimalOverflow(d, [p])
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-10-07 18:13:01 +00:00
2020-10-09 19:29:42 +00:00
- `d` — value. [Decimal ](../../sql-reference/data-types/decimal.md ).
2020-10-27 11:04:03 +00:00
- `p` — precision. Optional. If omitted, the initial precision of the first argument is used. Using of this paratemer could be helpful for data extraction to another DBMS or file. [UInt8 ](../../sql-reference/data-types/int-uint.md#uint-ranges ).
2020-10-07 18:13:01 +00:00
**Returned values**
- `1` — Decimal value has more digits then it's precision allow,
- `0` — Decimal value satisfies the specified precision.
**Example**
Query:
``` sql
SELECT isDecimalOverflow(toDecimal32(1000000000, 0), 9),
isDecimalOverflow(toDecimal32(1000000000, 0)),
isDecimalOverflow(toDecimal32(-1000000000, 0), 9),
isDecimalOverflow(toDecimal32(-1000000000, 0));
```
Result:
``` text
1 1 1 1
```
2022-06-02 10:55:18 +00:00
## countDigits
2020-10-07 18:13:01 +00:00
Returns number of decimal digits you need to represent the value.
**Syntax**
``` sql
countDigits(x)
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-10-07 18:13:01 +00:00
2020-10-10 07:50:37 +00:00
- `x` — [Int ](../../sql-reference/data-types/int-uint.md ) or [Decimal ](../../sql-reference/data-types/decimal.md ) value.
2020-10-07 18:13:01 +00:00
**Returned value**
Number of digits.
Type: [UInt8 ](../../sql-reference/data-types/int-uint.md#uint-ranges ).
2022-04-09 13:29:05 +00:00
:::note
For `Decimal` values takes into account their scales: calculates result over underlying integer type which is `(value * scale)` . For example: `countDigits(42) = 2` , `countDigits(42.000) = 5` , `countDigits(0.04200) = 4` . I.e. you may check decimal overflow for `Decimal64` with `countDecimal(x) > 18` . It's a slow variant of [isDecimalOverflow ](#is-decimal-overflow ).
:::
2020-10-07 18:13:01 +00:00
**Example**
Query:
``` sql
SELECT countDigits(toDecimal32(1, 9)), countDigits(toDecimal32(-1, 9)),
countDigits(toDecimal64(1, 18)), countDigits(toDecimal64(-1, 18)),
countDigits(toDecimal128(1, 38)), countDigits(toDecimal128(-1, 38));
```
Result:
``` text
10 10 19 19 39 39
```
2020-09-28 02:59:01 +00:00
2022-06-02 10:55:18 +00:00
## errorCodeToName
2020-10-12 18:22:09 +00:00
**Returned value**
- Variable name for the error code.
2020-10-29 06:42:08 +00:00
Type: [LowCardinality(String) ](../../sql-reference/data-types/lowcardinality.md ).
2020-10-12 18:22:09 +00:00
**Syntax**
``` sql
errorCodeToName(1)
```
Result:
``` text
UNSUPPORTED_METHOD
```
2022-06-02 10:55:18 +00:00
## tcpPort
2020-12-21 20:13:26 +00:00
2020-12-22 15:37:34 +00:00
Returns [native interface ](../../interfaces/tcp.md ) TCP port number listened by this server.
2021-10-06 20:13:46 +00:00
If it is executed in the context of a distributed table, then it generates a normal column, otherwise it produces a constant value.
2020-12-21 20:13:26 +00:00
**Syntax**
``` sql
tcpPort()
```
2021-02-15 21:22:10 +00:00
**Arguments**
2020-12-21 20:13:26 +00:00
- None.
**Returned value**
- The TCP port number.
Type: [UInt16 ](../../sql-reference/data-types/int-uint.md ).
**Example**
Query:
``` sql
SELECT tcpPort();
```
Result:
``` text
┌─tcpPort()─┐
│ 9000 │
└───────────┘
```
**See Also**
- [tcp_port ](../../operations/server-configuration-parameters/settings.md#server_configuration_parameters-tcp_port )
2022-06-02 10:55:18 +00:00
## currentProfiles
2021-08-03 12:03:10 +00:00
2021-08-05 06:25:52 +00:00
Returns a list of the current [settings profiles ](../../operations/access-rights.md#settings-profiles-management ) for the current user.
2021-08-06 05:15:55 +00:00
The command [SET PROFILE ](../../sql-reference/statements/set.md#query-set ) could be used to change the current setting profile. If the command `SET PROFILE` was not used the function returns the profiles specified at the current user's definition (see [CREATE USER ](../../sql-reference/statements/create/user.md#create-user-statement )).
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 11:26:24 +00:00
currentProfiles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 10:15:50 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-05 06:27:48 +00:00
- List of the current user settings profiles.
2021-08-03 12:03:10 +00:00
2021-08-04 11:26:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-03 12:03:10 +00:00
2022-06-02 10:55:18 +00:00
## enabledProfiles
2021-08-03 12:03:10 +00:00
2021-08-05 06:28:52 +00:00
Returns settings profiles, assigned to the current user both explicitly and implicitly. Explicitly assigned profiles are the same as returned by the [currentProfiles ](#current-profiles ) function. Implicitly assigned profiles include parent profiles of other assigned profiles, profiles assigned via granted roles, profiles assigned via their own settings, and the main default profile (see the `default_profile` section in the main server configuration file).
2021-08-04 11:26:24 +00:00
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 11:26:24 +00:00
enabledProfiles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 10:15:50 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-05 06:28:58 +00:00
- List of the enabled settings profiles.
2021-08-03 12:03:10 +00:00
2021-08-04 11:26:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-03 12:03:10 +00:00
2022-06-02 10:55:18 +00:00
## defaultProfiles
2021-08-03 12:03:10 +00:00
2021-08-05 06:29:12 +00:00
Returns all the profiles specified at the current user's definition (see [CREATE USER ](../../sql-reference/statements/create/user.md#create-user-statement ) statement).
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 11:26:24 +00:00
defaultProfiles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 10:15:50 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-05 06:29:17 +00:00
- List of the default settings profiles.
2021-08-03 12:03:10 +00:00
2021-08-04 11:26:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-07 15:07:37 +00:00
2022-06-02 10:55:18 +00:00
## currentRoles
2021-08-03 12:03:10 +00:00
2021-08-06 05:23:59 +00:00
Returns the names of the roles which are current for the current user. The current roles can be changed by the [SET ROLE ](../../sql-reference/statements/set-role.md#set-role-statement ) statement. If the `SET ROLE` statement was not used, the function `currentRoles` returns the same as `defaultRoles` .
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 09:31:24 +00:00
currentRoles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 09:31:24 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
- List of the current roles for the current user.
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-03 12:03:10 +00:00
2022-06-02 10:55:18 +00:00
## enabledRoles
2021-08-03 12:03:10 +00:00
2021-08-06 05:24:13 +00:00
Returns the names of the current roles and the roles, granted to some of the current roles.
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 09:31:24 +00:00
enabledRoles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 09:31:24 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
- List of the enabled roles for the current user.
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-03 12:03:10 +00:00
2022-06-02 10:55:18 +00:00
## defaultRoles
2021-08-03 12:03:10 +00:00
2021-08-09 13:16:31 +00:00
Returns the names of the roles which are enabled by default for the current user when he logins. Initially these are all roles granted to the current user (see [GRANT ](../../sql-reference/statements/grant/#grant-select )), but that can be changed with the [SET DEFAULT ROLE ](../../sql-reference/statements/set-role.md#set-default-role-statement ) statement.
2021-08-03 12:03:10 +00:00
**Syntax**
``` sql
2021-08-04 09:31:24 +00:00
defaultRoles()
2021-08-03 12:03:10 +00:00
```
2021-08-04 09:31:24 +00:00
**Returned value**
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
- List of the default roles for the current user.
2021-08-03 12:03:10 +00:00
2021-08-04 09:31:24 +00:00
Type: [Array ](../../sql-reference/data-types/array.md )([String](../../sql-reference/data-types/string.md)).
2021-08-07 15:07:37 +00:00
2022-06-02 10:55:18 +00:00
## getServerPort
2021-08-22 10:38:43 +00:00
2021-08-25 19:30:14 +00:00
Returns the number of the server port. When the port is not used by the server, throws an exception.
2021-08-22 10:38:43 +00:00
**Syntax**
``` sql
getServerPort(port_name)
```
**Arguments**
2021-08-22 16:02:40 +00:00
- `port_name` — The name of the server port. [String ](../../sql-reference/data-types/string.md#string ). Possible values:
- 'tcp_port'
- 'tcp_port_secure'
- 'http_port'
- 'https_port'
- 'interserver_http_port'
- 'interserver_https_port'
- 'mysql_port'
- 'postgresql_port'
- 'grpc_port'
- 'prometheus.port'
2021-08-22 10:38:43 +00:00
**Returned value**
- The number of the server port.
Type: [UInt16 ](../../sql-reference/data-types/int-uint.md ).
**Example**
Query:
``` sql
SELECT getServerPort('tcp_port');
```
Result:
``` text
┌─getServerPort('tcp_port')─┐
│ 9000 │
└───────────────────────────┘
```
2021-08-29 16:00:29 +00:00
2022-06-02 10:55:18 +00:00
## queryID
2021-08-21 10:47:06 +00:00
2021-08-24 16:51:20 +00:00
Returns the ID of the current query. Other parameters of a query can be extracted from the [system.query_log ](../../operations/system-tables/query_log.md ) table via `query_id` .
2021-08-21 10:47:06 +00:00
2021-08-25 17:33:00 +00:00
In contrast to [initialQueryID ](#initial-query-id ) function, `queryID` can return different results on different shards (see example).
2021-08-21 20:26:27 +00:00
2021-08-21 10:47:06 +00:00
**Syntax**
``` sql
queryID()
```
**Returned value**
- The ID of the current query.
Type: [String ](../../sql-reference/data-types/string.md )
2021-08-21 20:26:27 +00:00
**Example**
Query:
``` sql
2021-08-25 17:15:57 +00:00
CREATE TABLE tmp (str String) ENGINE = Log;
INSERT INTO tmp (*) VALUES ('a');
2021-08-23 19:44:01 +00:00
SELECT count(DISTINCT t) FROM (SELECT queryID() AS t FROM remote('127.0.0.{1..3}', currentDatabase(), 'tmp') GROUP BY queryID());
```
Result:
``` text
┌─count()─┐
│ 3 │
└─────────┘
2021-08-21 20:26:27 +00:00
```
2022-06-02 10:55:18 +00:00
## initialQueryID
2021-08-21 10:47:06 +00:00
2021-08-24 16:51:29 +00:00
Returns the ID of the initial current query. Other parameters of a query can be extracted from the [system.query_log ](../../operations/system-tables/query_log.md ) table via `initial_query_id` .
2021-08-21 10:47:06 +00:00
2021-08-24 16:51:56 +00:00
In contrast to [queryID ](#query-id ) function, `initialQueryID` returns the same results on different shards (see example).
2021-08-21 20:26:27 +00:00
2021-08-21 10:47:06 +00:00
**Syntax**
``` sql
initialQueryID()
```
**Returned value**
- The ID of the initial current query.
Type: [String ](../../sql-reference/data-types/string.md )
2021-08-21 20:26:27 +00:00
**Example**
Query:
``` sql
2021-08-25 17:16:16 +00:00
CREATE TABLE tmp (str String) ENGINE = Log;
INSERT INTO tmp (*) VALUES ('a');
2021-08-23 19:44:01 +00:00
SELECT count(DISTINCT t) FROM (SELECT initialQueryID() AS t FROM remote('127.0.0.{1..3}', currentDatabase(), 'tmp') GROUP BY queryID());
```
Result:
``` text
┌─count()─┐
│ 1 │
└─────────┘
2021-08-21 20:26:27 +00:00
```
2021-09-20 05:37:18 +00:00
2022-06-02 10:55:18 +00:00
## shardNum
2021-09-20 05:37:18 +00:00
2021-10-06 19:50:05 +00:00
Returns the index of a shard which processes a part of data for a distributed query. Indices are started from `1` .
If a query is not distributed then constant value `0` is returned.
2021-09-20 05:37:18 +00:00
**Syntax**
``` sql
shardNum()
```
**Returned value**
2021-10-06 19:50:05 +00:00
- Shard index or constant `0` .
2021-09-20 05:37:18 +00:00
Type: [UInt32 ](../../sql-reference/data-types/int-uint.md ).
2021-10-06 19:50:05 +00:00
**Example**
In the following example a configuration with two shards is used. The query is executed on the [system.one ](../../operations/system-tables/one.md ) table on every shard.
Query:
``` sql
CREATE TABLE shard_num_example (dummy UInt8)
ENGINE=Distributed(test_cluster_two_shards_localhost, system, one, dummy);
SELECT dummy, shardNum(), shardCount() FROM shard_num_example;
```
Result:
``` text
┌─dummy─┬─shardNum()─┬─shardCount()─┐
│ 0 │ 2 │ 2 │
│ 0 │ 1 │ 2 │
└───────┴────────────┴──────────────┘
```
**See Also**
- [Distributed Table Engine ](../../engines/table-engines/special/distributed.md )
2022-06-02 10:55:18 +00:00
## shardCount
2021-09-20 05:37:18 +00:00
2021-10-06 19:50:05 +00:00
Returns the total number of shards for a distributed query.
If a query is not distributed then constant value `0` is returned.
2021-09-20 05:37:18 +00:00
**Syntax**
``` sql
shardCount()
```
**Returned value**
2021-10-06 19:50:05 +00:00
- Total number of shards or `0` .
2021-09-20 05:37:18 +00:00
Type: [UInt32 ](../../sql-reference/data-types/int-uint.md ).
2021-10-06 19:50:05 +00:00
**See Also**
2021-09-20 05:37:18 +00:00
2021-10-06 19:50:05 +00:00
- [shardNum() ](#shard-num ) function example also contains `shardCount()` function call.
2021-10-20 17:20:14 +00:00
2022-06-02 10:55:18 +00:00
## getOSKernelVersion
2021-10-20 17:20:14 +00:00
2021-10-21 18:33:47 +00:00
Returns a string with the current OS kernel version.
2021-10-20 17:20:14 +00:00
**Syntax**
``` sql
getOSKernelVersion()
```
**Arguments**
- None.
**Returned value**
- The current OS kernel version.
Type: [String ](../../sql-reference/data-types/string.md ).
**Example**
Query:
``` sql
SELECT getOSKernelVersion();
```
Result:
``` text
┌─getOSKernelVersion()────┐
│ Linux 4.15.0-55-generic │
└─────────────────────────┘
```
2021-11-07 21:42:57 +00:00
2022-06-02 10:55:18 +00:00
## zookeeperSessionUptime
2021-11-07 21:42:57 +00:00
2021-11-09 10:02:06 +00:00
Returns the uptime of the current ZooKeeper session in seconds.
2021-11-07 21:42:57 +00:00
**Syntax**
``` sql
zookeeperSessionUptime()
```
**Arguments**
- None.
**Returned value**
2021-11-09 10:08:47 +00:00
- Uptime of the current ZooKeeper session in seconds.
2021-11-07 21:42:57 +00:00
Type: [UInt32 ](../../sql-reference/data-types/int-uint.md ).
**Example**
Query:
``` sql
SELECT zookeeperSessionUptime();
```
Result:
``` text
┌─zookeeperSessionUptime()─┐
│ 286 │
└──────────────────────────┘
```
2023-03-09 17:47:14 +00:00
## generateRandomStructure
Generates random table structure in a format `column1_name column1_type, column2_name column2_type, ...` .
**Syntax**
``` sql
generateRandomStructure([number_of_columns, seed])
```
**Arguments**
- `number_of_columns` — The desired number of columns in the result table structure. If set to 0, the number of columns will be random from 1 to 128. Default value - 0.
- `seed` - Random seed to produce stable results. If seed is not specified, it is randomly generated.
All arguments must be constant.
**Returned value**
- Randomly generated table structure.
Type: [String ](../../sql-reference/data-types/string.md ).
**Examples**
Query:
``` sql
SELECT generateRandomStructure()
```
Result:
``` text
┌─generateRandomStructure()─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│ c1 Decimal32(5), c2 Date, c3 Tuple(LowCardinality(String), Int128, UInt64, UInt16, UInt8, IPv6), c4 Array(UInt128), c5 UInt32, c6 IPv4, c7 Decimal256(64), c8 Decimal128(3), c9 UInt256, c10 UInt64, c11 DateTime │
└───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘
```
Query:
``` sql
SELECT generateRandomStructure(1)
```
Result:
``` text
┌─generateRandomStructure(1)─┐
│ c1 Map(UInt256, UInt16) │
└────────────────────────────┘
```
Query:
``` sql
SELECT generateRandomStructure(0, 11)
```
Result:
``` text
┌─generateRandomStructure(0, 11)──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│ c1 Date32, c2 String, c3 IPv6, c4 DateTime, c5 UInt16, c6 Tuple(e1 UInt32, e2 Date, e3 Date, e4 IPv6, e5 Nested(e1 DateTime, e2 FixedString(110), e3 Int256, e4 Array(Decimal64(4)), e5 Decimal128(18), e6 Enum16('v0' = 0, 'v1' = 1, 'v2' = 2, 'v3' = 3, 'v4' = 4)), e6 DateTime64(4)), c7 DateTime, c8 DateTime64(6), c9 Bool │
└─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘
```
This function can be used together with [generateRandom ](../../sql-reference/table-functions/generate.md ) to generate completely random tables.