---
slug: /en/sql-reference/functions/string-search-functions
sidebar_position: 160
sidebar_label: Searching in Strings
---

# Functions for Searching in Strings

All functions in this section search case-sensitively by default. Case-insensitive search is usually provided by separate function variants.

:::note
Case-insensitive search follows the lowercase-uppercase rules of the English language. For example, uppercase `i` in English is
`I`, whereas in Turkish it is `İ`, so results for languages other than English may be unexpected.
:::

Functions in this section also assume that the searched string (referred to in this section as `haystack`) and the search string (referred to in this section as `needle`) are single-byte encoded text. If this assumption is
violated, no exception is thrown and results are undefined. Search with UTF-8 encoded strings is usually provided by separate function
variants. Likewise, if a UTF-8 function variant is used and the input strings are not UTF-8 encoded text, no exception is thrown and the
results are undefined. Note that no automatic Unicode normalization is performed; you can use the
[normalizeUTF8*()](string-functions.md) functions for that.

[General string functions](string-functions.md) and [functions for replacing in strings](string-replace-functions.md) are described separately.

## position

Returns the position (in bytes, starting at 1) of a substring `needle` in a string `haystack`.

**Syntax**

``` sql
position(haystack, needle[, start_pos])
```

Alias:
- `position(needle IN haystack)`

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- Starting position in bytes and counting from 1, if the substring was found. [UInt64](../data-types/int-uint.md).
- 0, if the substring was not found. [UInt64](../data-types/int-uint.md).

If substring `needle` is empty, these rules apply:
- if no `start_pos` was specified: return `1`
- if `start_pos = 0`: return `1`
- if `start_pos >= 1` and `start_pos <= length(haystack) + 1`: return `start_pos`
- otherwise: return `0`

The same rules also apply to functions `locate`, `positionCaseInsensitive`, `positionUTF8` and `positionCaseInsensitiveUTF8`.

**Examples**

Query:

``` sql
SELECT position('Hello, world!', '!');
```

Result:

``` text
┌─position('Hello, world!', '!')─┐
│                             13 │
└────────────────────────────────┘
```

Example with `start_pos` argument:

Query:

``` sql
SELECT
    position('Hello, world!', 'o', 1),
    position('Hello, world!', 'o', 7)
```

Result:

``` text
┌─position('Hello, world!', 'o', 1)─┬─position('Hello, world!', 'o', 7)─┐
│                                 5 │                                 9 │
└───────────────────────────────────┴───────────────────────────────────┘
```

Example for `needle IN haystack` syntax:

Query:

```sql
SELECT 6 = position('/' IN s) FROM (SELECT 'Hello/World' AS s);
```

Result:

```text
┌─equals(6, position(s, '/'))─┐
│                           1 │
└─────────────────────────────┘
```

Examples with empty `needle` substring:

Query:

``` sql
SELECT
    position('abc', ''),
    position('abc', '', 0),
    position('abc', '', 1),
    position('abc', '', 2),
    position('abc', '', 3),
    position('abc', '', 4),
    position('abc', '', 5)
```

Result:

``` text
┌─position('abc', '')─┬─position('abc', '', 0)─┬─position('abc', '', 1)─┬─position('abc', '', 2)─┬─position('abc', '', 3)─┬─position('abc', '', 4)─┬─position('abc', '', 5)─┐
│                   1 │                      1 │                      1 │                      2 │                      3 │                      4 │                      0 │
└─────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┘
```

## locate

Like [position](#position) but with arguments `haystack` and `needle` switched.

The behavior of this function depends on the ClickHouse version:
- in versions < 24.3, `locate` was an alias of function `position` and accepted arguments `(haystack, needle[, start_pos])`.
- in versions >= 24.3, `locate` is an individual function (for better compatibility with MySQL) and accepts arguments `(needle, haystack[, start_pos])`. The previous behavior
  can be restored using setting [function_locate_has_mysql_compatible_argument_order = false](../../operations/settings/settings.md#function-locate-has-mysql-compatible-argument-order).

**Syntax**

``` sql
locate(needle, haystack[, start_pos])
```
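
As a minimal sketch of the switched argument order, assuming ClickHouse 24.3 or later with the default value of `function_locate_has_mysql_compatible_argument_order`, the two expressions below search for the same needle in the same haystack and are expected to return the same position. The column aliases are illustrative only.

```sql
SELECT
    locate('/', 'Hello/World') AS via_locate,      -- needle first, haystack second
    position('Hello/World', '/') AS via_position;  -- haystack first, needle second
```

On versions before 24.3, or with the setting disabled, the first expression would instead treat `'/'` as the haystack.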

## positionCaseInsensitive

A case-insensitive variant of [position](#position).

**Example**

Query:

``` sql
SELECT positionCaseInsensitive('Hello, world!', 'hello');
```

Result:

``` text
┌─positionCaseInsensitive('Hello, world!', 'hello')─┐
│                                                 1 │
└───────────────────────────────────────────────────┘
```

## positionUTF8

Like [position](#position) but assumes `haystack` and `needle` are UTF-8 encoded strings.

**Examples**

Function `positionUTF8` correctly counts character `ö` (represented by two bytes in UTF-8) as a single Unicode code point:

Query:

``` sql
SELECT positionUTF8('Motörhead', 'r');
```

Result:

``` text
┌─positionUTF8('Motörhead', 'r')─┐
│                              5 │
└────────────────────────────────┘
```

## positionCaseInsensitiveUTF8

Like [positionUTF8](#positionutf8) but searches case-insensitively.
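
A small sketch that contrasts the two UTF-8 variants, reusing the `Motörhead` haystack from the `positionUTF8` example. The uppercase needle `R` does not occur in the haystack, so the case-sensitive call is expected to report no match (0), while the case-insensitive call should report the position of the lowercase `r`. The column aliases are illustrative only.

```sql
SELECT
    positionUTF8('Motörhead', 'R') AS case_sensitive,                  -- 'R' is not present as-is
    positionCaseInsensitiveUTF8('Motörhead', 'R') AS case_insensitive; -- 'R' matches 'r'
```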

## multiSearchAllPositions

Like [position](#position) but returns an array of positions (in bytes, starting at 1) for multiple `needle` substrings in a `haystack` string.

:::note
All `multiSearch*()` functions only support up to 2<sup>8</sup> needles.
:::

**Syntax**

``` sql
multiSearchAllPositions(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array of the starting position in bytes and counting from 1, if the substring was found.
- 0, if the substring was not found.

**Example**

Query:

``` sql
SELECT multiSearchAllPositions('Hello, World!', ['hello', '!', 'world']);
```

Result:

``` text
┌─multiSearchAllPositions('Hello, World!', ['hello', '!', 'world'])─┐
│ [0,13,0]                                                          │
└───────────────────────────────────────────────────────────────────┘
```

## multiSearchAllPositionsCaseInsensitive

Like [multiSearchAllPositions](#multisearchallpositions) but ignores case.

**Syntax**

```sql
multiSearchAllPositionsCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array of the starting position in bytes and counting from 1 (if the substring was found).
- 0 if the substring was not found.

**Example**

Query:

```sql
SELECT multiSearchAllPositionsCaseInsensitive('ClickHouse',['c','h']);
```

Result:

```response
["1","6"]
```

## multiSearchAllPositionsUTF8

Like [multiSearchAllPositions](#multisearchallpositions) but assumes `haystack` and the `needle` substrings are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchAllPositionsUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 encoded string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 encoded substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array of the starting position in bytes and counting from 1 (if the substring was found).
- 0 if the substring was not found.

**Example**

Given `ClickHouse` as a UTF-8 string, find the positions of `C` (`\x43`) and `H` (`\x48`).

Query:

```sql
SELECT multiSearchAllPositionsUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x43','\x48']);
```

Result:

```response
["1","6"]
```

## multiSearchAllPositionsCaseInsensitiveUTF8

Like [multiSearchAllPositionsUTF8](#multisearchallpositionsutf8) but ignores case.

**Syntax**

```sql
multiSearchAllPositionsCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 encoded string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 encoded substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array of the starting position in bytes and counting from 1 (if the substring was found).
- 0 if the substring was not found.

**Example**

Given `ClickHouse` as a UTF-8 string, find the positions of `c` (`\x63`) and `h` (`\x68`).

Query:

```sql
SELECT multiSearchAllPositionsCaseInsensitiveUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x63','\x68']);
```

Result:

```response
["1","6"]
```

## multiSearchFirstPosition

Like [`position`](#position) but returns the leftmost offset in a `haystack` string which matches any of multiple `needle` strings.

Functions [`multiSearchFirstPositionCaseInsensitive`](#multisearchfirstpositioncaseinsensitive), [`multiSearchFirstPositionUTF8`](#multisearchfirstpositionutf8) and [`multiSearchFirstPositionCaseInsensitiveUTF8`](#multisearchfirstpositioncaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchFirstPosition(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` —  Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Query:

```sql
SELECT multiSearchFirstPosition('Hello World',['llo', 'Wor', 'ld']);
```

Result:

```response
3
```

## multiSearchFirstPositionCaseInsensitive

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but ignores case.

**Syntax**

```sql
multiSearchFirstPositionCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Query:

```sql
SELECT multiSearchFirstPositionCaseInsensitive('HELLO WORLD',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstPositionUTF8

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but assumes `haystack` and `needle` to be UTF-8 strings.

**Syntax**

```sql
multiSearchFirstPositionUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Find the leftmost offset in UTF-8 string `hello world` which matches any of the given needles.

Query:

```sql
SELECT multiSearchFirstPositionUTF8('\x68\x65\x6c\x6c\x6f\x20\x77\x6f\x72\x6c\x64',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstPositionCaseInsensitiveUTF8

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but assumes `haystack` and `needle` to be UTF-8 strings and ignores case.

**Syntax**

```sql
multiSearchFirstPositionCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings, ignoring case.
- 0, if there was no match.

**Example**

Find the leftmost offset in UTF-8 string `HELLO WORLD` which matches any of the given needles.

Query:

```sql
SELECT multiSearchFirstPositionCaseInsensitiveUTF8('\x48\x45\x4c\x4c\x4f\x20\x57\x4f\x52\x4c\x44',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstIndex

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise.

Functions [`multiSearchFirstIndexCaseInsensitive`](#multisearchfirstindexcaseinsensitive), [`multiSearchFirstIndexUTF8`](#multisearchfirstindexutf8) and [`multiSearchFirstIndexCaseInsensitiveUTF8`](#multisearchfirstindexcaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchFirstIndex(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Query:

```sql
SELECT multiSearchFirstIndex('Hello World',['World','Hello']);
```

Result:

```response
1
```

## multiSearchFirstIndexCaseInsensitive

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Ignores case.

**Syntax**

```sql
multiSearchFirstIndexCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Query:

```sql
SELECT multiSearchFirstIndexCaseInsensitive('hElLo WoRlD',['World','Hello']);
```

Result:

```response
1
```

## multiSearchFirstIndexUTF8

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Assumes `haystack` and `needle` are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchFirstIndexUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Given `Hello World` as a UTF-8 string, find the first index of UTF-8 strings `Hello` and `World`.

Query:

```sql
SELECT multiSearchFirstIndexUTF8('\x48\x65\x6c\x6c\x6f\x20\x57\x6f\x72\x6c\x64',['\x57\x6f\x72\x6c\x64','\x48\x65\x6c\x6c\x6f']);
```

Result:

```response
1
```

## multiSearchFirstIndexCaseInsensitiveUTF8

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Assumes `haystack` and `needle` are UTF-8 encoded strings. Ignores case.

**Syntax**

```sql
multiSearchFirstIndexCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Given `HELLO WORLD` as a UTF-8 string, find the first index of UTF-8 strings `hello` and `world`.

Query:

```sql
SELECT multiSearchFirstIndexCaseInsensitiveUTF8('\x48\x45\x4c\x4c\x4f\x20\x57\x4f\x52\x4c\x44',['\x68\x65\x6c\x6c\x6f','\x77\x6f\x72\x6c\x64']);
```

Result:

```response
1
```

## multiSearchAny

Returns 1, if at least one string needle<sub>i</sub> matches the string `haystack` and 0 otherwise.

Functions [`multiSearchAnyCaseInsensitive`](#multisearchanycaseinsensitive), [`multiSearchAnyUTF8`](#multisearchanyutf8) and [`multiSearchAnyCaseInsensitiveUTF8`](#multisearchanycaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchAny(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one match.
- 0, if there was not at least one match.

**Example**

Query:

```sql
SELECT multiSearchAny('ClickHouse',['C','H']);
```

Result:

```response
1
```

## multiSearchAnyCaseInsensitive

Like [multiSearchAny](#multisearchany) but ignores case.

**Syntax**

```sql
multiSearchAnyCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one case-insensitive match.
- 0, if there was not at least one case-insensitive match.

**Example**

Query:

```sql
SELECT multiSearchAnyCaseInsensitive('ClickHouse',['c','h']);
```

Result:

```response
1
```

## multiSearchAnyUTF8

Like [multiSearchAny](#multisearchany) but assumes `haystack` and the `needle` substrings are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchAnyUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one match.
- 0, if there was not at least one match.

**Example**

Given `ClickHouse` as a UTF-8 string, check if there are any `C` (`\x43`) or `H` (`\x48`) letters in the word.

Query:

```sql
SELECT multiSearchAnyUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x43','\x48']);
```

Result:

```response
1
```

## multiSearchAnyCaseInsensitiveUTF8

Like [multiSearchAnyUTF8](#multisearchanyutf8) but ignores case.

**Syntax**

```sql
multiSearchAnyCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one case-insensitive match.
- 0, if there was not at least one case-insensitive match.

**Example**

Given `ClickHouse` as a UTF-8 string, check if there is any letter `h` (`\x68`) in the word, ignoring case.

Query:

```sql
SELECT multiSearchAnyCaseInsensitiveUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x68']);
```

Result:

```response
1
```

## match {#match}

Returns whether string `haystack` matches the regular expression `pattern` in [re2 regular expression syntax](https://github.com/google/re2/wiki/Syntax).

Matching is based on UTF-8, e.g. `.` matches the Unicode code point `¥` which is represented in UTF-8 using two bytes. The regular
expression must not contain null bytes. If the haystack or the pattern are not valid UTF-8, then the behavior is undefined.

Unlike re2's default behavior, `.` matches line breaks. To disable this, prepend the pattern with `(?-s)`.

If you only want to search substrings in a string, you can use functions [like](#like) or [position](#position) instead - they work much faster than this function.

**Syntax**

```sql
match(haystack, pattern)
```

Alias: `haystack REGEXP pattern` (operator)
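
A short sketch of the function and operator forms. The haystack and patterns are chosen for illustration, and the column aliases are hypothetical names. Since the pattern does not have to cover the whole haystack, each of the three expressions is expected to report a match.

```sql
SELECT
    match('Hello, world!', '^Hello') AS anchored_prefix,  -- pattern anchored at the start
    match('Hello, world!', 'w.rld') AS any_character,     -- `.` matches a single character
    'Hello, world!' REGEXP 'o, w' AS operator_form;       -- operator syntax for the same function
```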

## multiMatchAny

Like `match` but returns 1 if at least one of the patterns matches and 0 otherwise.

:::note
Functions in the `multi[Fuzzy]Match*()` family use the [Vectorscan](https://github.com/VectorCamp/vectorscan) library. As such, they are only enabled if ClickHouse is compiled with support for vectorscan.

To turn off all functions that use hyperscan, use setting `SET allow_hyperscan = 0;`.

Due to restrictions of vectorscan, the length of the `haystack` string must be less than 2<sup>32</sup> bytes.

Hyperscan is generally vulnerable to regular expression denial of service (ReDoS) attacks (e.g. see
[here](https://www.usenix.org/conference/usenixsecurity22/presentation/turonova), [here](https://doi.org/10.1007/s10664-021-10033-1) and
[here](https://doi.org/10.1145/3236024.3236027)). Users are advised to check the provided patterns carefully.
:::

If you only want to search multiple substrings in a string, you can use function [multiSearchAny](#multisearchany) instead - it works much faster than this function.

**Syntax**

```sql
multiMatchAny(haystack, [pattern1, pattern2, ..., patternN])
```
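
A minimal sketch, assuming a ClickHouse build with vectorscan support. Only the first of the two illustrative patterns matches the haystack, which is enough for the function to report a match.

```sql
SELECT multiMatchAny('ClickHouse', ['^Click', 'base$']) AS any_pattern_matches;
```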

## multiMatchAnyIndex

Like `multiMatchAny` but returns any index that matches the haystack.

**Syntax**

```sql
multiMatchAnyIndex(haystack, [pattern1, pattern2, ..., patternN])
```
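
A minimal sketch, assuming a ClickHouse build with vectorscan support. The patterns are chosen so that only `Click` matches; with a single matching pattern, the result is unambiguous and is expected to identify that pattern by its position in the array.

```sql
SELECT multiMatchAnyIndex('ClickHouse', ['^House', 'Click', 'xyz']) AS matching_index;
```

When several patterns match, any one of their indices may be returned, so queries should not rely on a particular choice.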

## multiMatchAllIndices

Like `multiMatchAny` but returns the array of all indices that match the haystack in any order.

**Syntax**

```sql
multiMatchAllIndices(haystack, [pattern1, pattern2, ..., patternN])
```
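
A minimal sketch, assuming a ClickHouse build with vectorscan support. Because the indices come back in no particular order, this example wraps the call in `arraySort` to get a stable result; that wrapper is a choice made for this illustration, not a requirement of the function.

```sql
SELECT arraySort(multiMatchAllIndices('ClickHouse', ['Click', 'House', 'xyz'])) AS matching_indices;
```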

## multiFuzzyMatchAny

Like `multiMatchAny` but returns 1 if any pattern matches the haystack within a constant [edit distance](https://en.wikipedia.org/wiki/Edit_distance). This function relies on the experimental feature of the [hyperscan](https://intel.github.io/hyperscan/dev-reference/compilation.html#approximate-matching) library and can be slow in some corner cases. The performance depends on the edit distance value and the patterns used, but it is always more expensive than the non-fuzzy variants.

:::note
The `multiFuzzyMatch*()` function family does not support UTF-8 regular expressions (it treats them as a sequence of bytes) due to restrictions of hyperscan.
:::

**Syntax**

```sql
multiFuzzyMatchAny(haystack, distance, [pattern1, pattern2, ..., patternN])
```
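
A minimal sketch comparing exact and fuzzy matching, assuming a ClickHouse build with vectorscan support. The haystack `ClickHouze` is a made-up misspelling that differs from the pattern by one substituted character, so the exact variant is expected to find no match while the fuzzy variant with an edit distance of 1 should find one.

```sql
SELECT
    multiMatchAny('ClickHouze', ['ClickHouse']) AS exact_match,
    multiFuzzyMatchAny('ClickHouze', 1, ['ClickHouse']) AS within_distance_1;
```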

## multiFuzzyMatchAnyIndex

Like `multiFuzzyMatchAny` but returns any index that matches the haystack within a constant edit distance.

**Syntax**

```sql
multiFuzzyMatchAnyIndex(haystack, distance, [pattern1, pattern2, ..., patternN])
```

## multiFuzzyMatchAllIndices

Like `multiFuzzyMatchAny` but returns the array of all indices in any order that match the haystack within a constant edit distance.

**Syntax**

```sql
multiFuzzyMatchAllIndices(haystack, distance, [pattern1, pattern2, ..., patternN])
```

## extract

Extracts a fragment of a string using a regular expression. If `haystack` does not match the `pattern` regex, an empty string is returned.

For regex without subpatterns, the function uses the fragment that matches the entire regex. Otherwise, it uses the fragment that matches the first subpattern.

**Syntax**

```sql
extract(haystack, pattern)
```
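
A short sketch of the subpattern rule described above, using an illustrative haystack and hypothetical column aliases. The first pattern has no subpattern, so the whole match (`42`) should be returned; the second has one subpattern, so only the fragment captured by it (`7`) should be returned. Backslashes are doubled because they sit inside a string literal.

```sql
SELECT
    extract('number: 42, other: 7', '\\d+') AS whole_match,          -- no subpattern
    extract('number: 42, other: 7', 'other: (\\d+)') AS first_group; -- one subpattern
```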

## extractAll

Extracts all fragments of a string using a regular expression. If `haystack` does not match the `pattern` regex, an empty array is returned.

Returns an array of strings consisting of all matches of the regex.

The behavior with respect to subpatterns is the same as in function `extract`.

**Syntax**

```sql
extractAll(haystack, pattern)
```
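
A minimal sketch with an illustrative haystack: the pattern has no subpatterns, so each array element should be one complete match of `\d+`, in order of appearance.

```sql
SELECT extractAll('a=1, b=22, c=333', '\\d+') AS all_numbers;
```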

## extractAllGroupsHorizontal

Matches all groups of the `haystack` string using the `pattern` regular expression. Returns an array of arrays, where the first array includes all fragments matching the first group, the second array - matching the second group, etc.

This function is slower than [extractAllGroupsVertical](#extractallgroupsvertical).

**Syntax**

``` sql
extractAllGroupsHorizontal(haystack, pattern)
```

**Arguments**

- `haystack` — Input string. [String](../data-types/string.md).
- `pattern` — Regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). Must contain groups, each group enclosed in parentheses. If `pattern` contains no groups, an exception is thrown. [String](../data-types/string.md).

**Returned value**

- Array of arrays of matches. [Array](../data-types/array.md).

:::note
If `haystack` does not match the `pattern` regex, an array of empty arrays is returned.
:::

**Example**

``` sql
SELECT extractAllGroupsHorizontal('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)');
```

Result:

``` text
┌─extractAllGroupsHorizontal('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)')─┐
│ [['abc','def','ghi'],['111','222','333']]                                                │
└──────────────────────────────────────────────────────────────────────────────────────────┘
```

## extractAllGroupsVertical

Matches all groups of the `haystack` string using the `pattern` regular expression. Returns an array of arrays, where each array includes matching fragments from every group. Fragments are grouped in order of appearance in the `haystack`.

**Syntax**

``` sql
extractAllGroupsVertical(haystack, pattern)
```

**Arguments**

- `haystack` — Input string. [String](../data-types/string.md).
- `pattern` — Regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). Must contain groups, each group enclosed in parentheses. If `pattern` contains no groups, an exception is thrown. [String](../data-types/string.md).

**Returned value**

- Array of arrays of matches. [Array](../data-types/array.md).

:::note
If `haystack` does not match the `pattern` regex, an empty array is returned.
:::

**Example**

``` sql
SELECT extractAllGroupsVertical('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)');
```

Result:

``` text
┌─extractAllGroupsVertical('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)')─┐
│ [['abc','111'],['def','222'],['ghi','333']]                                            │
└────────────────────────────────────────────────────────────────────────────────────────┘
```

## like

Returns whether string `haystack` matches the LIKE expression `pattern`.

A LIKE expression can contain normal characters and the following metasymbols:

- `%` indicates an arbitrary number of arbitrary characters (including zero characters).
- `_` indicates a single arbitrary character.
- `\` is for escaping literals `%`, `_` and `\`.

Matching is based on UTF-8, e.g. `_` matches the Unicode code point `¥` which is represented in UTF-8 using two bytes.

If the haystack or the LIKE expression are not valid UTF-8, the behavior is undefined.

No automatic Unicode normalization is performed; you can use the [normalizeUTF8*()](string-functions.md) functions for that.

To match against literal `%`, `_` and `\` (which are LIKE metacharacters), prepend them with a backslash: `\%`, `\_` and `\\`.
The backslash loses its special meaning (i.e. is interpreted literally) if it prepends a character different than `%`, `_` or `\`.
Note that ClickHouse requires backslashes in strings [to be quoted as well](../syntax.md#string), so you would actually need to write `\\%`, `\\_` and `\\\\`.

For LIKE expressions of the form `%needle%`, the function is as fast as the `position` function.
All other LIKE expressions are internally converted to a regular expression and executed with a performance similar to function `match`.

**Syntax**

```sql
like(haystack, pattern)
```

Alias: `haystack LIKE pattern` (operator)
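
The metasymbols and escaping rules above can be sketched as follows. The haystacks and column aliases are illustrative, and each expression is expected to report a match. Note the doubled backslash in the last pattern, which is needed because the backslash must also be escaped inside the string literal.

```sql
SELECT
    like('ClickHouse', 'Click%') AS function_form,      -- % matches any number of characters
    'ClickHouse' LIKE '%House' AS operator_form,        -- operator syntax for the same function
    'ClickHouse' LIKE 'Cl_ckHouse' AS single_character, -- _ matches exactly one character
    '100%' LIKE '100\\%' AS literal_percent;            -- \% matches a literal percent sign
```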

## notLike {#notlike}

Like `like` but negates the result.

Alias: `haystack NOT LIKE pattern` (operator)

## ilike

Like `like` but searches case-insensitively.

Alias: `haystack ILIKE pattern` (operator)
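
A small sketch contrasting the two functions on the same illustrative input: the lowercase pattern is not expected to match with `like`, but should match with `ilike`.

```sql
SELECT
    like('ClickHouse', '%house%') AS case_sensitive,
    ilike('ClickHouse', '%house%') AS case_insensitive;
```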

## notILike

Like `ilike` but negates the result.

Alias: `haystack NOT ILIKE pattern` (operator)

## ngramDistance

Calculates the 4-gram distance between a `haystack` string and a `needle` string. For this, it counts the symmetric difference between two multisets of 4-grams and normalizes it by the sum of their cardinalities. Returns a [Float32](../data-types/float.md/#float32-float64) between 0 and 1. The smaller the result is, the more similar the strings are to each other.

Functions [`ngramDistanceCaseInsensitive`](#ngramdistancecaseinsensitive), [`ngramDistanceUTF8`](#ngramdistanceutf8), [`ngramDistanceCaseInsensitiveUTF8`](#ngramdistancecaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
ngramDistance(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the similarity between the two strings. [Float32](../data-types/float.md/#float32-float64)

**Implementation details**

This function will throw an exception if constant `needle` or `haystack` arguments are more than 32Kb in size. If any non-constant `haystack` or `needle` arguments are more than 32Kb in size, then the distance is always 1.

**Examples**

The more similar two strings are to each other, the closer the result will be to 0 (identical).

Query:

```sql
SELECT ngramDistance('ClickHouse','ClickHouse!');
```

Result:

```response
0.06666667
```

The less similar two strings are to each other, the larger the result will be.

Query:

```sql
SELECT ngramDistance('ClickHouse','House');
```

Result:

```response
0.5555556
```

## ngramDistanceCaseInsensitive

Provides a case-insensitive variant of [ngramDistance](#ngramdistance).

**Syntax**

```sql
ngramDistanceCaseInsensitive(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the similarity between the two strings. [Float32](../data-types/float.md/#float32-float64)

**Examples**

With [ngramDistance](#ngramdistance) differences in case will affect the similarity value:

Query:

```sql
SELECT ngramDistance('ClickHouse','clickhouse');
```

Result:

```response
0.71428573
```

With [ngramDistanceCaseInsensitive](#ngramdistancecaseinsensitive), case is ignored, so two strings that differ only in case return a distance of 0:

Query:

```sql
SELECT ngramDistanceCaseInsensitive('ClickHouse','clickhouse');
```

Result:

```response
0
```

## ngramDistanceUTF8

Provides a UTF-8 variant of [ngramDistance](#ngramdistance). Assumes that `needle` and `haystack` strings are UTF-8 encoded strings.

**Syntax**

```sql
ngramDistanceUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; values closer to 0 mean more similar strings. [Float32](../data-types/float.md/#float32-float64)

**Example**

Query:

```sql
SELECT ngramDistanceUTF8('abcde','cde');
```

Result:

```response
0.5
```

## ngramDistanceCaseInsensitiveUTF8

Provides a case-insensitive variant of [ngramDistanceUTF8](#ngramdistanceutf8).

**Syntax**

```sql
ngramDistanceCaseInsensitiveUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; values closer to 0 mean more similar strings. [Float32](../data-types/float.md/#float32-float64)

**Example**

Query:

```sql
SELECT ngramDistanceCaseInsensitiveUTF8('abcde','CDE');
```

Result:

```response
0.5
```

## ngramSearch

Like `ngramDistance` but calculates the non-symmetric difference between a `needle` string and a `haystack` string, i.e. the number of n-grams from the needle minus the common number of n-grams normalized by the number of `needle` n-grams. Returns a [Float32](../data-types/float.md/#float32-float64) between 0 and 1. The bigger the result is, the more likely `needle` is in the `haystack`. This function is useful for fuzzy string search. Also see function [`soundex`](../../sql-reference/functions/string-functions#soundex).

Functions [`ngramSearchCaseInsensitive`](#ngramsearchcaseinsensitive), [`ngramSearchUTF8`](#ngramsearchutf8), [`ngramSearchCaseInsensitiveUTF8`](#ngramsearchcaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
ngramSearch(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

**Implementation details**

:::note
The UTF-8 variants use the 3-gram distance. These are not perfectly fair n-gram distances: n-grams are hashed with 2-byte hashes and the (non-)symmetric difference is then calculated between the resulting hash tables, so collisions may occur. The case-insensitive UTF-8 variants do not use a fair `tolower` function. Instead, the 5th bit (counting from zero) of each code point byte is zeroed and, if the code point takes more than one byte, so is the first bit of the zeroth byte. This works for Latin letters and for most Cyrillic letters.
:::

**Example**

Query:

```sql
SELECT ngramSearch('Hello World','World Hello');
```

Result:

```response
0.5
```
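
As a rough illustration of the fuzzy-search use case, the query below ranks a few candidate strings against a misspelled needle. The candidate list and the needle are invented for this sketch, and the exact scores depend on the n-gram hashing described above, so no output is shown.

```sql
SELECT
    name,
    ngramSearch(name, 'ClickHose') AS score -- needle is deliberately misspelled
FROM
(
    -- Hypothetical candidate list; in practice this would be a table column.
    SELECT arrayJoin(['ClickHouse', 'PostgreSQL', 'MySQL']) AS name
)
ORDER BY score DESC;
```

Candidates that share more n-grams with the needle receive a higher `score`, so the closest ones sort first.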

## ngramSearchCaseInsensitive

Provides a case-insensitive variant of [ngramSearch](#ngramsearch).

**Syntax**

```sql
ngramSearchCaseInsensitive(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchCaseInsensitive('Hello World','hello');
```

Result:

```response
1
```

## ngramSearchUTF8

Provides a UTF-8 variant of [ngramSearch](#ngramsearch) in which `needle` and `haystack` are assumed to be UTF-8 encoded strings.

**Syntax**

```sql
ngramSearchUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchUTF8('абвгдеёжз', 'гдеёзд');
```

Result:

```response
0.5
```

## ngramSearchCaseInsensitiveUTF8

Provides a case-insensitive variant of [ngramSearchUTF8](#ngramsearchutf8) in which `needle` and `haystack` are assumed to be UTF-8 encoded strings.

**Syntax**

```sql
ngramSearchCaseInsensitiveUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchCaseInsensitiveUTF8('абвГДЕёжз', 'АбвгдЕЁжз');
```

Result:

```response
0.57142854
```

## countSubstrings

Returns how often a substring `needle` occurs in a string `haystack`.

Functions [`countSubstringsCaseInsensitive`](#countsubstringscaseinsensitive) and [`countSubstringsCaseInsensitiveUTF8`](#countsubstringscaseinsensitiveutf8) provide case-insensitive and case-insensitive + UTF-8 variants of this function respectively.

**Syntax**

``` sql
countSubstrings(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

``` sql
SELECT countSubstrings('aaaa', 'aa');
```

Result:

``` text
┌─countSubstrings('aaaa', 'aa')─┐
│                             2 │
└───────────────────────────────┘
```

Example with `start_pos` argument:

```sql
SELECT countSubstrings('abc___abc', 'abc', 4);
```

Result:

``` text
┌─countSubstrings('abc___abc', 'abc', 4)─┐
│                                      1 │
└────────────────────────────────────────┘
```

## countSubstringsCaseInsensitive

Returns how often a substring `needle` occurs in a string `haystack`. Ignores case.

**Syntax**

``` sql
countSubstringsCaseInsensitive(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countSubstringsCaseInsensitive('AAAA', 'aa');
```

Result:

``` text
┌─countSubstringsCaseInsensitive('AAAA', 'aa')─┐
│                                            2 │
└──────────────────────────────────────────────┘
```

Example with `start_pos` argument:

Query:

```sql
SELECT countSubstringsCaseInsensitive('abc___ABC___abc', 'abc', 4);
```

Result:

``` text
┌─countSubstringsCaseInsensitive('abc___ABC___abc', 'abc', 4)─┐
│                                                           2 │
└─────────────────────────────────────────────────────────────┘
```

## countSubstringsCaseInsensitiveUTF8

Returns how often a substring `needle` occurs in a string `haystack`. Ignores case and assumes that `haystack` is a UTF-8 encoded string.

**Syntax**

``` sql
countSubstringsCaseInsensitiveUTF8(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА');
```

Result:

``` text
┌─countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА')─┐
│                                                                  4 │
└────────────────────────────────────────────────────────────────────┘
```

Example with `start_pos` argument:

Query:

```sql
SELECT countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА', 13);
```

Result:

``` text
┌─countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА', 13)─┐
│                                                                      2 │
└────────────────────────────────────────────────────────────────────────┘
```

## countMatches

Returns the number of regular expression matches for a `pattern` in a `haystack`.

**Syntax**

``` sql
countMatches(haystack, pattern)
```

**Arguments**

- `haystack` — The string to search in. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — The regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). [String](../data-types/string.md).

**Returned value**

- The number of matches. [UInt64](../data-types/int-uint.md).

**Examples**

``` sql
SELECT countMatches('foobar.com', 'o+');
```

Result:

``` text
┌─countMatches('foobar.com', 'o+')─┐
│                                2 │
└──────────────────────────────────┘
```

``` sql
SELECT countMatches('aaaa', 'aa');
```

Result:

``` text
┌─countMatches('aaaa', 'aa')─┐
│                          2 │
└────────────────────────────┘
```
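
Because `pattern` is a regular expression, `countMatches` can count occurrences that a fixed `needle` cannot describe. The following is a minimal sketch with a made-up input string; the backslash is doubled because it sits inside a SQL string literal.

```sql
-- Counts runs of digits in the haystack: '1', '22' and '333'.
SELECT countMatches('a1b22c333', '\\d+') AS digit_runs; -- expected: 3
```

Each run of digits is consumed as one match, which is why the result counts runs rather than individual digits.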

## countMatchesCaseInsensitive

Returns the number of regular expression matches for a pattern in a haystack like [`countMatches`](#countmatches) but matching ignores the case.

**Syntax**

``` sql
countMatchesCaseInsensitive(haystack, pattern)
```

**Arguments**

- `haystack` — The string to search in. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — The regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). [String](../data-types/string.md).

**Returned value**

- The number of matches. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countMatchesCaseInsensitive('AAAA', 'aa');
```

Result:

``` text
┌─countMatchesCaseInsensitive('AAAA', 'aa')─┐
│                                         2 │
└───────────────────────────────────────────┘
```

## regexpExtract

Extracts the first string in `haystack` that matches the regexp pattern and corresponds to the regex group index.

**Syntax**

``` sql
regexpExtract(haystack, pattern[, index])
```

Alias: `REGEXP_EXTRACT(haystack, pattern[, index])`.

**Arguments**

- `haystack` — String in which the regexp pattern will be matched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — Regular expression. Must be constant. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `index` – An integer greater than or equal to 0, with default 1. It selects which regex group to extract. [UInt or Int](../data-types/int-uint.md). Optional.

**Returned value**

- The string matched by the regex group selected by `index`. `pattern` may contain multiple regex groups; an `index` of 0 returns the match of the entire regular expression. [String](../data-types/string.md).

**Examples**

``` sql
SELECT
    regexpExtract('100-200', '(\\d+)-(\\d+)', 1),
    regexpExtract('100-200', '(\\d+)-(\\d+)', 2),
    regexpExtract('100-200', '(\\d+)-(\\d+)', 0),
    regexpExtract('100-200', '(\\d+)-(\\d+)');
```

Result:

``` text
┌─regexpExtract('100-200', '(\\d+)-(\\d+)', 1)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)', 2)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)', 0)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)')─┐
│ 100                                          │ 200                                          │ 100-200                                      │ 100                                       │
└──────────────────────────────────────────────┴──────────────────────────────────────────────┴──────────────────────────────────────────────┴───────────────────────────────────────────┘
```

## hasSubsequence

Returns 1 if `needle` is a subsequence of `haystack`, or 0 otherwise.
A subsequence of a string is a sequence that can be derived from the given string by deleting zero or more elements without changing the order of the remaining elements.

**Syntax**

``` sql
hasSubsequence(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if needle is a subsequence of haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequence('garbage', 'arg');
```

Result:

``` text
┌─hasSubsequence('garbage', 'arg')─┐
│                                1 │
└──────────────────────────────────┘
```
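
To make the ordering requirement concrete, the sketch below contrasts a needle whose characters occur in order with one whose characters do not. Both needles and the column aliases are chosen for illustration only.

```sql
SELECT
    hasSubsequence('garbage', 'arg') AS in_order,     -- expected: 1, 'a', 'r', 'g' occur in this order
    hasSubsequence('garbage', 'eg')  AS out_of_order; -- expected: 0, no 'g' follows the only 'e'
```

All characters of `'eg'` are present in `'garbage'`, but not in the required order, so the second call returns 0.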

## hasSubsequenceCaseInsensitive

Like [hasSubsequence](#hassubsequence) but searches case-insensitively.

**Syntax**

``` sql
hasSubsequenceCaseInsensitive(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if needle is a subsequence of haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequenceCaseInsensitive('garbage', 'ARG');
```

Result:

``` text
┌─hasSubsequenceCaseInsensitive('garbage', 'ARG')─┐
│                                               1 │
└─────────────────────────────────────────────────┘
```

## hasSubsequenceUTF8

Like [hasSubsequence](#hassubsequence) but assumes `haystack` and `needle` are UTF-8 encoded strings.

**Syntax**

``` sql
hasSubsequenceUTF8(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if needle is a subsequence of haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
select hasSubsequenceUTF8('ClickHouse - столбцовая система управления базами данных', 'система');
```

Result:

``` text
┌─hasSubsequenceUTF8('ClickHouse - столбцовая система управления базами данных', 'система')─┐
│                                                                                         1 │
└───────────────────────────────────────────────────────────────────────────────────────────┘
```

## hasSubsequenceCaseInsensitiveUTF8

Like [hasSubsequenceUTF8](#hassubsequenceutf8) but searches case-insensitively.

**Syntax**

``` sql
hasSubsequenceCaseInsensitiveUTF8(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if needle is a subsequence of haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
select hasSubsequenceCaseInsensitiveUTF8('ClickHouse - столбцовая система управления базами данных', 'СИСТЕМА');
```

Result:

``` text
┌─hasSubsequenceCaseInsensitiveUTF8('ClickHouse - столбцовая система управления базами данных', 'СИСТЕМА')─┐
│                                                                                                        1 │
└──────────────────────────────────────────────────────────────────────────────────────────────────────────┘
```

## hasToken

Returns 1 if a given token is present in a haystack, or 0 otherwise.

**Syntax**

```sql
hasToken(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by tokenbf_v1 index specialization.
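
As a hedged sketch of how the index support might be used, the statements below define a `tokenbf_v1` skipping index on a string column and then filter with `hasToken`. The table name `logs`, its columns, and the index parameters (Bloom filter size in bytes, number of hash functions, seed) are assumptions made for illustration, not recommended settings.

```sql
-- Hypothetical table; names and index parameters are illustrative only.
CREATE TABLE logs
(
    ts DateTime,
    message String,
    INDEX message_tokens message TYPE tokenbf_v1(8192, 3, 0) GRANULARITY 4
)
ENGINE = MergeTree
ORDER BY ts;

-- The token is a constant string, as required.
SELECT count() FROM logs WHERE hasToken(message, 'error');
```

Whether the index actually skips data depends on the data distribution and the chosen parameters.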

**Example**

Query:

```sql
SELECT hasToken('Hello World','Hello');
```

Result:

```response
1
```
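
The difference between token search and plain substring search can be sketched as follows. The strings and column aliases are picked for illustration.

```sql
SELECT
    hasToken('Hello World', 'Hello') AS whole_token,   -- expected: 1
    hasToken('Hello World', 'Hell')  AS partial_token, -- expected: 0, 'Hell' is only part of a token
    position('Hello World', 'Hell')  AS substring_pos; -- expected: 1, substring search still finds it
```

`hasToken` only reports a match when the needle is delimited by non-alphanumeric ASCII characters or by the boundaries of the haystack.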

## hasTokenOrNull

Returns 1 if a given token is present, 0 if not present, and null if the token is ill-formed.

**Syntax**

```sql
hasTokenOrNull(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 if it is not present, and [`null`](../data-types/nullable.md) if the token is ill-formed. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by tokenbf_v1 index specialization.

**Example**

Where `hasToken` would throw an error for an ill-formed token, `hasTokenOrNull` returns `null` for an ill-formed token.

Query:

```sql
SELECT hasTokenOrNull('Hello World','Hello,World');
```

Result:

```response
null
```

## hasTokenCaseInsensitive

Returns 1 if a given token is present in a haystack, 0 otherwise. Ignores case.

**Syntax**

```sql
hasTokenCaseInsensitive(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by tokenbf_v1 index specialization.

**Example**

Query:

```sql
SELECT hasTokenCaseInsensitive('Hello World','hello');
```

Result:

```response
1
```

## hasTokenCaseInsensitiveOrNull

Returns 1 if a given token is present in a haystack, 0 otherwise. Ignores case and returns null if the token is ill-formed.

**Syntax**

```sql
hasTokenCaseInsensitiveOrNull(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 if it is not present, and [`null`](../data-types/nullable.md) if the token is ill-formed. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by tokenbf_v1 index specialization.

**Example**

Where `hasTokenCaseInsensitive` would throw an error for an ill-formed token, `hasTokenCaseInsensitiveOrNull` returns `null` for an ill-formed token.

Query:

```sql
SELECT hasTokenCaseInsensitiveOrNull('Hello World','hello,world');
```

Result:

```response
null
```
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												add slugs

											
										
										
											2022-08-28 14:53:34 +00:00
+								slug: /en/sql-reference/functions/string-search-functions
-												Docs: Sort functions in sidebar

											
										
										
											2023-04-19 17:05:55 +00:00
+								sidebar_position: 160
-												Docs: Sidebar: Remove leading "For" from "Searching/Replacing in Strings"

											
										
										
											2023-02-27 08:13:09 +00:00
+								sidebar_label: Searching in Strings
-												Get rid of toc_en.yml (#10023)


											
										
										
											2020-04-03 13:23:32 +00:00
+								---
-												Remove H1 anchor tags from docs

											
										
										
											2022-06-02 10:55:18 +00:00
+								# Functions for Searching in Strings
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Small grammar edits to description at top of the page

											
										
										
											2024-03-28 20:54:26 +00:00
+								All functions in this section search case-sensitively by default. Case-insensitive search is usually provided by separate function variants.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												Small grammar edits to description at top of the page

											
										
										
											2024-03-28 20:54:26 +00:00
+								:::note
 								Case-insensitive search follows the lowercase-uppercase rules of the English language. E.g. Uppercased `i` in the English language is
 								`I` whereas in the Turkish language it is `İ` - results for languages other than English may be unexpected.
 								:::
-												Fix spelling mistake

											
										
										
											2024-04-08 19:55:27 +00:00
+								Functions in this section also assume that the searched string (referred to in this section as `haystack`) and the search string (referred to in this section as `needle`) are single-byte encoded text. If this assumption is
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								violated, no exception is thrown and results are undefined. Search with UTF-8 encoded strings is usually provided by separate function
 								variants. Likewise, if a UTF-8 function variant is used and the input strings are not UTF-8 encoded text, no exception is thrown and the
-												Small grammar edits to description at top of the page

											
										
										
											2024-03-28 20:54:26 +00:00
+								results are undefined. Note that no automatic Unicode normalization is performed, however you can use the
-												Standardize references to data type docs

											
										
										
											2024-05-24 03:54:16 +00:00
+								[normalizeUTF8*()](https://clickhouse.com../functions/string-functions/) functions for that.
-												Update string-search-functions.md
											
										
										
											2020-06-19 10:08:10 +00:00
-												Cleanup string replace functions

											
										
										
											2023-04-20 10:08:49 +00:00
+								[General strings functions](string-functions.md) and [functions for replacing in strings](string-replace-functions.md) are described separately.
-												new examples

											
										
										
											2021-03-22 16:30:28 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								## position
-												draft

											
										
										
											2021-02-22 09:49:49 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								Returns the position (in bytes, starting at 1) of a substring `needle` in a string `haystack`.
-												Sources for english documentation switched to Markdown.
Edit page link is fixed too for both language versions of documentation.

											
										
										
											2017-12-28 15:13:23 +00:00
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
+								**Syntax**
-												Changes in accordance with comments from the developers.

											
										
										
											2018-04-28 11:45:37 +00:00
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												developer`s comments done

											
										
										
											2021-03-30 06:15:52 +00:00
+								position(haystack, needle[, start_pos])
-												Remove trailing whitespaces from docs

											
										
										
											2021-07-29 15:20:55 +00:00
+								```
-												developer`s comments done

											
										
										
											2021-03-30 06:15:52 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								Alias:
 								- `position(needle IN haystack)`
-												++

											
										
										
											2021-03-13 18:25:06 +00:00
-												Global replacement `Parameters` to `Arguments`

											
										
										
											2021-02-15 21:22:10 +00:00
+								**Arguments**
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
-												Docs: Replace annoying three spaces in enumerations by a single space

											
										
										
											2023-04-19 15:55:29 +00:00
+								- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
-												Standardize references to data type docs

											
										
										
											2024-05-24 03:54:16 +00:00
+								- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Turn multi-line returns into a single line

											
										
										
											2024-05-24 04:42:13 +00:00
+								**Returned value**
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Standardize references to data type docs

											
										
										
											2024-05-24 03:54:16 +00:00
+								- Starting position in bytes and counting from 1, if the substring was found. [UInt64](../data-types/int-uint.md).
 								- 0, if the substring was not found. [UInt64](../data-types/int-uint.md).
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								If substring `needle` is empty, these rules apply:
 								- if no `start_pos` was specified: return `1`
 								- if `start_pos = 0`: return `1`
 								- if `start_pos >= 1` and `start_pos <= length(haystack) + 1`: return `start_pos`
 								- otherwise: return `0`
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Fixes #61051

											
										
										
											2024-03-08 11:27:09 +00:00
+								The same rules also apply to functions `locate`, `positionCaseInsensitive`, `positionUTF8` and `positionCaseInsensitiveUTF8`.
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								**Examples**
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
-												Small grammar edits to description at top of the page

											
										
										
											2024-03-28 20:54:26 +00:00
+								Query:
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` sql
-												Edit and translate to Russian

Поправил шаблоны в английской и русской версиях.

											
										
										
											2021-03-13 18:18:45 +00:00
+								SELECT position('Hello, world!', '!');
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
+								```
 								Result:
-												Normalization for en markdown (#9763)


											
										
										
											2020-03-20 10:10:48 +00:00
+								``` text
-												DOCS-57: position, positionCaseInsensitive, positionUTF8, positionCaseInsensitiveUTF8 (#9631)


											
										
										
											2020-03-13 06:33:02 +00:00
+								┌─position('Hello, world!', '!')─┐
 								│                             13 │
 								└────────────────────────────────┘
 								```
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
+								Example with `start_pos` argument:
-												Small grammar edits to description at top of the page

											
										
										
											2024-03-28 20:54:26 +00:00
+								Query:
-												Add start_pos argument for position to documentation, case insensitive tests

											
										
										
											2020-08-02 13:29:10 +00:00
+								``` sql
 								SELECT
 								    position('Hello, world!', 'o', 1),
 								    position('Hello, world!', 'o', 7)
 								```
-												refine

											
										
										
											2024-04-01 15:06:54 +00:00
+								Result:
-												Add start_pos argument for position to documentation, case insensitive tests

											
										
										
											2020-08-02 13:29:10 +00:00
+								``` text
 								┌─position('Hello, world!', 'o', 1)─┬─position('Hello, world!', 'o', 7)─┐
 								│                                 5 │                                 9 │
 								└───────────────────────────────────┴───────────────────────────────────┘
 								```

Example for `needle IN haystack` syntax:

Query:

```sql
SELECT 6 = position('/' IN s) FROM (SELECT 'Hello/World' AS s);
```

Result:

```text
┌─equals(6, position(s, '/'))─┐
│                           1 │
└─────────────────────────────┘
```

Examples with empty `needle` substring:

Query:

``` sql
SELECT
    position('abc', ''),
    position('abc', '', 0),
    position('abc', '', 1),
    position('abc', '', 2),
    position('abc', '', 3),
    position('abc', '', 4),
    position('abc', '', 5)
```

Result:

``` text
┌─position('abc', '')─┬─position('abc', '', 0)─┬─position('abc', '', 1)─┬─position('abc', '', 2)─┬─position('abc', '', 3)─┬─position('abc', '', 4)─┬─position('abc', '', 5)─┐
│                   1 │                      1 │                      1 │                      2 │                      3 │                      4 │                      0 │
└─────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┴────────────────────────┘
```

## locate

Like [position](#position) but with arguments `haystack` and `needle` switched.

The behavior of this function depends on the ClickHouse version:
- In versions < 24.3, `locate` was an alias of function `position` and accepted arguments `(haystack, needle[, start_pos])`.
- In versions >= 24.3, `locate` is an individual function (for better compatibility with MySQL) and accepts arguments `(needle, haystack[, start_pos])`. The previous behavior
  can be restored using setting [function_locate_has_mysql_compatible_argument_order = false](../../operations/settings/settings.md#function-locate-has-mysql-compatible-argument-order).

**Syntax**

``` sql
locate(needle, haystack[, start_pos])
```
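
The two argument orders can be sketched as follows. This assumes ClickHouse 24.3 or later and that the setting can be overridden per query with a `SETTINGS` clause; the column aliases are illustrative only.

```sql
-- Default in versions >= 24.3: MySQL-compatible order (needle, haystack).
SELECT locate('o', 'Hello, world!') AS needle_first;

-- Pre-24.3 order (haystack, needle), restored for this query only.
SELECT locate('Hello, world!', 'o') AS haystack_first
SETTINGS function_locate_has_mysql_compatible_argument_order = false;
```

Under these assumptions both queries should return 5, the same value as `position('Hello, world!', 'o')`.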

## positionCaseInsensitive

A case-insensitive variant of [position](#position).

**Example**

Query:

``` sql
SELECT positionCaseInsensitive('Hello, world!', 'hello');
```

Result:

``` text
┌─positionCaseInsensitive('Hello, world!', 'hello')─┐
│                                                 1 │
└───────────────────────────────────────────────────┘
```

## positionUTF8

Like [position](#position) but assumes `haystack` and `needle` are UTF-8 encoded strings.

**Examples**

Function `positionUTF8` correctly counts character `ö` (encoded as two bytes in UTF-8) as a single Unicode codepoint:

Query:

``` sql
SELECT positionUTF8('Motörhead', 'r');
```

Result:

``` text
┌─positionUTF8('Motörhead', 'r')─┐
│                              5 │
└────────────────────────────────┘
```
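
To make the difference from the byte-oriented `position` concrete, the two functions can be compared on the same input. This is a sketch that assumes the query text reaches the server as UTF-8; the column aliases are illustrative.

```sql
SELECT
    position('Motörhead', 'r')     AS byte_position,      -- counts bytes, so 'ö' contributes 2
    positionUTF8('Motörhead', 'r') AS codepoint_position; -- counts Unicode code points, so 'ö' contributes 1
```

With UTF-8 input the first column should be 6 and the second 5.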

## positionCaseInsensitiveUTF8

Like [positionUTF8](#positionutf8) but searches case-insensitively.
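
A minimal sketch of a call, reusing the string from the `positionUTF8` example and again assuming the query text is sent as UTF-8 (the column alias is illustrative):

```sql
-- The needle is uppercase 'R' while the haystack contains lowercase 'r'.
SELECT positionCaseInsensitiveUTF8('Motörhead', 'R') AS codepoint_position;
```

Because case is ignored and positions are counted in code points, this is expected to return 5, the same value as `positionUTF8('Motörhead', 'r')`.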

## multiSearchAllPositions

Like [position](#position) but returns an array of positions (in bytes, starting at 1) for multiple `needle` substrings in a `haystack` string.

:::note
All `multiSearch*()` functions only support up to 2<sup>8</sup> needles.
:::

**Syntax**

``` sql
multiSearchAllPositions(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array with one element per needle: the starting position of that needle in bytes, counting from 1, if it was found.
- 0 in the corresponding array element, if the needle was not found.

**Example**

Query:

``` sql
SELECT multiSearchAllPositions('Hello, World!', ['hello', '!', 'world']);
```

Result:

``` text
┌─multiSearchAllPositions('Hello, World!', ['hello', '!', 'world'])─┐
│ [0,13,0]                                                          │
└───────────────────────────────────────────────────────────────────┘
```
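
The zeros in this result come from case-sensitive matching. As a sketch (not one of the original examples), the same needles can be run through `multiSearchAllPositionsCaseInsensitive`, which is described below; the column aliases are illustrative.

```sql
SELECT
    multiSearchAllPositions('Hello, World!', ['hello', '!', 'world'])                AS case_sensitive,
    multiSearchAllPositionsCaseInsensitive('Hello, World!', ['hello', '!', 'world']) AS case_insensitive;
```

The first column should be `[0,13,0]` as above, and the second `[1,13,8]`, since `hello` then matches at byte 1 and `world` at byte 8.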

## multiSearchAllPositionsCaseInsensitive

Like [multiSearchAllPositions](#multisearchallpositions) but ignores case.

**Syntax**

```sql
multiSearchAllPositionsCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array with one element per needle: the starting position of that needle in bytes, counting from 1, if it was found.
- 0 in the corresponding array element, if the needle was not found.

**Example**

Query:

```sql
SELECT multiSearchAllPositionsCaseInsensitive('ClickHouse',['c','h']);
```

Result:

```response
["1","6"]
```

## multiSearchAllPositionsUTF8

Like [multiSearchAllPositions](#multisearchallpositions) but assumes `haystack` and the `needle` substrings are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchAllPositionsUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — UTF-8 encoded string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 encoded substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array with one element per needle: the starting position of that needle in Unicode code points, counting from 1, if it was found.
- 0 in the corresponding array element, if the needle was not found.

**Example**

Given `ClickHouse` as a UTF-8 string, find the positions of `C` (`\x43`) and `H` (`\x48`).

Query:

```sql
SELECT multiSearchAllPositionsUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x43','\x48']);
```

Result:

```response
["1","6"]
```

## multiSearchAllPositionsCaseInsensitiveUTF8

Like [multiSearchAllPositionsUTF8](#multisearchallpositionsutf8) but ignores case.

**Syntax**

```sql
multiSearchAllPositionsCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — UTF-8 encoded string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 encoded substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Array with one element per needle: the starting position of that needle in Unicode code points, counting from 1, if it was found.
- 0 in the corresponding array element, if the needle was not found.

**Example**

Given `ClickHouse` as a UTF-8 string, find the positions of `c` (`\x63`) and `h` (`\x68`).

Query:

```sql
SELECT multiSearchAllPositionsCaseInsensitiveUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x63','\x68']);
```

Result:

```response
["1","6"]
```

## multiSearchFirstPosition

Like [`position`](#position) but returns the leftmost offset in a `haystack` string which matches any of multiple `needle` strings.

Functions [`multiSearchFirstPositionCaseInsensitive`](#multisearchfirstpositioncaseinsensitive), [`multiSearchFirstPositionUTF8`](#multisearchfirstpositionutf8) and [`multiSearchFirstPositionCaseInsensitiveUTF8`](#multisearchfirstpositioncaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchFirstPosition(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Query:

```sql
SELECT multiSearchFirstPosition('Hello World',['llo', 'Wor', 'ld']);
```

Result:

```response
3
```
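
Two further cases, sketched here as an illustration rather than taken from the original examples: the returned offset depends on where the needles occur in `haystack`, not on their order in the array, and 0 is returned when no needle matches. The column aliases are illustrative.

```sql
SELECT
    multiSearchFirstPosition('Hello World', ['ld', 'Wor', 'llo']) AS reordered_needles, -- 'llo' starts at byte 3
    multiSearchFirstPosition('Hello World', ['xyz', 'abc'])       AS no_match;          -- neither needle occurs
```

The first column should be 3 and the second 0.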

## multiSearchFirstPositionCaseInsensitive

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but ignores case.

**Syntax**

```sql
multiSearchFirstPositionCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Query:

```sql
SELECT multiSearchFirstPositionCaseInsensitive('HELLO WORLD',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstPositionUTF8

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but assumes `haystack` and `needle` to be UTF-8 strings.

**Syntax**

```sql
multiSearchFirstPositionUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings.
- 0, if there was no match.

**Example**

Find the leftmost offset in UTF-8 string `hello world` which matches any of the given needles.

Query:

```sql
SELECT multiSearchFirstPositionUTF8('\x68\x65\x6c\x6c\x6f\x20\x77\x6f\x72\x6c\x64',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstPositionCaseInsensitiveUTF8

Like [`multiSearchFirstPosition`](#multisearchfirstposition) but assumes `haystack` and `needle` to be UTF-8 strings and ignores case.

**Syntax**

```sql
multiSearchFirstPositionCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Leftmost offset in a `haystack` string which matches any of multiple `needle` strings, ignoring case.
- 0, if there was no match.

**Example**

Find the leftmost offset in UTF-8 string `HELLO WORLD` which matches any of the given needles.

Query:

```sql
SELECT multiSearchFirstPositionCaseInsensitiveUTF8('\x48\x45\x4c\x4c\x4f\x20\x57\x4f\x52\x4c\x44',['wor', 'ld', 'ello']);
```

Result:

```response
2
```

## multiSearchFirstIndex

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise.

Functions [`multiSearchFirstIndexCaseInsensitive`](#multisearchfirstindexcaseinsensitive), [`multiSearchFirstIndexUTF8`](#multisearchfirstindexutf8) and [`multiSearchFirstIndexCaseInsensitiveUTF8`](#multisearchfirstindexcaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchFirstIndex(haystack, [needle1, needle2, ..., needleN])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Query:

```sql
SELECT multiSearchFirstIndex('Hello World',['World','Hello']);
```

Result:

```response

```

## multiSearchFirstIndexCaseInsensitive

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Ignores case.

**Syntax**

```sql
multiSearchFirstIndexCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Query:

```sql
SELECT multiSearchFirstIndexCaseInsensitive('hElLo WoRlD',['World','Hello']);
```

Result:

```response

```

## multiSearchFirstIndexUTF8

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Assumes `haystack` and `needle` are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchFirstIndexUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Given `Hello World` as a UTF-8 string, find the first index of UTF-8 strings `Hello` and `World`.

Query:

```sql
SELECT multiSearchFirstIndexUTF8('\x48\x65\x6c\x6c\x6f\x20\x57\x6f\x72\x6c\x64',['\x57\x6f\x72\x6c\x64','\x48\x65\x6c\x6c\x6f']);
```

Result:

```response

```

## multiSearchFirstIndexCaseInsensitiveUTF8

Returns the index `i` (starting from 1) of the leftmost found needle<sub>i</sub> in the string `haystack` and 0 otherwise. Assumes `haystack` and `needle` are UTF-8 encoded strings. Ignores case.

**Syntax**

```sql
multiSearchFirstIndexCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Array of UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- Index (starting from 1) of the leftmost found needle, or 0 if there was no match. [UInt8](../data-types/int-uint.md).

**Example**

Given `HELLO WORLD` as a UTF-8 string, find the first index of UTF-8 strings `hello` and `world`.

Query:

```sql
SELECT multiSearchFirstIndexCaseInsensitiveUTF8('\x48\x45\x4c\x4c\x4f\x20\x57\x4f\x52\x4c\x44',['\x68\x65\x6c\x6c\x6f','\x77\x6f\x72\x6c\x64']);
```

Result:

```response
1
```

## multiSearchAny

Returns 1 if at least one string needle<sub>i</sub> matches the string `haystack`, and 0 otherwise.

Functions [`multiSearchAnyCaseInsensitive`](#multisearchanycaseinsensitive), [`multiSearchAnyUTF8`](#multisearchanyutf8) and [`multiSearchAnyCaseInsensitiveUTF8`](#multisearchanycaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
multiSearchAny(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one match.
- 0, if there was no match.

**Example**

Query:

```sql
SELECT multiSearchAny('ClickHouse',['C','H']);
```

Result:

```response
1
```

## multiSearchAnyCaseInsensitive

Like [multiSearchAny](#multisearchany) but ignores case.

**Syntax**

```sql
multiSearchAnyCaseInsensitive(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one case-insensitive match.
- 0, if there was no case-insensitive match.

**Example**

Query:

```sql
SELECT multiSearchAnyCaseInsensitive('ClickHouse',['c','h']);
```

Result:

```response
1
```

## multiSearchAnyUTF8

Like [multiSearchAny](#multisearchany) but assumes `haystack` and the `needle` substrings are UTF-8 encoded strings.

**Syntax**

```sql
multiSearchAnyUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one match.
- 0, if there was no match.

**Example**

Given `ClickHouse` as a UTF-8 string, check if there are any `C` (`\x43`) or `H` (`\x48`) letters in the word.

Query:

```sql
SELECT multiSearchAnyUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x43','\x48']);
```

Result:

```response
1
```

## multiSearchAnyCaseInsensitiveUTF8

Like [multiSearchAnyUTF8](#multisearchanyutf8) but ignores case.

**Syntax**

```sql
multiSearchAnyCaseInsensitiveUTF8(haystack, [needle1, needle2, ..., needleN])
```

**Parameters**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — UTF-8 substrings to be searched. [Array](../data-types/array.md).

**Returned value**

- 1, if there was at least one case-insensitive match.
- 0, if there was no case-insensitive match.

**Example**

Given `ClickHouse` as a UTF-8 string, check if there is any letter `h` (`\x68`) in the word, ignoring case.

Query:

```sql
SELECT multiSearchAnyCaseInsensitiveUTF8('\x43\x6c\x69\x63\x6b\x48\x6f\x75\x73\x65',['\x68']);
```

Result:

```response
1
```

## match {#match}

Returns whether string `haystack` matches the regular expression `pattern` in [re2 regular expression syntax](https://github.com/google/re2/wiki/Syntax).

Matching is based on UTF-8, e.g. `.` matches the Unicode code point `¥` which is represented in UTF-8 using two bytes. The regular
expression must not contain null bytes. If the haystack or the pattern is not valid UTF-8, the behavior is undefined.

Unlike re2's default behavior, `.` matches line breaks. To disable this, prepend the pattern with `(?-s)`.

If you only want to search for substrings in a string, you can use the functions [like](#like) or [position](#position) instead. They are much faster than this function.

**Syntax**

```sql
match(haystack, pattern)
```

Alias: `haystack REGEXP pattern` operator
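
**Example**

A minimal sketch of the behavior described above. The input strings are illustrative assumptions rather than examples taken from the ClickHouse sources. The first column checks a pattern anchored at the start of the haystack; the second and third contrast the default handling of line breaks by `.` with the `(?-s)` prefix.

Query:

```sql
SELECT
    match('Hello World', '^Hello') AS starts_with_hello,    -- pattern anchored at the start of the haystack
    match('a\nb', 'a.b') AS dot_spans_line_break,            -- by default `.` is expected to match the line break
    match('a\nb', '(?-s)a.b') AS dot_stops_at_line_break;    -- `(?-s)` should stop `.` from matching the line break
```

Going by the description, the three columns should come out as 1, 1 and 0.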

## multiMatchAny

Like `match` but returns 1 if at least one of the patterns matches and 0 otherwise.

:::note
Functions in the `multi[Fuzzy]Match*()` family use the [Vectorscan](https://github.com/VectorCamp/vectorscan) library. As such, they are only enabled if ClickHouse is compiled with support for vectorscan.

To turn off all functions that use hyperscan, use setting `SET allow_hyperscan = 0;`.

Due to restrictions of vectorscan, the length of the `haystack` string must be less than 2<sup>32</sup> bytes.

Hyperscan is generally vulnerable to regular expression denial of service (ReDoS) attacks (e.g. see
[here](https://www.usenix.org/conference/usenixsecurity22/presentation/turonova), [here](https://doi.org/10.1007/s10664-021-10033-1) and
[here](https://doi.org/10.1145/3236024.3236027)). Users are advised to check the provided patterns carefully.
:::

If you only want to search for multiple substrings in a string, you can use function [multiSearchAny](#multisearchany) instead. It is much faster than this function.

**Syntax**

```sql
multiMatchAny(haystack, [pattern1, pattern2, ..., patternN])
```
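
**Example**

A small illustrative query, assuming a ClickHouse build with vectorscan support. The haystack and patterns are made up for this sketch: only the first pattern matches, which by the description above is enough for the function to return 1.

Query:

```sql
SELECT multiMatchAny('ClickHouse', ['^Click', 'xyz$']) AS any_pattern_matches;
-- '^Click' matches the start of the haystack; 'xyz$' does not match anywhere
```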

## multiMatchAnyIndex

Like `multiMatchAny` but returns the index of any pattern that matches the haystack.

**Syntax**

```sql
multiMatchAnyIndex(haystack, [pattern1, pattern2, ..., patternN])
```
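
**Example**

A hedged sketch with made-up inputs, again assuming vectorscan support is compiled in. Only one pattern matches here, so the returned index does not depend on which match the library reports first; with several matching patterns, any of their indices may be returned.

Query:

```sql
SELECT multiMatchAnyIndex('ClickHouse', ['xyz', 'House$']) AS matching_index;
-- only the second pattern matches, so the expected index is 2
```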

## multiMatchAllIndices

Like `multiMatchAny` but returns an array of the indices of all patterns that match the haystack, in any order.

**Syntax**

```sql
multiMatchAllIndices(haystack, [pattern1, pattern2, ..., patternN])
```
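
**Example**

Because the indices come back in any order, one possible approach is to wrap the call in `arraySort` when a stable result is needed. The inputs below are illustrative assumptions, and the query presumes a build with vectorscan support.

Query:

```sql
SELECT arraySort(multiMatchAllIndices('ClickHouse', ['^Click', 'xyz', 'House$'])) AS matching_indices;
-- patterns 1 and 3 match, pattern 2 does not; sorting makes the order deterministic
```

With these inputs the sorted array is expected to be `[1,3]`.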

## multiFuzzyMatchAny

Like `multiMatchAny` but returns 1 if any pattern matches the haystack within a constant [edit distance](https://en.wikipedia.org/wiki/Edit_distance). This function relies on the experimental feature of the [hyperscan](https://intel.github.io/hyperscan/dev-reference/compilation.html#approximate-matching) library, and can be slow for some corner cases. The performance depends on the edit distance value and the patterns used, but it is always more expensive than the non-fuzzy variants.

:::note
The `multiFuzzyMatch*()` function family does not support UTF-8 regular expressions (it treats them as a sequence of bytes) due to restrictions of hyperscan.
:::

**Syntax**

```sql
multiFuzzyMatchAny(haystack, distance, [pattern1, pattern2, ..., patternN])
```
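
**Example**

A rough sketch of fuzzy matching, with inputs invented for illustration and assuming vectorscan support is available. The pattern `ClickHose` differs from the haystack by one missing character, so with `distance` set to 1 it should count as a match, while the same pattern with `distance` 0 should not.

Query:

```sql
SELECT
    multiFuzzyMatchAny('ClickHouse', 1, ['ClickHose']) AS within_one_edit,  -- one inserted 'u' turns the pattern into the haystack
    multiFuzzyMatchAny('ClickHouse', 0, ['ClickHose']) AS exact_only;       -- distance 0 allows no edits
```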

## multiFuzzyMatchAnyIndex

Like `multiFuzzyMatchAny` but returns the index of any pattern that matches the haystack within a constant edit distance.

**Syntax**

```sql
multiFuzzyMatchAnyIndex(haystack, distance, [pattern1, pattern2, ..., patternN])
```

## multiFuzzyMatchAllIndices

Like `multiFuzzyMatchAny` but returns an array with the indices, in any order, of all patterns that match the haystack within a constant edit distance.

**Syntax**

```sql
multiFuzzyMatchAllIndices(haystack, distance, [pattern1, pattern2, ..., patternN])
```
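
The three fuzzy variants take the same arguments and differ only in what they return. The following is a minimal sketch with made-up inputs; the haystack, the patterns and the column aliases are illustrative assumptions, and it presumes a server build in which the multi-pattern regex functions are available.

```sql
-- Hypothetical inputs: one haystack, two patterns, edit distance of at most 1.
SELECT
    multiFuzzyMatchAny('ClickHouse', 1, ['ClickHose', 'Mouse'])        AS any_match,   -- 0 or 1
    multiFuzzyMatchAnyIndex('ClickHouse', 1, ['ClickHose', 'Mouse'])   AS any_index,   -- index of one matching pattern
    multiFuzzyMatchAllIndices('ClickHouse', 1, ['ClickHose', 'Mouse']) AS all_indices; -- array of indices, order not guaranteed
```

Because `multiFuzzyMatchAllIndices` returns the indices in any order, wrap the result in `arraySort` if a stable order matters.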

## extract

Extracts a fragment of a string using a regular expression. If `haystack` does not match the `pattern` regex, an empty string is returned.

For a regex without subpatterns, the function returns the fragment that matches the entire regex. Otherwise, it returns the fragment that matches the first subpattern.

**Syntax**

```sql
extract(haystack, pattern)
```
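
To illustrate the subpattern rule, the sketch below runs `extract` on made-up input strings. The inputs and column aliases are illustrative assumptions; the values in the comments follow from the description above.

```sql
SELECT
    -- No subpattern: the fragment matching the whole regex is returned.
    extract('number: 1, number: 2, number: 3', '\\d+') AS whole_match,  -- '1'
    -- One subpattern: only the fragment matching the first group is returned.
    extract('key=value', '(\\w+)=\\w+')                AS first_group,  -- 'key'
    -- No match at all: an empty string is returned.
    extract('no digits here', '\\d+')                  AS no_match;     -- ''
```

Note the doubled backslashes: the string literal `'\\d+'` reaches the regex engine as `\d+`.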

## extractAll

Extracts all fragments of a string using a regular expression. If `haystack` does not match the `pattern` regex, an empty array is returned.

Returns an array of strings consisting of all matches of the regex.

The behavior with respect to subpatterns is the same as in function `extract`.

**Syntax**

```sql
extractAll(haystack, pattern)
```
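
A short sketch of how `extractAll` differs from `extract`, using made-up inputs (the strings and aliases are illustrative assumptions):

```sql
SELECT
    -- Every match of the whole regex is collected.
    extractAll('number: 1, number: 2, number: 3', '\\d+') AS all_numbers,   -- ['1','2','3']
    -- With a subpattern, the first group of every match is collected.
    extractAll('a=1, b=2', '(\\w)=\\d')                   AS first_groups,  -- ['a','b']
    -- No match: the array is empty.
    extractAll('no digits here', '\\d+')                  AS no_match;      -- []
```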

## extractAllGroupsHorizontal

Matches all groups of the `haystack` string using the `pattern` regular expression. Returns an array of arrays, where the first array includes all fragments matching the first group, the second array includes all fragments matching the second group, and so on.

This function is slower than [extractAllGroupsVertical](#extractallgroupsvertical).

**Syntax**

``` sql
extractAllGroupsHorizontal(haystack, pattern)
```

**Arguments**

- `haystack` — Input string. [String](../data-types/string.md).
- `pattern` — Regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). Must contain groups, each group enclosed in parentheses. If `pattern` contains no groups, an exception is thrown. [String](../data-types/string.md).

**Returned value**

- Array of arrays of matches. [Array](../data-types/array.md).

:::note
If `haystack` does not match the `pattern` regex, an array of empty arrays is returned.
:::

**Example**

``` sql
SELECT extractAllGroupsHorizontal('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)');
```

Result:

``` text
┌─extractAllGroupsHorizontal('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)')─┐
│ [['abc','def','ghi'],['111','222','333']]                                                │
└──────────────────────────────────────────────────────────────────────────────────────────┘
```

## extractAllGroupsVertical

Matches all groups of the `haystack` string using the `pattern` regular expression. Returns an array of arrays, where each array includes matching fragments from every group. Fragments are grouped in order of appearance in the `haystack`.

**Syntax**

``` sql
extractAllGroupsVertical(haystack, pattern)
```

**Arguments**

- `haystack` — Input string. [String](../data-types/string.md).
- `pattern` — Regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). Must contain groups, each group enclosed in parentheses. If `pattern` contains no groups, an exception is thrown. [String](../data-types/string.md).

**Returned value**

- Array of arrays of matches. [Array](../data-types/array.md).

:::note
If `haystack` does not match the `pattern` regex, an empty array is returned.
:::

**Example**

``` sql
SELECT extractAllGroupsVertical('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)');
```

Result:

``` text
┌─extractAllGroupsVertical('abc=111, def=222, ghi=333', '("[^"]+"|\\w+)=("[^"]+"|\\w+)')─┐
│ [['abc','111'],['def','222'],['ghi','333']]                                            │
└────────────────────────────────────────────────────────────────────────────────────────┘
```

## like

Returns whether string `haystack` matches the LIKE expression `pattern`.

A LIKE expression can contain normal characters and the following metacharacters:

- `%` matches an arbitrary number of arbitrary characters (including zero characters).
- `_` matches a single arbitrary character.
- `\` escapes a literal `%`, `_` or `\`.

Matching is based on UTF-8, e.g. `_` matches the Unicode code point `¥`, which is represented in UTF-8 using two bytes.

If the haystack or the LIKE expression is not valid UTF-8, the behavior is undefined.

No automatic Unicode normalization is performed; you can use the [normalizeUTF8*()](string-functions.md) functions for that.

To match against literal `%`, `_` and `\` (which are LIKE metacharacters), prefix them with a backslash: `\%`, `\_` and `\\`.
The backslash loses its special meaning (i.e. is interpreted literally) if it precedes a character other than `%`, `_` or `\`.
Note that ClickHouse requires backslashes in strings [to be escaped as well](../syntax.md#string), so you would actually need to write `\\%`, `\\_` and `\\\\`.

For LIKE expressions of the form `%needle%`, the function is as fast as the `position` function.
All other LIKE expressions are internally converted to a regular expression and executed with a performance similar to function `match`.

**Syntax**

```sql
like(haystack, pattern)
```

Alias: `haystack LIKE pattern` (operator)
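
The metacharacter and escaping rules can be sketched with a few made-up strings. The inputs and aliases are illustrative assumptions; the values in the comments follow from the rules described above.

```sql
SELECT
    like('ClickHouse', 'Click%')   AS prefix_match,      -- 1: % matches 'House'
    'ClickHouse' LIKE 'C_ickHouse' AS single_char,       -- 1: _ matches 'l'
    '¥100' LIKE '_100'             AS utf8_char,         -- 1: _ matches the two-byte code point ¥
    '100%' LIKE '100\\%'           AS literal_percent,   -- 1: \% matches a literal %
    '1000' LIKE '100\\%'           AS escaped_no_match;  -- 0: the escaped % is not a wildcard
```

The pattern is written as `'100\\%'` because the backslash must itself be escaped inside the string literal; the LIKE engine then sees `100\%`.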

## notLike {#notlike}

Like `like` but negates the result.

Alias: `haystack NOT LIKE pattern` (operator)

## ilike

Like `like` but searches case-insensitively.

Alias: `haystack ILIKE pattern` (operator)

## notILike

Like `ilike` but negates the result.

Alias: `haystack NOT ILIKE pattern` (operator)
-												DOCSUP-3478: Documented the iLike function (#15880)

* Description of the iLike function

Добавил описание функции iLike и добавил оператор ILIKE.

* Update string-search-functions.md

Changed by comments.

* Update and translation ilike function and ILIKE operator..

Внес поправки в английскую версию и сделал перевод на русский язык.

Co-authored-by: Dmitriy <sevirov@yandex-team.ru>
											
										
										
											2020-10-19 15:32:09 +00:00
-												Clean up string search function docs

											
										
										
											2023-04-20 09:30:11 +00:00
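
As a minimal illustration (the literal strings are made up for this sketch), the function form and the operator form of the negated case-insensitive match can be compared side by side. Because the match ignores case, the pattern `%house%` matches `ClickHouse`, so both negated forms are expected to return `0`:

```sql
-- Function form and operator form of the negated, case-insensitive match.
SELECT
    notILike('ClickHouse', '%house%') AS fn_form,
    'ClickHouse' NOT ILIKE '%house%' AS op_form;
```
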
## ngramDistance

Calculates the 4-gram distance between a `haystack` string and a `needle` string. For this, it counts the symmetric difference between two multisets of 4-grams and normalizes it by the sum of their cardinalities. Returns a [Float32](../data-types/float.md/#float32-float64) between 0 and 1. The smaller the result is, the more similar the strings are to each other.

Functions [`ngramDistanceCaseInsensitive`](#ngramdistancecaseinsensitive), [`ngramDistanceUTF8`](#ngramdistanceutf8), [`ngramDistanceCaseInsensitiveUTF8`](#ngramdistancecaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
ngramDistance(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; the smaller the value, the more similar the strings. [Float32](../data-types/float.md/#float32-float64)

**Implementation details**

This function throws an exception if constant `needle` or `haystack` arguments are more than 32Kb in size. If any non-constant `haystack` or `needle` arguments are more than 32Kb in size, then the distance is always 1.

**Examples**

The more similar two strings are to each other, the closer the result will be to 0 (identical).

Query:

```sql
SELECT ngramDistance('ClickHouse','ClickHouse!');
```

Result:

```response
0.06666667
```

The less similar two strings are to each other, the larger the result will be.

Query:

```sql
SELECT ngramDistance('ClickHouse','House');
```

Result:

```response
0.5555556
```
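
One possible use, sketched here with a hypothetical list of candidate strings, is ranking values by their distance to a reference string. `arrayJoin` only serves to produce one row per candidate; in practice the candidates would come from a table column:

```sql
-- Rank candidate strings by 4-gram distance to 'ClickHouse' (closest first).
-- The candidate list is illustrative only.
SELECT
    candidate,
    ngramDistance(candidate, 'ClickHouse') AS distance
FROM
(
    SELECT arrayJoin(['ClickHouse!', 'House', 'ClickHouse']) AS candidate
)
ORDER BY distance ASC;
```

The exact match should sort first, since identical strings have a distance of 0.
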
## ngramDistanceCaseInsensitive

Provides a case-insensitive variant of [ngramDistance](#ngramdistance).

**Syntax**

```sql
ngramDistanceCaseInsensitive(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; the smaller the value, the more similar the strings. [Float32](../data-types/float.md/#float32-float64)

**Examples**

With [ngramDistance](#ngramdistance), differences in case affect the result:

Query:

```sql
SELECT ngramDistance('ClickHouse','clickhouse');
```

Result:

```response
0.71428573
```

With [ngramDistanceCaseInsensitive](#ngramdistancecaseinsensitive), case is ignored, so two strings that differ only in case are treated as identical and yield the minimum distance:

Query:

```sql
SELECT ngramDistanceCaseInsensitive('ClickHouse','clickhouse');
```

Result:

```response
0
```
## ngramDistanceUTF8

Provides a UTF-8 variant of [ngramDistance](#ngramdistance). Assumes that `needle` and `haystack` strings are UTF-8 encoded strings.

**Syntax**

```sql
ngramDistanceUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; the smaller the value, the more similar the strings. [Float32](../data-types/float.md/#float32-float64)

**Example**

Query:

```sql
SELECT ngramDistanceUTF8('abcde','cde');
```

Result:

```response
0.5
```

## ngramDistanceCaseInsensitiveUTF8

Provides a case-insensitive variant of [ngramDistanceUTF8](#ngramdistanceutf8).

**Syntax**

```sql
ngramDistanceCaseInsensitiveUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the distance between the two strings; the smaller the value, the more similar the strings. [Float32](../data-types/float.md/#float32-float64)

**Example**

Query:

```sql
SELECT ngramDistanceCaseInsensitiveUTF8('abcde','CDE');
```

Result:

```response
0.5
```
## ngramSearch

Like `ngramDistance` but calculates the non-symmetric difference between a `needle` string and a `haystack` string, i.e. the number of n-grams from the needle minus the number of n-grams common to both strings, normalized by the number of `needle` n-grams. The function returns one minus this ratio as a [Float32](../data-types/float.md/#float32-float64) between 0 and 1. The bigger the result is, the more likely `needle` is in the `haystack`. This function is useful for fuzzy string search. Also see function [`soundex`](../../sql-reference/functions/string-functions#soundex).

Functions [`ngramSearchCaseInsensitive`](#ngramsearchcaseinsensitive), [`ngramSearchUTF8`](#ngramsearchutf8), [`ngramSearchCaseInsensitiveUTF8`](#ngramsearchcaseinsensitiveutf8) provide case-insensitive and/or UTF-8 variants of this function.

**Syntax**

```sql
ngramSearch(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

**Implementation details**

:::note
The UTF-8 variants use the 3-gram distance. These are not perfectly fair n-gram distances: n-grams are hashed with 2-byte hashes and the (non-)symmetric difference is calculated between the resulting hash tables, so collisions may occur. The UTF-8 case-insensitive variants do not use a fair `tolower` function. Instead, the 5th bit (counting from zero) of each code point byte is zeroed, as is the first bit of the zeroth byte if the code point has more than one byte. This works for Latin and for most Cyrillic letters.
:::

**Example**

Query:

```sql
SELECT ngramSearch('Hello World','World Hello');
```

Result:

```response
0.5
```
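
For fuzzy search, the score can be used as a filter. The following is only a sketch: the `title` values are hypothetical, and the threshold `0.4` is an arbitrary assumption that would need tuning for real data:

```sql
-- Keep only rows whose n-gram search score exceeds an (arbitrary) threshold.
-- The titles and the 0.4 cut-off are illustrative assumptions.
SELECT
    title,
    ngramSearch(title, 'World Hello') AS score
FROM
(
    SELECT arrayJoin(['Hello World', 'Goodbye', 'World Hello']) AS title
)
WHERE score > 0.4
ORDER BY score DESC;
```
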
## ngramSearchCaseInsensitive

Provides a case-insensitive variant of [ngramSearch](#ngramsearch).

**Syntax**

```sql
ngramSearchCaseInsensitive(haystack, needle)
```

**Parameters**

- `haystack`: First comparison string. [String literal](../syntax#string)
- `needle`: Second comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchCaseInsensitive('Hello World','hello');
```

Result:

```response
1
```
## ngramSearchUTF8

Provides a UTF-8 variant of [ngramSearch](#ngramsearch) in which `needle` and `haystack` are assumed to be UTF-8 encoded strings.

**Syntax**

```sql
ngramSearchUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchUTF8('абвгдеёжз', 'гдеёзд');
```

Result:

```response
0.5
```
## ngramSearchCaseInsensitiveUTF8

Provides a case-insensitive variant of [ngramSearchUTF8](#ngramsearchutf8) in which `needle` and `haystack` are assumed to be UTF-8 encoded strings.

**Syntax**

```sql
ngramSearchCaseInsensitiveUTF8(haystack, needle)
```

**Parameters**

- `haystack`: First UTF-8 encoded comparison string. [String literal](../syntax#string)
- `needle`: Second UTF-8 encoded comparison string. [String literal](../syntax#string)

**Returned value**

- Value between 0 and 1 representing the likelihood of the `needle` being in the `haystack`. [Float32](../data-types/float.md/#float32-float64)

The bigger the result is, the more likely `needle` is in the `haystack`.

**Example**

Query:

```sql
SELECT ngramSearchCaseInsensitiveUTF8('абвГДЕёжз', 'АбвгдЕЁжз');
```

Result:

```response
0.57142854
```
## countSubstrings

Returns how often a substring `needle` occurs in a string `haystack`. Occurrences are counted without overlapping.

Functions [`countSubstringsCaseInsensitive`](#countsubstringscaseinsensitive) and [`countSubstringsCaseInsensitiveUTF8`](#countsubstringscaseinsensitiveutf8) provide case-insensitive and case-insensitive + UTF-8 variants of this function respectively.

**Syntax**

``` sql
countSubstrings(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

``` sql
SELECT countSubstrings('aaaa', 'aa');
```

Result:

``` text
┌─countSubstrings('aaaa', 'aa')─┐
│                             2 │
└───────────────────────────────┘
```

Example with `start_pos` argument:

```sql
SELECT countSubstrings('abc___abc', 'abc', 4);
```

Result:

``` text
┌─countSubstrings('abc___abc', 'abc', 4)─┐
│                                      1 │
└────────────────────────────────────────┘
```
## countSubstringsCaseInsensitive

Returns how often a substring `needle` occurs in a string `haystack`. Ignores case.

**Syntax**

``` sql
countSubstringsCaseInsensitive(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countSubstringsCaseInsensitive('AAAA', 'aa');
```

Result:

``` text
┌─countSubstringsCaseInsensitive('AAAA', 'aa')─┐
│                                            2 │
└──────────────────────────────────────────────┘
```

Example with `start_pos` argument:

Query:

```sql
SELECT countSubstringsCaseInsensitive('abc___ABC___abc', 'abc', 4);
```

Result:

``` text
┌─countSubstringsCaseInsensitive('abc___ABC___abc', 'abc', 4)─┐
│                                                           2 │
└─────────────────────────────────────────────────────────────┘
```
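
To make the effect of case handling concrete, the case-sensitive and case-insensitive functions can be run on the same input. This is a minimal sketch; the column aliases are chosen only for this illustration:

```sql
-- Same haystack and needle, with and without case sensitivity.
SELECT
    countSubstrings('abc___ABC___abc', 'abc') AS case_sensitive,
    countSubstringsCaseInsensitive('abc___ABC___abc', 'abc') AS case_insensitive;
```

The case-sensitive count should be 2, because the uppercase `ABC` is skipped, while the case-insensitive count should be 3.
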
## countSubstringsCaseInsensitiveUTF8

Returns how often a substring `needle` occurs in a string `haystack`. Ignores case and assumes that `haystack` is a UTF-8 string.

**Syntax**

``` sql
countSubstringsCaseInsensitiveUTF8(haystack, needle[, start_pos])
```

**Arguments**

- `haystack` — UTF-8 string in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Substring to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `start_pos` – Position (1-based) in `haystack` at which the search starts. [UInt](../data-types/int-uint.md). Optional.

**Returned value**

- The number of occurrences. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА');
```

Result:

``` text
┌─countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА')─┐
│                                                                  4 │
└────────────────────────────────────────────────────────────────────┘
```

Example with `start_pos` argument:

Query:

``` sql
SELECT countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА', 13);
```

Result:

``` text
┌─countSubstringsCaseInsensitiveUTF8('ложка, кошка, картошка', 'КА', 13)─┐
│                                                                      2 │
└────────────────────────────────────────────────────────────────────────┘
```

## countMatches

Returns the number of regular expression matches for a `pattern` in a `haystack`.

**Syntax**

``` sql
countMatches(haystack, pattern)
```

**Arguments**

- `haystack` — The string to search in. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — The regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). [String](../data-types/string.md).

**Returned value**

- The number of matches. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countMatches('foobar.com', 'o+');
```

Result:

``` text
┌─countMatches('foobar.com', 'o+')─┐
│                                2 │
└──────────────────────────────────┘
```

Query:

``` sql
SELECT countMatches('aaaa', 'aa');
```

Result:

``` text
┌─countMatches('aaaa', 'aa')────┐
│                             2 │
└───────────────────────────────┘
```

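
As a further illustration, the sketch below counts digit runs and words in literal strings. The input strings and column aliases are made up for this example. Backslashes are doubled inside the SQL string literals, as in the other examples on this page.

``` sql
SELECT
    -- '\\d+' reaches the regex engine as \d+, so each run of digits is one match
    countMatches('2024-04-13 09:32:40', '\\d+') AS digit_runs,
    -- '\\w+' matches each run of word characters
    countMatches('the quick brown fox', '\\w+') AS words;
```

Matches are counted without overlap, so this should yield 6 for `digit_runs` and 4 for `words`.
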
## countMatchesCaseInsensitive

Like [`countMatches`](#countmatches), returns the number of regular expression matches for a `pattern` in a `haystack`, but ignores case when matching.

**Syntax**

``` sql
countMatchesCaseInsensitive(haystack, pattern)
```

**Arguments**

- `haystack` — The string to search in. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — The regular expression with [re2 syntax](https://github.com/google/re2/wiki/Syntax). [String](../data-types/string.md).

**Returned value**

- The number of matches. [UInt64](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT countMatchesCaseInsensitive('AAAA', 'aa');
```

Result:

``` text
┌─countMatchesCaseInsensitive('AAAA', 'aa')────┐
│                                            2 │
└──────────────────────────────────────────────┘
```

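
To make the difference from `countMatches` concrete, the following sketch runs the same input through both functions. The second column assumes that the `(?i)` inline flag from the re2 syntax is accepted in `pattern`. The column aliases are illustrative only.

``` sql
SELECT
    -- case-sensitive: 'aa' does not occur in 'AAAA'
    countMatches('AAAA', 'aa') AS case_sensitive,
    -- assumption: the re2 inline flag (?i) switches the pattern to case-insensitive matching
    countMatches('AAAA', '(?i)aa') AS inline_flag,
    countMatchesCaseInsensitive('AAAA', 'aa') AS case_insensitive;
```

Under that assumption, `case_sensitive` should be 0 and the other two columns should agree.
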
## regexpExtract

Extracts the first string in `haystack` that matches the regexp pattern and corresponds to the regex group index.

**Syntax**

``` sql
regexpExtract(haystack, pattern[, index])
```

Alias: `REGEXP_EXTRACT(haystack, pattern[, index])`.

**Arguments**

- `haystack` — String in which the regexp pattern is matched. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `pattern` — String, regexp expression, must be constant. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `index` – An integer greater than or equal to 0, with a default of 1. It selects which regex group to extract. [UInt or Int](../data-types/int-uint.md). Optional.

**Returned value**

- The part of the match that corresponds to the regex group selected by `index`. `pattern` may contain multiple regexp groups; an `index` of 0 selects the match of the entire regular expression. [String](../data-types/string.md).

**Examples**

Query:

``` sql
SELECT
    regexpExtract('100-200', '(\\d+)-(\\d+)', 1),
    regexpExtract('100-200', '(\\d+)-(\\d+)', 2),
    regexpExtract('100-200', '(\\d+)-(\\d+)', 0),
    regexpExtract('100-200', '(\\d+)-(\\d+)');
```

Result:

``` text
┌─regexpExtract('100-200', '(\\d+)-(\\d+)', 1)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)', 2)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)', 0)─┬─regexpExtract('100-200', '(\\d+)-(\\d+)')─┐
│ 100                                          │ 200                                          │ 100-200                                      │ 100                                       │
└──────────────────────────────────────────────┴──────────────────────────────────────────────┴──────────────────────────────────────────────┴───────────────────────────────────────────┘
```

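
The group index can also be used to split a structured string into its parts. The sketch below pulls the three components out of a date-like literal. The input string and the aliases `year_part`, `month_part` and `day_part` are hypothetical names chosen for this example.

``` sql
SELECT
    -- group 1: the four leading digits
    regexpExtract('2024-04-13', '(\\d{4})-(\\d{2})-(\\d{2})', 1) AS year_part,
    -- group 2: the two digits after the first dash
    regexpExtract('2024-04-13', '(\\d{4})-(\\d{2})-(\\d{2})', 2) AS month_part,
    -- group 3: the two digits after the second dash
    regexpExtract('2024-04-13', '(\\d{4})-(\\d{2})-(\\d{2})', 3) AS day_part;
```

Each column is returned as a string, so `month_part` should be `04` rather than the number 4.
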
## hasSubsequence

Returns 1 if `needle` is a subsequence of `haystack`, or 0 otherwise.
A subsequence of a string is a sequence that can be derived from the given string by deleting zero or more elements without changing the order of the remaining elements.

**Syntax**

``` sql
hasSubsequence(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if `needle` is a subsequence of `haystack`, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequence('garbage', 'arg');
```

Result:

``` text
┌─hasSubsequence('garbage', 'arg')─┐
│                                1 │
└──────────────────────────────────┘
```

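
Because the order of the remaining elements must be preserved, a needle whose characters all occur in the haystack is still rejected when they occur in the wrong order. A small sketch, with column aliases invented for this example:

``` sql
SELECT
    -- 'a', 'r', 'g' occur in this order in 'garbage'
    hasSubsequence('garbage', 'arg') AS in_order,
    -- 'b' and a later 'g' exist, but no 'r' follows that 'g'
    hasSubsequence('garbage', 'bgr') AS out_of_order;
```

This should return 1 for `in_order` and 0 for `out_of_order`.
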
## hasSubsequenceCaseInsensitive

Like [hasSubsequence](#hassubsequence) but searches case-insensitively.

**Syntax**

``` sql
hasSubsequenceCaseInsensitive(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if `needle` is a subsequence of `haystack`, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequenceCaseInsensitive('garbage', 'ARG');
```

Result:

``` text
┌─hasSubsequenceCaseInsensitive('garbage', 'ARG')─┐
│                                               1 │
└─────────────────────────────────────────────────┘
```

## hasSubsequenceUTF8

Like [hasSubsequence](#hassubsequence) but assumes `haystack` and `needle` are UTF-8 encoded strings.

**Syntax**

``` sql
hasSubsequenceUTF8(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if `needle` is a subsequence of `haystack`, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequenceUTF8('ClickHouse - столбцовая система управления базами данных', 'система');
```

Result:

``` text
┌─hasSubsequenceUTF8('ClickHouse - столбцовая система управления базами данных', 'система')─┐
│                                                                                         1 │
└───────────────────────────────────────────────────────────────────────────────────────────┘
```

## hasSubsequenceCaseInsensitiveUTF8

Like [hasSubsequenceUTF8](#hassubsequenceutf8) but searches case-insensitively.

**Syntax**

``` sql
hasSubsequenceCaseInsensitiveUTF8(haystack, needle)
```

**Arguments**

- `haystack` — String in which the search is performed. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).
- `needle` — Subsequence to be searched. UTF-8 encoded [String](../../sql-reference/syntax.md#syntax-string-literal).

**Returned value**

- 1, if `needle` is a subsequence of `haystack`, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Examples**

Query:

``` sql
SELECT hasSubsequenceCaseInsensitiveUTF8('ClickHouse - столбцовая система управления базами данных', 'СИСТЕМА');
```

Result:

``` text
┌─hasSubsequenceCaseInsensitiveUTF8('ClickHouse - столбцовая система управления базами данных', 'СИСТЕМА')─┐
│                                                                                                        1 │
└──────────────────────────────────────────────────────────────────────────────────────────────────────────┘
```

## hasToken

Returns 1 if a given token is present in a haystack, or 0 otherwise.

**Syntax**

```sql
hasToken(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by `tokenbf_v1` index specialization.

**Example**

Query:

```sql
SELECT hasToken('Hello World','Hello');
```

```response
1
```

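
The implementation note above mentions the `tokenbf_v1` index. A minimal sketch of how such an index might be declared and queried follows. The table `logs`, its columns, the index name and the three index parameters (filter size in bytes, number of hash functions, seed) are all arbitrary choices made for this example, not recommended values.

```sql
-- Hypothetical table with a token bloom filter index on the message column.
CREATE TABLE logs
(
    ts DateTime,
    message String,
    INDEX message_tokens message TYPE tokenbf_v1(8192, 3, 0) GRANULARITY 4
)
ENGINE = MergeTree
ORDER BY ts;

INSERT INTO logs VALUES
    ('2024-04-13 09:32:40', 'Hello World'),
    ('2024-04-13 09:32:41', 'HelloWorld');

-- Only the first row contains 'Hello' as a complete token.
SELECT message FROM logs WHERE hasToken(message, 'Hello');
```

The query result is the same with or without the index. The index can only help the server skip granules that cannot contain the token.
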
## hasTokenOrNull

Returns 1 if a given token is present, 0 if not present, and null if the token is ill-formed.

**Syntax**

```sql
hasTokenOrNull(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 if it is not present, and null if the token is ill-formed.

**Implementation details**

Token must be a constant string. Supported by `tokenbf_v1` index specialization.

**Example**

Where `hasToken` would throw an error for an ill-formed token, `hasTokenOrNull` returns `null` for an ill-formed token.

Query:

```sql
SELECT hasTokenOrNull('Hello World','Hello,World');
```

```response
null
```

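
If a plain 0/1 answer is wanted even for ill-formed tokens, the null can be replaced explicitly. The sketch below uses `ifNull` for that. Treating an ill-formed token as "not present" is an assumption made for this example, not behavior of `hasTokenOrNull` itself.

```sql
SELECT
    -- well-formed token: behaves like hasToken
    hasTokenOrNull('Hello World', 'Hello') AS well_formed,
    -- ill-formed token (contains a separator): null is mapped to 0 here
    ifNull(hasTokenOrNull('Hello World', 'Hello,World'), 0) AS ill_formed_as_zero;
```
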
## hasTokenCaseInsensitive

Returns 1 if a given token is present in a haystack, 0 otherwise. Ignores case.

**Syntax**

```sql
hasTokenCaseInsensitive(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 otherwise. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by `tokenbf_v1` index specialization.

**Example**

Query:

```sql
SELECT hasTokenCaseInsensitive('Hello World','hello');
```

```response
1
```

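
Since a token is a maximal run of alphanumeric characters, a match inside a longer run does not count. The following sketch contrasts the two cases. The input strings and aliases are invented for illustration.

```sql
SELECT
    -- 'World' is delimited by a space and '!', so it is a complete token
    hasTokenCaseInsensitive('Hello, World!', 'world') AS whole_token,
    -- 'Hello' is only a prefix of the single token 'HelloWorld'
    hasTokenCaseInsensitive('HelloWorld', 'hello') AS part_of_token;
```

This should return 1 for `whole_token` and 0 for `part_of_token`.
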
## hasTokenCaseInsensitiveOrNull

Returns 1 if a given token is present in a haystack, 0 otherwise. Ignores case and returns null if the token is ill-formed.

**Syntax**

```sql
hasTokenCaseInsensitiveOrNull(haystack, token)
```

**Parameters**

- `haystack`: String in which the search is performed. [String](../../sql-reference/syntax.md#syntax-string-literal).
- `token`: Maximal length substring between two non-alphanumeric ASCII characters (or boundaries of haystack).

**Returned value**

- 1, if the token is present in the haystack, 0 if the token is not present, and [`null`](../data-types/nullable.md) if the token is ill-formed. [UInt8](../data-types/int-uint.md).

**Implementation details**

Token must be a constant string. Supported by `tokenbf_v1` index specialization.

**Example**

Where `hasTokenCaseInsensitive` would throw an error for an ill-formed token, `hasTokenCaseInsensitiveOrNull` returns `null` for an ill-formed token.

Query:

```sql
SELECT hasTokenCaseInsensitiveOrNull('Hello World','hello,world');
```

```response
null
```