ClickHouse/docs/en/sql-reference/aggregate-functions/reference/topk.md

---
toc_priority: 108
---

# topK {#topk}

Returns an array of the approximately most frequent values in the specified column. The resulting array is sorted in descending order of approximate frequency of values (not by the values themselves).

Implements the [Filtered Space-Saving](http://www.l2f.inesc-id.pt/~fmmb/wiki/uploads/Work/misnis.ref0a.pdf) algorithm for analyzing TopK, based on the reduce-and-combine algorithm from [Parallel Space Saving](https://arxiv.org/pdf/1401.0702.pdf).

``` sql
topK(N)(column)
```

This function doesn’t provide a guaranteed result. In certain situations, errors might occur and it might return frequent values that aren’t the most frequent values.

We recommend using the `N < 10` value; performance is reduced with large `N` values. Maximum value of `N = 65536`.

**Arguments**

-   `N` – The number of elements to return.

If the parameter is omitted, default value 10 is used.

**Arguments**

-   `x` – The value to calculate frequency.

**Example**

Take the [OnTime](../../../getting-started/example-datasets/ontime.md) data set and select the three most frequently occurring values in the `AirlineID` column.

``` sql
SELECT topK(3)(AirlineID) AS res
FROM ontime
```

``` text
┌─res─────────────────┐
│ [19393,19790,19805] │
└─────────────────────┘
```
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
+								---
 								toc_priority: 108
 								---
 								# topK {#topk}
 								Returns an array of the approximately most frequent values in the specified column. The resulting array is sorted in descending order of approximate frequency of values (not by the values themselves).
 								Implements the [Filtered Space-Saving](http://www.l2f.inesc-id.pt/~fmmb/wiki/uploads/Work/misnis.ref0a.pdf) algorithm for analyzing TopK, based on the reduce-and-combine algorithm from [Parallel Space Saving](https://arxiv.org/pdf/1401.0702.pdf).
 								``` sql
 								topK(N)(column)
 								```
 								This function doesn’t provide a guaranteed result. In certain situations, errors might occur and it might return frequent values that aren’t the most frequent values.
 								We recommend using the `N < 10` value; performance is reduced with large `N` values. Maximum value of `N = 65536`.
-												Replacement `Parameters` to `Arguments` for aggregate functions

											
										
										
											2021-02-15 21:33:53 +00:00
+								**Arguments**
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
-												Edit and translate to Russian

Поправил шаблоны в английской и русской версиях.

											
										
										
											2021-03-13 18:18:45 +00:00
+								-   `N` – The number of elements to return.
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
 								If the parameter is omitted, default value 10 is used.
 								**Arguments**
-												Edit and translate to Russian

Поправил шаблоны в английской и русской версиях.

											
										
										
											2021-03-13 18:18:45 +00:00
+								-   `x` – The value to calculate frequency.
-												[docs] split aggregate function and system table references (#11742)

* prefer relative links from root

* wip

* split aggregate function reference

* split system tables
											
										
										
											2020-06-18 08:24:31 +00:00
 								**Example**
 								Take the [OnTime](../../../getting-started/example-datasets/ontime.md) data set and select the three most frequently occurring values in the `AirlineID` column.
 								``` sql
 								SELECT topK(3)(AirlineID) AS res
 								FROM ontime
 								```
 								``` text
 								┌─res─────────────────┐
 								│ [19393,19790,19805] │
 								└─────────────────────┘
 								```