ClickHouse/docs/en/sql-reference/aggregate-functions/reference/corrstable.md

Ignoring revisions in .git-blame-ignore-revs. Click here to bypass and see the normal blame view.

58 lines
1.5 KiB
Markdown
Raw Normal View History

2024-05-30 10:35:24 +00:00
---
slug: /en/sql-reference/aggregate-functions/reference/corrstable
sidebar_position: 119
2024-05-30 10:35:24 +00:00
---
# corrStable
2024-06-03 15:32:09 +00:00
Calculates the [Pearson correlation coefficient](https://en.wikipedia.org/wiki/Pearson_correlation_coefficient):
$$
\frac{\Sigma{(x - \bar{x})(y - \bar{y})}}{\sqrt{\Sigma{(x - \bar{x})^2} * \Sigma{(y - \bar{y})^2}}}
$$
Similar to the [`corr`](../reference/corr.md) function, but uses a numerically stable algorithm. As a result, `corrStable` is slower than `corr` but produces a more accurate result.
2024-05-30 10:35:24 +00:00
**Syntax**
```sql
corrStable(x, y)
```
**Arguments**
- `x` — first variable. [(U)Int*](../../data-types/int-uint.md), [Float*](../../data-types/float.md), [Decimal](../../data-types/decimal.md).
- `y` — second variable. [(U)Int*](../../data-types/int-uint.md), [Float*](../../data-types/float.md), [Decimal](../../data-types/decimal.md).
**Returned Value**
- The Pearson correlation coefficient. [Float64](../../data-types/float.md).
***Example**
Query:
```sql
DROP TABLE IF EXISTS series;
CREATE TABLE series
(
2024-06-03 15:32:09 +00:00
i UInt32,
x_value Float64,
y_value Float64
2024-05-30 10:35:24 +00:00
)
2024-06-03 15:32:09 +00:00
ENGINE = Memory;
INSERT INTO series(i, x_value, y_value) VALUES (1, 5.6, -4.4),(2, -9.6, 3),(3, -1.3, -4),(4, 5.3, 9.7),(5, 4.4, 0.037),(6, -8.6, -7.8),(7, 5.1, 9.3),(8, 7.9, -3.6),(9, -8.2, 0.62),(10, -3, 7.3);
2024-05-30 10:35:24 +00:00
```
```sql
SELECT corrStable(x_value, y_value)
2024-06-03 15:32:09 +00:00
FROM series;
2024-05-30 10:35:24 +00:00
```
Result:
```response
┌─corrStable(x_value, y_value)─┐
│ 0.17302657554532558 │
└──────────────────────────────┘
```