---
slug: /en/sql-reference/table-functions/azureBlobStorage
sidebar_position: 10
sidebar_label: azureBlobStorage
keywords: [azure blob storage]
---

# azureBlobStorage Table Function

Provides a table-like interface to select/insert files in [Azure Blob Storage](https://azure.microsoft.com/en-us/products/storage/blobs). This table function is similar to the [s3 function](../../sql-reference/table-functions/s3.md).

**Syntax**

``` sql
azureBlobStorage(connection_string|storage_account_url, container_name, blobpath, [account_name, account_key, format, compression, structure])
```

**Arguments**

- `connection_string|storage_account_url` - Either a connection string, which includes the account name and key ([Create connection string](https://learn.microsoft.com/en-us/azure/storage/common/storage-configure-connection-string?toc=%2Fazure%2Fstorage%2Fblobs%2Ftoc.json&bc=%2Fazure%2Fstorage%2Fblobs%2Fbreadcrumb%2Ftoc.json#configure-a-connection-string-for-an-azure-storage-account)), or the storage account URL, in which case the account name and key are passed as separate parameters (see `account_name` and `account_key`).
- `container_name` - Container name
- `blobpath` - File path. Supports the following wildcards in read-only mode: `*`, `**`, `?`, `{abc,def}` and `{N..M}`, where `N` and `M` are numbers and `'abc'` and `'def'` are strings.
- `account_name` - Account name. Can be specified here if `storage_account_url` is used.
- `account_key` - Account key. Can be specified here if `storage_account_url` is used.
- `format` — The [format](../../interfaces/formats.md#formats) of the file.
- `compression` - Supported values: `none`, `gzip/gz`, `brotli/br`, `xz/LZMA`, `zstd/zst`. By default (the same as `auto`), compression is detected from the file extension.
- `structure` — Structure of the table. Format `'column1_name column1_type, column2_name column2_type, ...'`.

**Returned value**

A table with the specified structure for reading or writing data in the specified file.

**Examples**

Write data into Azure Blob Storage:

```sql
INSERT INTO TABLE FUNCTION azureBlobStorage('http://azurite1:10000/devstoreaccount1',
    'test_container', 'test_{_partition_id}.csv', 'devstoreaccount1', 'Eby8vdM02xNOcqFlqUwJPLlmEtlCDXJ1OUzFT50uSRZ6IFsuFq2UVErCz4I6tq/K1SZFPTOtr/KBHBeksoGMGw==',
    'CSV', 'auto', 'column1 UInt32, column2 UInt32, column3 UInt32') PARTITION BY column3 VALUES (1, 2, 3), (3, 2, 1), (78, 43, 3);
```

The data can then be read back:

```sql
SELECT * FROM azureBlobStorage('http://azurite1:10000/devstoreaccount1',
    'test_container', 'test_1.csv', 'devstoreaccount1', 'Eby8vdM02xNOcqFlqUwJPLlmEtlCDXJ1OUzFT50uSRZ6IFsuFq2UVErCz4I6tq/K1SZFPTOtr/KBHBeksoGMGw==',
    'CSV', 'auto', 'column1 UInt32, column2 UInt32, column3 UInt32');
```

```response
┌───column1─┬────column2─┬───column3─┐
│     3     │       2    │      1    │
└───────────┴────────────┴───────────┘
```

Or using a connection string:

```sql
SELECT count(*) FROM azureBlobStorage('DefaultEndpointsProtocol=https;AccountName=devstoreaccount1;AccountKey=Eby8vdM02xNOcqFlqUwJPLlmEtlCDXJ1OUzFT50uSRZ6IFsuFq2UVErCz4I6tq/K1SZFPTOtr/KBHBeksoGMGw==;EndPointSuffix=core.windows.net',
    'test_container', 'test_3.csv', 'CSV', 'auto' , 'column1 UInt32, column2 UInt32, column3 UInt32');
```

``` text
┌─count()─┐
│      2  │
└─────────┘
```
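
The wildcards listed for `blobpath` can be used to read several blobs in one query. The following is a minimal sketch, assuming the same Azurite endpoint, container, and credentials as the examples above and that the `test_*.csv` blobs written by the `INSERT` example exist:

```sql
-- Sketch only: reuses the Azurite test endpoint and credentials from the examples above.
-- The `*` wildcard matches every blob in the container whose name fits test_<anything>.csv.
SELECT count(*) FROM azureBlobStorage('http://azurite1:10000/devstoreaccount1',
    'test_container', 'test_*.csv', 'devstoreaccount1', 'Eby8vdM02xNOcqFlqUwJPLlmEtlCDXJ1OUzFT50uSRZ6IFsuFq2UVErCz4I6tq/K1SZFPTOtr/KBHBeksoGMGw==',
    'CSV', 'auto', 'column1 UInt32, column2 UInt32, column3 UInt32');
```

Wildcards apply to reads only, so a path like this is not suitable as the target of an `INSERT`.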

## Virtual Columns {#virtual-columns}

- `_path` — Path to the file. Type: `LowCardinality(String)`.
- `_file` — Name of the file. Type: `LowCardinality(String)`.
- `_size` — Size of the file in bytes. Type: `Nullable(UInt64)`. If the file size is unknown, the value is `NULL`.
- `_time` — Last modified time of the file. Type: `Nullable(DateTime)`. If the time is unknown, the value is `NULL`.
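
Virtual columns are not part of the declared structure and have to be selected by name. The following is a minimal sketch, assuming the Azurite endpoint, container, and credentials used in the examples on this page:

```sql
-- Sketch only: endpoint, container, and credentials are the Azurite test values used elsewhere on this page.
-- Each returned row carries the path, name, size, and last-modified time of the blob it was read from.
SELECT _path, _file, _size, _time, column1, column2, column3
FROM azureBlobStorage('http://azurite1:10000/devstoreaccount1',
    'test_container', 'test_*.csv', 'devstoreaccount1', 'Eby8vdM02xNOcqFlqUwJPLlmEtlCDXJ1OUzFT50uSRZ6IFsuFq2UVErCz4I6tq/K1SZFPTOtr/KBHBeksoGMGw==',
    'CSV', 'auto', 'column1 UInt32, column2 UInt32, column3 UInt32');
```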

**See Also**

- [AzureBlobStorage Table Engine](/docs/en/engines/table-engines/integrations/azureBlobStorage.md)

## Hive-style partitioning {#hive-style-partitioning}

When the `use_hive_partitioning` setting is set to 1, ClickHouse detects Hive-style partitioning in the path (`/name=value/`) and allows partition columns to be used as virtual columns in the query. These virtual columns have the same names as in the partitioned path, but prefixed with `_`.

**Example**

Use a virtual column created by Hive-style partitioning:

``` sql
SET use_hive_partitioning = 1;
SELECT * FROM azureBlobStorage(config, storage_account_url='...', container='...', blob_path='http://data/path/date=*/country=*/code=*/*.parquet') WHERE _date > '2020-01-01' AND _country = 'Netherlands' AND _code = 42;
```