ClickHouse/icebergCluster.md at 7af8b25490840e1bbe2e2512840615420d10dfa0

mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-12-13 18:02:24 +00:00

Mikhail Artemenko 4ccebd9a24 fix syntax for iceberg in docs

2024-11-20 11:15:39 +00:00

2.0 KiB

Raw Blame History

slug	sidebar_position	sidebar_label	title
/en/sql-reference/table-functions/icebergCluster	91	icebergCluster	icebergCluster Table Function

This is an extension to the iceberg table function.

Allows processing files from Apache Iceberg in parallel from many nodes in a specified cluster. On initiator it creates a connection to all nodes in the cluster and dispatches each file dynamically. On the worker node it asks the initiator about the next task to process and processes it. This is repeated until all tasks are finished.

Syntax

icebergS3Cluster(cluster_name, url [, NOSIGN | access_key_id, secret_access_key, [session_token]] [,format] [,compression_method])
icebergS3Cluster(cluster_name, named_collection[, option=value [,..]])

icebergAzureCluster(cluster_name, connection_string|storage_account_url, container_name, blobpath, [,account_name], [,account_key] [,format] [,compression_method])
icebergAzureCluster(cluster_name, named_collection[, option=value [,..]])

icebergHDFSCluster(cluster_name, path_to_table, [,format] [,compression_method])
icebergHDFSCluster(cluster_name, named_collection[, option=value [,..]])

Arguments

cluster_name — Name of a cluster that is used to build a set of addresses and connection parameters to remote and local servers.
Description of all other arguments coincides with description of arguments in equivalent iceberg table function.

Returned value

A table with the specified structure for reading data from cluster in the specified Iceberg table.

Examples

SELECT * FROM icebergS3Cluster('cluster_simple', 'http://test.s3.amazonaws.com/clickhouse-bucket/test_table', 'test', 'test')

See Also

2.0 KiB Raw Blame History

2.0 KiB

Raw Blame History