mirror of
https://github.com/ClickHouse/ClickHouse.git
synced 2024-11-17 13:13:36 +00:00
f2857ac6f2
Co-Authored-By: Ivan Blinkov <github@blinkov.ru>
HDFS
This engine provides integration with the Apache Hadoop ecosystem by allowing you to manage data on HDFS via ClickHouse. It is similar to the File and URL engines, but provides Hadoop-specific features.
Usage
ENGINE = HDFS(URI, format)
The format parameter specifies one of the available file formats. To perform SELECT queries, the format must be supported for input; to perform INSERT queries, it must be supported for output. The available formats are listed in the Formats section.
Example:
1. Set up the hdfs_engine_table table:
CREATE TABLE hdfs_engine_table (name String, value UInt32) ENGINE=HDFS('hdfs://hdfs1:9000/other_storage', 'TSV')
2. Query the data:
SELECT * FROM hdfs_engine_table LIMIT 2
┌─name─┬─value─┐
│ one │ 1 │
│ two │ 2 │
└──────┴───────┘
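The result above assumes that the file behind the table already contains these two rows. As a minimal sketch, and assuming the target path is writable and does not yet hold data, the rows could be written through the same table with an INSERT, which works here because TSV is supported for both output and input:

```sql
-- Write two rows through the HDFS-backed table defined in step 1.
-- Assumption: the HDFS path is writable and does not already contain data.
INSERT INTO hdfs_engine_table VALUES ('one', 1), ('two', 2);

-- Read the rows back; the TSV file is parsed on every SELECT.
SELECT name, value FROM hdfs_engine_table LIMIT 2;
```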
Implementation Details
- Reads and writes can be parallel.
- Not supported:
    - ALTER and SELECT...SAMPLE operations.
    - Indexes.
    - Replication.
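When a workload needs the features listed as unsupported, one possible approach is to copy the data into a local table whose engine provides them. The sketch below is only an illustration: the table name hdfs_local_copy, the MergeTree engine choice, and the sorting key are assumptions, not something this page prescribes.

```sql
-- Hypothetical local table; its name and sorting key are assumptions.
CREATE TABLE hdfs_local_copy (name String, value UInt32)
ENGINE = MergeTree()
ORDER BY name;

-- Copy the rows out of the HDFS-backed table from the example above.
INSERT INTO hdfs_local_copy SELECT name, value FROM hdfs_engine_table;

-- The local copy has a primary index and accepts ALTER.
ALTER TABLE hdfs_local_copy ADD COLUMN note String DEFAULT '';
SELECT name, value FROM hdfs_local_copy WHERE name = 'one';
```

The copy is a snapshot: later changes to the file in HDFS are not reflected in it until the INSERT ... SELECT is run again.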