mirror of https://github.com/ClickHouse/ClickHouse.git synced 2024-11-17 13:13:36 +00:00

Olga Khvostikova f2857ac6f2

Update docs/en/operations/table_engines/hdfs.md

Co-Authored-By: Ivan Blinkov <github@blinkov.ru>

2019-09-04 16:26:03 +03:00

1.3 KiB

Raw Blame History

HDFS

This engine provides integration with Apache Hadoop ecosystem by allowing to manage data on HDFSvia ClickHouse. This engine is similar to the File and URL engines, but provides Hadoop-specific features.

Usage

ENGINE = HDFS(URI, format)

The format parameter specifies one of the available file formats. To perform SELECT queries, the format must be supported for input, and to perform INSERT queries -- for output. The available formats are listed in the Formats section.

Example:

1. Set up the HDFS_engine_table table:

CREATE TABLE hdfs_engine_table (name String, value UInt32) ENGINE=HDFS('hdfs://hdfs1:9000/other_storage', 'TSV')

2. Query the data:

SELECT * FROM hdfs_engine_table LIMIT 2

┌─name─┬─value─┐
│ one  │     1 │
│ two  │     2 │
└──────┴───────┘

Implementation Details

Reads and writes can be parallel
Not supported:
- ALTER and SELECT...SAMPLE operations.
- Indexes.
- Replication.

Original article

1.3 KiB Raw Blame History

HDFS

Usage

Implementation Details

1.3 KiB

Raw Blame History